managarten

till/managarten

Fork 0

mirror of https://github.com/Memo-2023/mana-monorepo.git synced 2026-05-20 04:53:38 +02:00

Commit graph

Author	SHA1	Message	Date
Till JS	233cf28cf2	fix(shared-llm): switch remote backend to non-streaming, drop credentials Diagnosis from the user's last test pinpointed the bug: mana-llm returns totalFrames=0 (no SSE frames at all) when called from the browser, but works perfectly when called via curl from the same host with the same payload. Two compounding causes: 1. credentials: 'include' in our fetch combined with mana-llm's CORS headers silently breaks the response body. This is the classic "Access-Control-Allow-Origin: * + Allow-Credentials: true" mismatch — browsers reject the response per spec but report it as a 0-byte success rather than an error. 2. Streaming over CORS adds a second layer of fragility. Even if credentials weren't an issue, the browser fetch API's response body for SSE under CORS depends on a specific combination of server headers we evidently don't have. Fix: drop both the streaming AND the credentials. - stream: false in the request body. Single JSON response per call, much friendlier to the browser fetch API. - No `credentials` field at all (default 'same-origin' for cross- origin requests = don't send cookies). mana-llm's API key middleware accepts anonymous requests, so we don't need to send any auth context. - Parse the response as `await res.json()` instead of streaming SSE chunks. Pull `choice.message.content` (or fall back to `choice.text` for legacy completions API responses). - Backwards-compatibility shim for `req.onToken`: if a caller registered a token callback (legacy chat-style streaming UX), fire it ONCE with the full content at the end. The current orchestrator + queue model never consumes per-token streams for remote tiers, so this is a degraded-but-equivalent path. The playground module uses its own client and isn't affected. Verified manually with curl: $ curl -X POST https://llm.mana.how/v1/chat/completions \ -H 'Content-Type: application/json' \ -d '{"model":"gemma3:4b","messages":[{"role":"user","content":"Hi"}],"max_tokens":50,"stream":false}' → returns clean JSON with `choices[0].message.content` populated. Same call with `stream: true` from the same host also works (full SSE frames come back). The bug really is browser+credentials specific, not a service bug. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 14:07:06 +02:00
Till JS	0450c86527	fix(shared-llm): SSE shape diagnostics + simpler title prompt + fragment detection User test on the mana-server tier showed Ollama gemma3:4b returning LITERALLY empty content for the title task, which is much weirder than the small browser model misbehaving. Three layered fixes plus diagnostics that will tell us what's actually happening over the wire next time. 1. remote.ts: SSE diagnostics + liberal field shape The mana-llm /v1/chat/completions endpoint claims OpenAI compatibility, but different upstream providers (Ollama, OpenAI, Gemini) wrap their token text in different field paths inside the SSE delta. Be liberal in what we accept: - choice.delta.content (canonical OpenAI) - choice.delta.text (some Ollama-compat shims) - choice.message.content (non-streaming response embedded in stream) - choice.text (legacy completion API) Plus: count totalFrames + dataFrames + capture firstFrameRaw + firstFrameParsed during the stream. When `collected` is empty at the end of the stream, dump all of that to console.warn so the next test session shows us exactly what mana-llm is sending. This is the only reliable way to debug "empty completion" without a network sniffer in the user's browser. 2. generate-title.ts: drop few-shot, use simple system+user prompt The previous few-shot prompt with three `Aufnahme: "..."\nTitel: ...` examples was apparently too much for Ollama gemma3:4b on the mana-server tier — it returned literal "" for reasons we don't fully understand (chat-template confusion with the embedded quotes? multi-section format? some quirk of how mana-llm formats the messages for Ollama?). Either way, the failure mode is clear. Replace with a minimal two-message format: - system: "Du erzeugst einen kurzen Titel (3-5 Wörter)..." - user: <transcript> Same instruction, much simpler shape. Bumped maxTokens 24 → 32 to give the model breathing room. 3. generate-title.ts: rules fallback detects sentence fragments Even when the LLM fails and we fall through to runRules, the previous heuristic for medium-length transcripts (10-20 words) would extract the first 7 words verbatim — which for a typical "Eine kleine Testaufnahme um zu sehen ob alles funktioniert" memo produces "Eine kleine Testaufnahme, um zu sehen, ob" as the "title". That's a sentence fragment ending mid-thought, not a title. Worse than "Memo vom 9. April 2026". Add a "looks like a sentence fragment" heuristic: if the last word of the extracted slice is a German stop-word or article (und/oder/wenn/ob/zu/um/der/die/das/ein/...) the result is clearly mid-clause. In that case fall through to dateLabel() instead of writing the fragment. Stop-word list is curated to 30 entries — common conjunctions, articles, prepositions, auxiliaries. Not exhaustive but catches the typical "first 7 words of a German sentence" failure mode. After this commit lands, the next test will surface in the console EITHER: - the actual delta shape mana-llm is using (so we know if our parser is wrong or if the model is genuinely silent) - a real LLM-generated title (if the simpler prompt worked) - "Memo vom <date>" via the rules fallback (if the LLM still fails but the rules fragment detection caught the bad slice) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 13:12:13 +02:00
Till JS	56065c8537	fix(mana/web): unwrap $state proxy in workbench-scenes Dexie writes Adding an app to a workbench scene threw DataCloneError. scenesState is a $state array, so current.openApps was a Svelte 5 proxy and spreading it into a new array left proxy entries inside; IndexedDB's structured clone refuses to serialise those. Snapshot before handing the array to patchScene / createScene so Dexie sees plain objects. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 00:44:00 +02:00

Author

SHA1

Message

Date

Till JS

233cf28cf2

fix(shared-llm): switch remote backend to non-streaming, drop credentials

Diagnosis from the user's last test pinpointed the bug: mana-llm
returns totalFrames=0 (no SSE frames at all) when called from the
browser, but works perfectly when called via curl from the same host
with the same payload. Two compounding causes:

1. credentials: 'include' in our fetch combined with mana-llm's
CORS headers silently breaks the response body. This is the
classic "Access-Control-Allow-Origin: * + Allow-Credentials: true"
mismatch — browsers reject the response per spec but report it
as a 0-byte success rather than an error.

2. Streaming over CORS adds a second layer of fragility. Even if
credentials weren't an issue, the browser fetch API's response
body for SSE under CORS depends on a specific combination of
server headers we evidently don't have.

Fix: drop both the streaming AND the credentials.

- stream: false in the request body. Single JSON response per call,
much friendlier to the browser fetch API.
- No `credentials` field at all (default 'same-origin' for cross-
origin requests = don't send cookies). mana-llm's API key
middleware accepts anonymous requests, so we don't need to send
any auth context.
- Parse the response as `await res.json()` instead of streaming
SSE chunks. Pull `choice.message.content` (or fall back to
`choice.text` for legacy completions API responses).
- Backwards-compatibility shim for `req.onToken`: if a caller
registered a token callback (legacy chat-style streaming UX),
fire it ONCE with the full content at the end. The current
orchestrator + queue model never consumes per-token streams for
remote tiers, so this is a degraded-but-equivalent path. The
playground module uses its own client and isn't affected.

Verified manually with curl:

$ curl -X POST https://llm.mana.how/v1/chat/completions \
-H 'Content-Type: application/json' \
-d '{"model":"gemma3:4b","messages":[{"role":"user","content":"Hi"}],"max_tokens":50,"stream":false}'
→ returns clean JSON with `choices[0].message.content` populated.

Same call with `stream: true` from the same host also works (full
SSE frames come back). The bug really is browser+credentials
specific, not a service bug.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

2026-04-09 14:07:06 +02:00

Till JS

0450c86527

fix(shared-llm): SSE shape diagnostics + simpler title prompt + fragment detection

User test on the mana-server tier showed Ollama gemma3:4b returning
LITERALLY empty content for the title task, which is much weirder
than the small browser model misbehaving. Three layered fixes plus
diagnostics that will tell us what's actually happening over the
wire next time.

1. remote.ts: SSE diagnostics + liberal field shape

   The mana-llm /v1/chat/completions endpoint claims OpenAI
   compatibility, but different upstream providers (Ollama, OpenAI,
   Gemini) wrap their token text in different field paths inside
   the SSE delta. Be liberal in what we accept:
     - choice.delta.content   (canonical OpenAI)
     - choice.delta.text      (some Ollama-compat shims)
     - choice.message.content (non-streaming response embedded in stream)
     - choice.text            (legacy completion API)

   Plus: count totalFrames + dataFrames + capture firstFrameRaw +
   firstFrameParsed during the stream. When `collected` is empty at
   the end of the stream, dump all of that to console.warn so the
   next test session shows us exactly what mana-llm is sending. This
   is the only reliable way to debug "empty completion" without a
   network sniffer in the user's browser.

2. generate-title.ts: drop few-shot, use simple system+user prompt

   The previous few-shot prompt with three `Aufnahme: "..."\nTitel: ...`
   examples was apparently too much for Ollama gemma3:4b on the
   mana-server tier — it returned literal "" for reasons we don't
   fully understand (chat-template confusion with the embedded
   quotes? multi-section format? some quirk of how mana-llm formats
   the messages for Ollama?). Either way, the failure mode is clear.

   Replace with a minimal two-message format:
     - system: "Du erzeugst einen kurzen Titel (3-5 Wörter)..."
     - user: <transcript>
   Same instruction, much simpler shape. Bumped maxTokens 24 → 32
   to give the model breathing room.

3. generate-title.ts: rules fallback detects sentence fragments

   Even when the LLM fails and we fall through to runRules, the
   previous heuristic for medium-length transcripts (10-20 words)
   would extract the first 7 words verbatim — which for a typical
   "Eine kleine Testaufnahme um zu sehen ob alles funktioniert" memo
   produces "Eine kleine Testaufnahme, um zu sehen, ob" as the
   "title". That's a sentence fragment ending mid-thought, not a
   title. Worse than "Memo vom 9. April 2026".

   Add a "looks like a sentence fragment" heuristic: if the last
   word of the extracted slice is a German stop-word or article
   (und/oder/wenn/ob/zu/um/der/die/das/ein/...) the result is
   clearly mid-clause. In that case fall through to dateLabel()
   instead of writing the fragment.

   Stop-word list is curated to 30 entries — common conjunctions,
   articles, prepositions, auxiliaries. Not exhaustive but catches
   the typical "first 7 words of a German sentence" failure mode.

After this commit lands, the next test will surface in the console
EITHER:
  - the actual delta shape mana-llm is using (so we know if our
    parser is wrong or if the model is genuinely silent)
  - a real LLM-generated title (if the simpler prompt worked)
  - "Memo vom <date>" via the rules fallback (if the LLM still
    fails but the rules fragment detection caught the bad slice)

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

2026-04-09 13:12:13 +02:00

Till JS

56065c8537

fix(mana/web): unwrap $state proxy in workbench-scenes Dexie writes

Adding an app to a workbench scene threw DataCloneError. scenesState
is a $state array, so current.openApps was a Svelte 5 proxy and
spreading it into a new array left proxy entries inside; IndexedDB's
structured clone refuses to serialise those. Snapshot before handing
the array to patchScene / createScene so Dexie sees plain objects.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

2026-04-09 00:44:00 +02:00

3 commits