Add packages/local-llm/CLAUDE.md as the package-level reference for
browser-local LLM inference. The package went through a non-trivial
engine swap from WebLLM/Qwen to transformers.js/Gemma 4 E2B on
2026-04-08, and the bring-up surfaced enough sharp edges that the
next person (or AI agent) touching this code will save real time
having them written down in one place rather than re-discovering
them error by error.
Captured topics:
- What the package is, what library/model is currently used, and
the deliberate engine-agnostic API surface that lets future swaps
stay contained to this package.
- Why we chose transformers.js + Gemma 4 over staying on WebLLM
(MLC compilation lag for new model architectures) and what the
return path looks like once MLC ships Gemma 4 builds.
- The CSP additions (script-src and connect-src sources) that
browser-local inference needs and WHY each one is required:
* script-src: 'wasm-unsafe-eval', cdn.jsdelivr.net, blob:
* connect-src: huggingface.co + *.huggingface.co + cdn-lfs-*,
*.hf.co + cas-bridge.xethub.hf.co (XET CDN),
cdn.jsdelivr.net (for the WASM preload fetch)
Including the subtle "jsDelivr is needed in BOTH script-src and
connect-src" trap that produces identical-looking error messages
for two distinct underlying causes.
- The Vite SSR module-cache gotcha: CSP additions made in
packages/shared-utils/security-headers.ts do NOT hot-reload across
the workspace package boundary, while additions made directly in
apps/mana/apps/web/src/hooks.server.ts do. Includes the diagnostic
pattern (compare which additions show up in the next CSP error
vs which don't) and the workaround (move them into hooks.server.ts
via setSecurityHeaders options).
- The two-step tokenization pattern that's mandatory for
Gemma4Processor: apply_chat_template(tokenize:false) → string, then
processor.tokenizer(text, return_tensors:'pt'). The collapsed
apply_chat_template(return_dict:true) path looks shorter but
produces a malformed input shape and crashes model.generate() deep
inside the forward pass with "Cannot read properties of null
(reading 'dims')" — opaque from the call site.
- The transformers.js v4 quirk that model.generate() returns null
(not a tensor) when a TextStreamer is attached. The streamer is
the only stable text channel; the engine always attaches one and
uses the streamer's collected text as the canonical output, with
a chars/4 fallback for token counts.
- API surface (Svelte 5 example), how to add a new model to the
registry, deploy notes (no base image rebuild needed for local-llm
changes alone, but IS needed if shared-utils CSP defaults change),
browser cache semantics, and hard browser support requirements
(WebGPU, ~1.5–2 GB VRAM for E2B q4f16, no CPU/WASM fallback).
Also link to the new doc from the root CLAUDE.md Shared Packages
table so people land on it from the standard discovery path.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
The previous attempt to fix the "Cannot read properties of null
(reading 'dims')" chat error was incomplete: I only stopped passing
the bogus return_tensor:'pt' option to apply_chat_template. The
underlying issue was that apply_chat_template's all-in-one mode
(return_dict:true) does not produce a proper Tensor-backed
{ input_ids, attention_mask } pair for multimodal-capable processors
like Gemma4Processor — it returns a shape that has no .dims on
input_ids, so model.generate() crashes deep inside the forward pass
the moment it tries to read the sequence length.
Switch to the documented two-step pattern from the Gemma 4 model
card: call apply_chat_template with tokenize:false to get the
formatted prompt as a plain string, then run that string through
processor.tokenizer with return_tensors:'pt' to get a proper Tensor
pair. The tokenizer's return_tensors option is the *Python*
convention and IS supported by transformers.js's Tokenizer class
(the API name collision between apply_chat_template's return_tensor
boolean and Tokenizer's return_tensors string is one of those nasty
spots where the JS port intentionally diverges from Python).
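The two-step pattern can be sketched with a stub standing in for the real Gemma4Processor (option names are taken from this commit's description; the actual @huggingface/transformers signatures may carry more options):

```typescript
// Hedged sketch of the two-step tokenization. A ProcessorLike stub stands
// in for Gemma4Processor; only the call shape matters here.
type Message = { role: "system" | "user" | "assistant"; content: string };

interface ProcessorLike {
  apply_chat_template(
    msgs: Message[],
    opts: { tokenize: boolean; add_generation_prompt: boolean },
  ): string;
  tokenizer(
    text: string,
    opts: { return_tensors: string },
  ): { input_ids: { dims: number[] } };
}

function buildInputs(processor: ProcessorLike, messages: Message[]) {
  // Step 1: template only. tokenize: false yields the formatted prompt as
  // a plain string, with no tensor shapes involved yet.
  const text = processor.apply_chat_template(messages, {
    tokenize: false,
    add_generation_prompt: true,
  });
  // Step 2: the tokenizer call with return_tensors: 'pt' (the Python-style
  // option name the JS Tokenizer supports) produces Tensor-backed inputs
  // whose input_ids actually carry .dims.
  return processor.tokenizer(text, { return_tensors: "pt" });
}
```

The collapsed `apply_chat_template(return_dict: true)` call skips step 2 entirely, which is exactly where the missing `.dims` comes from.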
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
First end-to-end Gemma 4 inference attempt threw "Cannot read
properties of null (reading 'dims')" the moment a chat message was
sent. Two bugs piled on top of each other:
1. apply_chat_template() was being called with `return_tensor: 'pt'`,
which is the Python `transformers` convention. transformers.js's
equivalent option is just a boolean (the default), and the string
'pt' is unrecognized — older versions silently ignored it, but the
v4 code path now produces a less predictable input shape when it
sees the unknown value. Drop it.
2. model.generate() in transformers.js v4 returns null (not a tensor)
when a streamer is attached. The previous engine code only attached
a streamer if the caller passed an `onToken` callback, then
unconditionally tried to slice the tensor return for token counting
— which crashed because the chat tab DOES pass onToken for live
streaming. The streamer collected the text fine, but generate()
returned null and our tensor read blew up.
Restructure so the streamer is always attached and is the canonical
text channel. The tensor return is now only used for token counting
when present, and falls back to a chars/4 estimate when it isn't, so
the /llm-test UI still shows roughly meaningful prompt/completion
counts on either v3 (returns tensor) or v4 (returns null with
streamer). The user-facing GenerateResult.content now always comes
from the streamer's accumulated string instead of decoding the
tensor's sliced suffix, which is more robust across versions.
Also wrap the model.generate() call in try/catch so that versions
of transformers.js that throw at end-of-streaming (after the
streamer has already delivered all tokens) don't lose the answer.
We only re-throw if the streamer collected nothing.
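The restructured result handling reduces to a small pure function; an illustrative sketch (the dims-bearing object stands in for a real transformers.js output tensor, and GenerateResult is simplified):

```typescript
// The streamer's accumulated text is always the content; the tensor, when
// present, only refines the completion-token count.
interface GenerateResult { content: string; completionTokens: number }

function collectResult(
  streamedText: string,                     // from the always-attached streamer
  outputTensor: { dims: number[] } | null,  // v3: tensor, v4 + streamer: null
  promptTokenCount: number,
): GenerateResult {
  const completionTokens = outputTensor
    ? outputTensor.dims[1] - promptTokenCount // exact when a tensor came back
    : Math.ceil(streamedText.length / 4);     // chars/4 estimate otherwise
  return { content: streamedText, completionTokens };
}
```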
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
After jsDelivr was allowlisted, transformers.js progressed one step
further: it successfully fetched the ort-wasm-simd-threaded.asyncify.mjs
loader, wrapped it in a Blob via `URL.createObjectURL(new Blob([...]))`,
and tried to instantiate the resulting blob: URL as a multi-threaded
Web Worker. The blob:
URL scheme is treated as its own CSP source by browsers, so the
existing script-src — which only allows 'self', specific HTTPS hosts,
and 'wasm-unsafe-eval' — blocked it.
Add `blob:` to the mana-web scriptSrc list. The blob: scheme always
inherits the document origin (you can't `URL.createObjectURL` a Blob
from another origin), so this allowlists nothing more than our own
runtime-generated worker scripts. It does NOT loosen the protection
against remote-script injection.
Worth knowing for future debugging: when transformers.js or any
WebGPU/onnxruntime-web stack hits "Failed to fetch dynamically
imported module: blob:..." after a successful dynamic import from
a CDN, the next CSP layer is always blob: in script-src.
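A minimal reproduction of the load pattern that trips this CSP check (the worker body is a placeholder, not onnxruntime-web's actual source):

```typescript
// What onnxruntime-web effectively does for its threaded worker: turn a
// script string into a blob: URL. In the browser, the follow-up
// `new Worker(blobUrl)` is the step that script-src must allow via the
// blob: scheme.
const workerSource = `self.onmessage = (e) => self.postMessage(e.data);`;
const blobUrl = URL.createObjectURL(
  new Blob([workerSource], { type: "text/javascript" }),
);
// CSP matches the blob: scheme, not a host, and the Blob always belongs
// to the creating document's origin.
```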
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
The previous two attempts at allowlisting cdn.jsdelivr.net for
transformers.js's onnxruntime-web loader landed in shared-utils
security-headers.ts. The actual file change was correct (verified by
grep), the commits got pushed, the live security-headers.ts on disk
had the additions — but Vite's SSR module cache for cross-workspace-
package imports kept serving the OLD compiled shared-utils to
hooks.server.ts. Net effect: edits to hooks.server.ts hot-reloaded
fine (proven by the *.hf.co connect-src additions showing up
immediately) while edits to shared-utils/security-headers.ts did not.
A dev server restart should clear it but I'd rather not depend on
manual intervention every time we touch the shared CSP.
Move the jsdelivr allowlist out of the shared default and into
mana-web's hooks.server.ts via the existing scriptSrc + connectSrc
options. hooks.server.ts is in the SvelteKit app's own source tree so
it HMRs reliably, no SSR cache to fight. As a bonus this is also
architecturally cleaner: cdn.jsdelivr.net is only needed by mana-web
because mana-web is the only Mana app that bundles @mana/local-llm —
other apps get a slightly tighter CSP for free.
The pattern to remember: changes to packages/shared-utils that affect
SSR (response headers, server hooks) require either a dev server
restart OR a manual `rm -rf apps/.../node_modules/.vite` to take
effect. Client-side changes hot-reload fine.
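The hooks.server.ts shape this describes, sketched with a self-contained stand-in for setSecurityHeaders (the real shared-utils implementation sets many more headers and defaults; only the scriptSrc/connectSrc option names are taken from the text):

```typescript
// Stand-in for the shared-utils helper, just to show where the per-app
// additions plug in. Not the real implementation.
function setSecurityHeaders(
  headers: Headers,
  opts: { scriptSrc?: string[]; connectSrc?: string[] } = {},
) {
  const scriptSrc = ["'self'", "'wasm-unsafe-eval'", ...(opts.scriptSrc ?? [])];
  const connectSrc = ["'self'", ...(opts.connectSrc ?? [])];
  headers.set(
    "Content-Security-Policy",
    `script-src ${scriptSrc.join(" ")}; connect-src ${connectSrc.join(" ")}`,
  );
}

// mana-web keeps the jsDelivr entries in its own source tree so they HMR:
const headers = new Headers();
setSecurityHeaders(headers, {
  scriptSrc: ["https://cdn.jsdelivr.net", "blob:"],
  connectSrc: ["https://cdn.jsdelivr.net"],
});
```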
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Two QuickActionsWidget files lived in parallel under different widget
systems — `dashboard/widgets/` (the user-customizable dashboard, i18n
keys, 3 actions: credits/feedback/profile) and `core/widgets/` (the
mana home screen, hardcoded German strings, 5 actions: todo/calendar/
contacts/context/times). The two rendered essentially the same shape:
optional emoji-prefixed title + a list of rounded-card links, each with
icon + label + description. Only the data and a slightly different
padding/icon sizing differed.
Extract <QuickActionsList> in $lib/components that takes the actions
array directly (consumers resolve i18n before passing in). Both widget
files become thin wrappers — the dashboard one resolves $_(...) keys
and passes the result, the core one passes its hardcoded data with
`compact` set.
LOC: 110 → 102 across the 3 files (-8 net, plus the shared 70-LOC
molecule). Small numerically, but the bigger win is that future
changes to the link layout (hover state, padding, icon style) happen
once instead of twice — and the two widget files no longer accidentally
drift in sizing/spacing.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
The earlier fix added cdn.jsdelivr.net to script-src so the dynamic
import() of onnxruntime-web's loader .mjs would resolve. But that's
only half the story: transformers.js also issues plain fetch() calls
to PRE-LOAD the .wasm binary and the .mjs factory before the backend
selection code path is even reached. fetch() is governed by
connect-src, not script-src, so the wasm preload was still blocked
with "Failed to pre-load WASM binary: TypeError: Failed to fetch".
The visible downstream symptom was identical to the previous bug
("no available backend found. ERR: [webgpu] TypeError: Failed to
fetch dynamically imported module"), which made it look like the
script-src fix hadn't taken effect.
Add cdn.jsdelivr.net to the default connect-src too, alongside the
existing script-src entry, with a comment explaining why both are
required.
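The trap condenses to one rule of thumb; a toy classifier just to make the mapping explicit (not a CSP implementation):

```typescript
// Which directive gates which of the two loads: import() of the loader
// .mjs executes script, so script-src applies; the plain fetch() that
// pre-loads the .wasm binary is a connection, so connect-src applies.
function governingDirective(
  load: "dynamic-import" | "wasm-preload-fetch",
): "script-src" | "connect-src" {
  return load === "dynamic-import" ? "script-src" : "connect-src";
}
```

Both loads hit the same CDN host, which is why the host must appear in both lists even though the two failures print near-identical errors.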
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Two issues hit while loading Gemma 4 E2B in /llm-test for the first
time on a local dev server.
1. CSP script-src blocked cdn.jsdelivr.net.
@huggingface/transformers v4 lazy-loads the onnxruntime-web WASM
loader shim via a runtime dynamic `import()` from
cdn.jsdelivr.net/npm/onnxruntime-web@... at backend selection time
(the package itself is bundled, but the WASM-loader is fetched on
demand so the static bundle stays small). With the previous CSP the
import was blocked and "no available backend found" was the only
downstream error. Allowlist cdn.jsdelivr.net in the shared CSP
script-src so every Mana web app picks this up automatically.
2. Loading bar oscillated wildly during the model download.
transformers.js downloads many shards in parallel (config.json,
tokenizer.json, generation_config.json, model.onnx, model_data.bin,
…) and fires the progress callback per file. The previous engine
code reported the latest event verbatim, so the bar bounced
between whichever file happened to be progressing fastest.
Replace per-file reporting with a Map<file, {loaded, total}>
accumulator and emit an aggregated total on every event. The
denominator can grow as new files are discovered (causing brief
small dips), but both numerator and denominator are individually
monotonic, so the aggregate is much smoother. Also include a
human-readable byte count and file count in the status text:
Downloading model (47%, 240 MB / 510 MB, 8 files)
Pin completed files to 100% on the 'done' event so the final
aggregate visibly hits 100% before the loading→ready transition.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Nine files (8 ListViews + todo's TaskList) reimplemented the same
context-menu state machinery character-for-character: a typed
$state object with visible/x/y/<itemKey>, a handleItemContextMenu
function that calls preventDefault and stuffs the click position
in, and a close handler that resets the entity field.
Extract `useItemContextMenu<T>()` in $lib/data/item-context-menu.svelte
that returns a reactive handle with `.state` (visible/x/y/target),
`.open(e, target)`, and `.close()`. Consumers derive their menu
items from `ctxMenu.state.target` and pass `ctxMenu.close` directly
to <ContextMenu onClose>.
Per file: ~10 LOC of state declaration + handler removed; consumer
items array switches from `ctxMenu.<entity>` to `ctxMenu.state.target`.
Across the 9 files this is ~−90 LOC of pure boilerplate; helper itself
is 50 LOC. Net small (~−40 LOC) but the boilerplate is gone and the
shape is one helper away from being adjustable globally.
Note: shared-ui already exports a `createContextMenuState` factory,
but it's a plain default-value object — not a Svelte 5 reactive
helper. This new wrapper composes with the existing `ContextMenuState<T>`
type from shared-ui rather than replacing it.
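A rune-free sketch of the handle's shape (the real module declares state with Svelte 5's $state so mutations are reactive; the logic is otherwise the same):

```typescript
// Shape of the returned handle: a state object plus open/close, so every
// consumer stops re-declaring the visible/x/y/target boilerplate.
interface CtxMenuState<T> { visible: boolean; x: number; y: number; target: T | null }

function useItemContextMenu<T>() {
  const state: CtxMenuState<T> = { visible: false, x: 0, y: 0, target: null };
  return {
    state,
    open(e: { clientX: number; clientY: number; preventDefault(): void }, target: T) {
      e.preventDefault(); // suppress the browser's native context menu
      state.visible = true;
      state.x = e.clientX;
      state.y = e.clientY;
      state.target = target;
    },
    close() {
      state.visible = false;
      state.target = null; // reset the entity field, as the old per-file code did
    },
  };
}
```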
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
After the WebLLM → transformers.js migration, the first attempt to load
Gemma 4 E2B in /llm-test was blocked by CSP at the *weight shard*
download step (tokenizer.json got through fine — it lives on
huggingface.co directly). HF has rolled out a new XET-backed CDN for
large model files at cas-bridge.xethub.hf.co, served from
*.xethub.hf.co (the parent zone is hf.co, NOT huggingface.co — so our
existing wildcard `*.huggingface.co` did not cover it).
Open the broader hf.co wildcard (`https://*.hf.co`) so future XET host
rotations don't bite us, plus the explicit cas-bridge.xethub.hf.co
entry for older CSP-strict browsers that want narrower matches first.
The legacy huggingface.co + cdn-lfs.huggingface.co entries stay in place
for repo metadata and any model still on the old LFS path.
Update the comment block above the CSP additions to reflect that the
package now uses transformers.js + ONNX shards rather than the old
WebLLM/MLC path, including a quick map of which HF domain serves what.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
End-to-end testing surfaced a 404 from the synth path. mana-llm
(services/mana-llm/src/main.py) mounts the OpenAI-compatible API at
/v1/* — there's no /api prefix.
The first quick-depth e2e run only worked because the planner is
skipped on quick (it just uses the question itself), so llmJson never
fired; only llmStream did, and the streaming path also used the wrong
prefix but the test happened to land before this was caught.
The other apps/api modules (chat, guides, context, traces) all use the
wrong /api/v1/ path too — that's a separate, pre-existing bug to be
addressed in their own commits.
Verified by re-running a standard-depth research run end-to-end against
mana-llm pointed at the GPU server's ollama with gemma3:4b/12b: plan +
retrieve + extract + synth all succeed.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
NOTE: the previous commit 048184bef carried this commit message but
accidentally bundled an unrelated PickerOverlay refactor instead of
this script change (lint-staged stash interaction). This is the
actual fix.
Per-app web Dockerfiles do `FROM sveltekit-base:local` and do NOT
re-COPY packages/shared-* — those packages are baked into the base
image. So a change to packages/shared-utils, packages/shared-ui, etc.
only reaches the live web app if the base image is also rebuilt.
This bit us THREE times on 2026-04-08 alone:
1. CSP fix in shared-utils ('wasm-unsafe-eval') sat unused in
production for over an hour because every `build-app.sh mana-web`
reused the cached base layer with old shared-utils.
2. The BaseListView export in shared-ui after the ListView
consolidation refactor — mana-web's build failed because Rollup
couldn't resolve the new symbol from the stale base.
3. Same shape, different package, repeatedly during the Gemma 4
migration push.
The pattern is identical every time and the manual workaround
(`build-app.sh --base` first) is something you only think to run if
you already know how the layering works. Make the script catch it.
New `is_base_image_stale` helper compares the base image's `Created`
timestamp against the latest git commit touching paths the base image
actually depends on (packages/, docker/Dockerfile.sveltekit-base,
pnpm-lock.yaml). When building any *-web service, if the image is
stale or missing, the base is rebuilt automatically before the
per-app build kicks off, with the triggering commit's oneline
printed for transparency.
Date parsing handles macOS Docker's local-TZ-offset RFC3339 format
(`...+02:00`, not Z). We strip from char 19 onward and parse the
literal local clock time with BSD date (no -u). GNU date is the
fallback for Linux dev boxes. If parsing fails for any reason we
conservatively force a rebuild rather than risk shipping stale code.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Per-app web Dockerfiles do `FROM sveltekit-base:local` and do NOT re-COPY
packages/shared-* — those packages are baked into the base image. So a
change to packages/shared-utils, packages/shared-ui, etc. only reaches
the live web app if the base image is also rebuilt.
This bit us THREE times on 2026-04-08 alone:
1. CSP fix in shared-utils ('wasm-unsafe-eval') sat unused in production
for over an hour because every `build-app.sh mana-web` cheerfully
reused the cached base layer that still contained the old shared-utils.
2. Same problem with the BaseListView export in shared-ui after the
ListView consolidation refactor — mana-web's build failed because the
Rollup pass couldn't resolve the new symbol from the stale base.
3. Same shape, different package, repeatedly.
The pattern is identical every time and the manual workaround
(`build-app.sh --base` first) is something you only think to run if you
already know how the layering works. Make the script catch it.
New `is_base_image_stale` helper compares the base image's `Created`
timestamp against the latest git commit touching paths the base image
actually depends on:
- packages/ (all shared-* packages baked in)
- docker/Dockerfile.sveltekit-base
- pnpm-lock.yaml (transitive dep changes)
When building any *-web service, if the image is stale or missing, the
base is rebuilt automatically before the per-app build kicks off, with
the triggering commit's oneline printed for transparency.
Date parsing notes:
- macOS Docker emits the Created field with the local TZ offset
("...+02:00"), not Z. We strip the fractional + offset suffix and
parse the literal local clock time with BSD date (no -u), which is
what the original timestamp meant on this host. GNU date is the
fallback for Linux dev boxes and handles the full ISO directly.
- If parsing fails for any reason we conservatively force a rebuild
rather than risk shipping stale code.
Verified end-to-end against the live Mac Mini's current state earlier
today: image 55s newer than the last packages/ commit at the time →
"fresh, skip" (correct). When the next packages/ commit lands, the
script will see commit_epoch > image_epoch and trigger the base
rebuild automatically.
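The staleness rule itself, transcribed into TypeScript for clarity (the shell helper gets these epochs from `docker image inspect` and `git log`; the slice mirrors the strip-from-char-19 step, and all values here are hypothetical):

```typescript
// Rebuild when the newest commit touching base-image inputs is newer than
// the image, or when either timestamp cannot be determined.
function isBaseImageStale(
  imageCreatedEpoch: number | null,
  lastDepCommitEpoch: number | null,
): boolean {
  // Parsing failure on either side: conservatively force a rebuild rather
  // than risk shipping stale code.
  if (imageCreatedEpoch === null || lastDepCommitEpoch === null) return true;
  return lastDepCommitEpoch > imageCreatedEpoch;
}

// The macOS quirk: Docker emits e.g. "2026-04-08T14:03:27.123456789+02:00".
// Keeping only the first 19 chars leaves the literal local clock time,
// which is what BSD date (no -u) then parses.
const literalLocal = "2026-04-08T14:03:27.123456789+02:00".slice(0, 19);
```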
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Four ListViews (calendar, contacts, places, todo) reimplemented the
same drag-drop tag append logic, and four matching DetailViews
reimplemented the same "remove tag with undo toast" logic. Extract
both into pure helpers in $lib/data/tag-mutations.ts that take a
store-agnostic update function — works for the standard tagIds
modules and for todo's metadata.labelIds via tasksStore.updateLabels.
Side win: places/views/DetailView's removeTag had no undo toast
(every other module did). Consolidating fixes the inconsistency.
zitare is the outlier — its drag target is a Quote, but the tags
live on a (possibly-not-yet-existing) Favorite record. Stays custom.
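The store-agnostic contract can be sketched as follows (hypothetical signature; the real helpers in $lib/data/tag-mutations.ts also carry the remove-with-undo-toast path):

```typescript
// The consumer supplies whatever update function its store exposes, so the
// same helper serves the standard tagIds modules and todo's
// metadata.labelIds via tasksStore.updateLabels.
function appendTagOnDrop(
  currentTagIds: string[],
  droppedTagId: string,
  update: (next: string[]) => void,
): boolean {
  if (currentTagIds.includes(droppedTagId)) return false; // already tagged: no-op
  update([...currentTagIds, droppedTagId]);
  return true;
}
```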
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Replace the entire @mana/local-llm engine with a transformers.js-based
implementation backed by Google's Gemma 4 E2B (released 2026-04-02).
The external API of LocalLLMEngine — load(), generate(), prompt(),
extractJson(), classify(), onStatusChange(), isSupported() — is
preserved 1:1, so the /llm-test page, the playground module, and the
Svelte 5 reactive bindings in svelte.svelte.ts need no changes
beyond updating the default model key.
Why the engine swap: MLC has not yet published Gemma 4 builds for WebLLM
(and as of today still hasn't). The webml-community team and
HuggingFace's onnx-community already have Gemma 4 E2B running in
the browser via transformers.js + WebGPU, with a documented
Gemma4ForConditionalGeneration class shipped in @huggingface/transformers
v4.0.0. Going through the ONNX route gets us the latest Google model
six days after release instead of waiting on MLC compilation.
Trade-offs accepted (discussed before this commit):
- transformers.js is a more generic ONNX runtime, so per-token
throughput will be ~20-40% lower than WebLLM would deliver for the
same model size. For a 2B model on a modern WebGPU device that's
still well above interactive latency.
- The JS bundle gains ~2-3 MB (the ONNX runtime). Negligible compared
to the 500 MB model download.
- transformers.js v4 is brand new (released alongside Gemma 4) so the
Gemma4ForConditionalGeneration code path has very little battle
testing yet. The risk is partially offset by webml-community's
reference implementation.
What changed file by file:
- packages/local-llm/package.json: drop @mlc-ai/web-llm, add
@huggingface/transformers ^4.0.0; bump version 0.1.0 → 0.2.0; rewrite
description.
- packages/local-llm/src/types.ts: add `dtype` field to ModelConfig
('fp32' | 'fp16' | 'q8' | 'q4' | 'q4f16') so each model can request
the quantization that matches its uploaded ONNX shards.
- packages/local-llm/src/models.ts: replace the old Qwen 2.5 + Gemma 2
registry with a single `gemma-4-e2b` entry pointing at
onnx-community/gemma-4-E2B-it-ONNX with q4f16 quantization. Future
models can be added by appending entries — the /llm-test picker
reads MODELS dynamically and picks them up automatically.
- packages/local-llm/src/cache.ts: replace the WebLLM-specific
hasModelInCache helper with a generic Cache API probe that looks for
`https://huggingface.co/{model_id}/resolve/main/tokenizer.json` in
any open cache. tokenizer.json is small, downloaded first, and
always present, so its presence is a reliable proxy for "model has
been loaded before".
- packages/local-llm/src/engine.ts: full rewrite. Internally we now
hold a transformers.js model + processor pair (created via
AutoProcessor.from_pretrained + Gemma4ForConditionalGeneration.from_pretrained
with `device: 'webgpu'`), and translate our LoadingStatus union from
the library's `progress_callback` shape. generate() applies Gemma's
chat template via the processor, runs model.generate() with optional
TextStreamer for streaming, then slices the prompt tokens off the
output tensor to compute per-call usage. The convenience methods
(prompt, extractJson, classify) are unchanged because they only call
generate() under the hood.
- packages/local-llm/src/generate.ts and status.svelte.ts: deleted.
These were orphaned from a much earlier engine API (referenced
`getEngine()` / `subscribe()` / `LlmState` symbols that haven't
existed for a while) and were never re-exported from index.ts —
they only showed up because `tsc --noEmit` was crawling the src
tree. Their functionality lives in engine.ts + svelte.svelte.ts now.
- apps/mana/apps/web/package.json: swap the direct dep from
@mlc-ai/web-llm to @huggingface/transformers. This is the same
trick we used for the previous adapter-node externals warning —
having it as a direct dep makes adapter-node's Rollup pass treat
it as external automatically.
- apps/mana/apps/web/vite.config.ts: swap ssr.external entry from
@mlc-ai/web-llm to @huggingface/transformers. Add a comment
explaining the why so the next person doesn't wonder.
- apps/mana/apps/web/src/routes/(app)/llm-test/+page.svelte: change
the default selectedModel from 'qwen-2.5-1.5b' to 'gemma-4-e2b'.
All other model display strings come from the MODELS registry, so
this is the single hard-coded reference that needed updating.
- pnpm-lock.yaml: regenerated. Confirmed @mlc-ai/web-llm is gone (0
references) and @huggingface/transformers is in (4 references).
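The registry shape after this change, sketched from the description above (the actual ModelConfig in types.ts likely carries more fields, e.g. display metadata):

```typescript
// Single-entry registry; adding a model means appending an entry, and the
// /llm-test picker reads MODELS dynamically.
type Dtype = "fp32" | "fp16" | "q8" | "q4" | "q4f16";

interface ModelConfig {
  model_id: string; // HF repo hosting the ONNX shards
  dtype: Dtype;     // must match the quantization actually uploaded
  label: string;
}

const MODELS: Record<string, ModelConfig> = {
  "gemma-4-e2b": {
    model_id: "onnx-community/gemma-4-E2B-it-ONNX",
    dtype: "q4f16",
    label: "Gemma 4 E2B",
  },
};
```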
CSP: no header changes needed. We already opened connect-src for
huggingface.co + cdn-lfs.huggingface.co + raw.githubusercontent.com
when fixing the WebLLM blockers earlier today, and 'wasm-unsafe-eval'
is already in script-src — both transformers.js (ONNX runtime) and
WebLLM (MLC runtime) need that. If transformers.js spawns its
inference into a Web Worker via a blob URL we may need to add
`worker-src 'self' blob:` once we hit the first runtime test, but
the existing CSP should be enough for the synchronous path.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Smoke-testing /api/v1/research/start with mana-search down surfaced a
crash: drizzle's .values([]) throws "values() must be called with at
least one value", which dropped the run into status='error' even though
the failure is a perfectly normal "no results" case.
Two changes:
- Guard the sources insert behind enriched.length > 0
- If retrieval returns nothing, short-circuit straight to status='done'
with an explicit German "keine Quellen gefunden" summary instead of
feeding an empty corpus to the synthesiser
The same path also triggers when every sub-query genuinely returns no
results (very specific question, niche domain) so this isn't just an
ops-failure case.
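The decision logic, reduced to a synchronous sketch (the real code awaits a drizzle insert and sets run status in the DB; names here are simplified):

```typescript
// Key decision: empty retrieval is a normal "done", never an error, and
// drizzle's .values() is only reached with at least one row.
type Outcome = { status: "done" | "synthesising"; summary?: string };

function handleRetrievalResult(
  enriched: unknown[],
  insertSources: (rows: unknown[]) => void,
): Outcome {
  if (enriched.length === 0) {
    // Never call .values([]) — drizzle throws "values() must be called
    // with at least one value". Short-circuit with an explicit summary.
    return { status: "done", summary: "keine Quellen gefunden" };
  }
  insertSources(enriched); // safe: at least one row
  return { status: "synthesising" };
}
```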
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
End-to-end deep-research feature for the questions module: a fire-and-
forget orchestrator in apps/api that plans sub-queries with mana-llm,
retrieves sources via mana-search (with optional Readability extraction),
and streams a structured synthesis back to the web app over SSE.
Backend (apps/api/src/modules/research):
- schema.ts: pgSchema('research') with research_results + sources
- orchestrator.ts: three-phase pipeline (plan / retrieve / synthesise)
with depth-aware config (quick=1×, standard=3×, deep=6× sub-queries)
- pubsub.ts: in-process event bus, single-node, swappable for Redis
- routes.ts: POST /start (202, fire-and-forget), GET /:id/stream (SSE),
POST /start-sync (test only), GET /:id, GET /:id/sources
- Credit gating via @mana/shared-hono/credits — validate up-front,
consume best-effort on `done`. Failed runs cost nothing.
Helpers (apps/api/src/lib):
- llm.ts: llmJson() + llmStream() over mana-llm OpenAI-compat API
- search.ts: webSearch() + bulkExtract() over mana-search Go service
- responses.ts: shared errorResponse / listResponse / validationError
Schema deployment:
- drizzle.config.ts (research-scoped) + drizzle/research/0000_init.sql
hand-authored migration, deployable via psql -f or drizzle-kit push.
- drizzle-kit added as devDep with db:generate / db:push scripts.
Web client (apps/mana/apps/web/src/lib/api/research.ts):
- Typed start() / get() / listSources() / streamProgress(). The stream
uses fetch + ReadableStream (not EventSource) so we can attach the
JWT via Authorization header. Special-cases 402 for friendly toast.
- New PUBLIC_MANA_API_URL plumbing in hooks.server.ts + config.ts.
Module store (modules/questions/stores/answers.svelte.ts):
- New write-side store with createManual / startResearch / accept /
softDelete. startResearch creates an optimistic empty answer, opens
the SSE stream, debounces token deltas in 100ms batches into the
encrypted local row, and on `done` replaces the streamed text with
the parsed { summary, keyPoints, followUps } payload + citations
resolved against research.sources.id.
Citation rendering (modules/questions/components/AnswerCitations.svelte):
- Tokenises [n] markers in the answer body into clickable pills with
hover popovers showing title / host / snippet / external link.
- Lazy-loaded via a session-scoped source cache (stores/sources.svelte.ts)
that deduplicates concurrent fetches.
UI (routes/(app)/questions/[id]/+page.svelte):
- Recherche card with three-state button (start / cancel / re-run),
animated phase indicator, source counter.
- Confirmation dialog warning about web/LLM transmission since the
question itself is locally encrypted.
- Toasts for success / error / cancel via @mana/shared-ui/toast.
- Re-run flow soft-deletes prior research-driven answers but keeps
manual ones intact.
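The fetch + ReadableStream choice exists because EventSource cannot attach an Authorization header; the parsing side can be sketched independently of fetch as a minimal incremental `data:` line parser fed with arbitrary chunk boundaries (the real client also handles event names, JSON payloads, and errors):

```typescript
// Incremental SSE parser: buffer chunks, split on the blank line that
// terminates each event, and surface data: payloads.
function makeSseParser(onEvent: (data: string) => void) {
  let buffer = "";
  return (chunk: string) => {
    buffer += chunk;
    let idx: number;
    while ((idx = buffer.indexOf("\n\n")) !== -1) { // events end with a blank line
      const rawEvent = buffer.slice(0, idx);
      buffer = buffer.slice(idx + 2);
      for (const line of rawEvent.split("\n")) {
        if (line.startsWith("data:")) onEvent(line.slice(5).trimStart());
      }
    }
  };
}
```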
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Every module's inline-editable DetailView reimplemented the same
plumbing: liveQuery → optional decryptRecord → reset on id change
→ focused/confirmDelete state → save-on-blur → deleteWithUndo via
toastStore. Plus ~150 LOC of duplicated scoped CSS for the
.detail-view / .title-input / .properties / .prop-row / .section /
.danger-zone style track.
Extract two pieces:
- useDetailEntity (svelte runes module, $lib/data/detail-entity.svelte.ts)
handles the JS plumbing: liveQuery + optional decrypt + reset
on id change + focused/confirmDelete state + deleteWithUndo.
Supports a custom `loader` for cross-table joins (events+timeBlocks,
timeEntries+timeBlocks, tasks+timeBlocks).
- DetailViewShell ($lib/components/DetailViewShell.svelte)
handles the visual scaffold: outer flex column with scroll,
loading/not-found state, body snippet, danger zone with confirm
flow. Exports the shared field/property/section/meta classes as
:global so consumer snippets can use them without redefining.
Migrated 16 of the 18 DetailViews. Skipped:
- zitare: no DB entity (quotes from bundled @zitare/content),
no edit/delete flow.
- events: different page shape (centered max-width, edit/view
modes, eventId via direct prop instead of params, nested guest
list / RSVP sections).
Side wins:
- 6 encrypted modules (storage, uload, music, questions, calendar,
todo) now route their decrypt logic through one path instead of
six separate `liveQuery + decryptRecord({ ...raw })` variations.
- times/views/DetailView had the same latent type bug as the
ListView (reading .date / .startTime / .endTime / .source off
LocalTimeEntry, which doesn't define them). Now uses toTimeEntry()
via the loader option for the joined TimeEntry shape.
Net impact: ~3640 LOC removed across the 16 files (~49% reduction),
~510 LOC added for shell + helper. Net ~3130 LOC saved.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Every workbench-style module ListView reimplemented the same
liveQuery + filter + scroll-area + empty-state shell. Extract a
shared <BaseListView> in @mana/shared-ui (with toolbar/header/
listHeader/item/empty snippets) and migrate the 17 modules whose
list templates fit the workbench tailwind track.
While here:
- migrate DeckCard onto the existing (previously unused) shared
Card atom from shared-ui/atoms.
- fix a latent type bug in times/ListView: it was reading .date /
.startTime / .isRunning off LocalTimeEntry, which doesn't define
them. Now uses the proper joined TimeEntry via toTimeEntry() like
the rest of the times module.
Modules with their own scoped-CSS layout track (calendar, finance,
contacts, notes, places, todo, photos, habits, automations, dreams,
cycles) and outliers (calc, events, playground, zitare) are left
alone — migrating them would be a visual rewrite, not a structural
shell swap.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
While adding negative-path integration tests for the auth flow I
discovered that *neither* of the lockout primitives in
services/mana-auth/src/services/security.ts has actually been
working in production. Two independent silent failures that combined
into a "the lockout never triggers, ever" outcome:
1. recordAttempt() inserted into auth.login_attempts with explicit
`id = gen_random_uuid()`, but auth.login_attempts.id is a
`serial integer` column with `nextval('auth.login_attempts_id_seq')`
as default. The UUID-into-integer cast threw a type error every
single time, the bare `catch {}` swallowed it as "non-critical",
and not a single login attempt was ever persisted. Lockout's "5
failures in 15 min" check was running against an empty table.
2. checkLockout() built `attempted_at > ${new Date(...)}` via the
drizzle sql template, but postgres-js cannot bind a JS Date object
directly — it tries to byteLength() the parameter and crashes with
`Received an instance of Date`. Same anti-pattern: bare `catch`,
returns `{locked: false}` (fail-open), no log, completely invisible.
Both belong to the "silently broken since the encryption-vault series of changes"
class — caught only because the integration test for the lockout flow
expected the 6th login attempt to return 429 and got 200 instead.
Fixes:
- recordAttempt(): drop the bogus `id` column from the INSERT (let the
sequence default assign it), default ipAddress to null instead of
letting `${undefined}` collapse the parameter slot, and surface
errors in the catch instead of swallowing them silently.
- checkLockout(): pass `windowStart.toISOString()` instead of the Date
object so postgres-js can serialize it. Same catch upgrade — log the
cause when failing open.
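The window rule the lockout test asserts can be sketched as a pure
function (a sketch with hypothetical names; the real checkLockout
expresses this as a SQL count over auth.login_attempts):

```typescript
// Sketch of the "5 failures in 15 min" rule (names hypothetical).
interface Attempt {
  attemptedAt: Date;
  success: boolean;
}

function isLockedOut(
  attempts: Attempt[],
  now: Date,
  maxFailures = 5,
  windowMinutes = 15
): boolean {
  const windowStart = new Date(now.getTime() - windowMinutes * 60_000);
  // When this comparison moves into the SQL template, the bound value
  // must be windowStart.toISOString(): postgres-js cannot bind a raw Date.
  const recentFailures = attempts.filter(
    (a) => !a.success && a.attemptedAt.getTime() > windowStart.getTime()
  ).length;
  return recentFailures >= maxFailures;
}
```

Five failures inside the window lock the account; the same five
failures 20 minutes earlier do not.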
Failure-path test additions (tests/integration/auth-failures.test.ts):
- wrong password: assert 401, no JWT, +1 LOGIN_FAILURE in security_events,
+1 row in auth.login_attempts
- account lockout: 5 failed attempts then 6th returns 429 with
remainingSeconds, even with the correct password
- unverified email login: 403 with code = EMAIL_NOT_VERIFIED
- validate with garbage token: valid !== true
- resend verification: second mail arrives in mailpit
Plus the run-integration-tests.sh helper now runs both .test.ts files
and tests/integration/package.json's `test` script does the same.
Negative-control: reverted the recordAttempt fix (re-added the bogus
gen_random_uuid id), the wrong-password test failed at the
login_attempts assertion. Reverted the checkLockout fix, the lockout
test failed at the 429 assertion. Both fixes verified to be load-bearing.
6 tests, 45 expects, ~1.3s on a warm cache.
The base image referenced packages/shared-subscription-types and
packages/shared-subscription-ui, which were consolidated into
packages/subscriptions a while back and no longer exist on disk.
`build-app.sh --base` therefore failed every time with:
failed to compute cache key: "/packages/shared-subscription-ui": not found
That latent failure was harmless until today: the CSP fix for WebLLM
in @mana/shared-utils never made it into the live mana-web container
because shared-utils lives inside sveltekit-base:local (not COPYed by
the per-app Dockerfile), and rebuilding the base was impossible. With
the stale lines removed the base image rebuilds, picks up the current
shared-utils, and downstream apps inherit the fixed CSP automatically.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Two unrelated bugs in scripts/mac-mini/ensure-containers-running.sh,
both caught while debugging a mana-auth crash loop on 2026-04-08:
1. The recovery path passed --env-file "$PROJECT_ROOT/.env.macmini" to
docker compose, but that file has never existed on the server — only
.env does, and compose auto-loads it from the working directory. The
explicit --env-file silently caused recovered containers to start with
empty secrets (e.g. blank MANA_AUTH_KEK), which made mana-auth crash
the moment it came back up. The auto-recovery loop was therefore
self-defeating: it kept "fixing" auth into the same broken state
every 5 minutes for hours, with no notification because compose
exited 0. Drop --env-file entirely and cd into PROJECT_ROOT so
compose's standard .env discovery applies.
2. mana-infra-minio-init is a one-shot job container that legitimately
sits in "exited" state after running once. The script flagged it as
"stuck" every cycle, tried to "recover" it, and spammed the log with
ERROR lines. Add an explicit ONESHOT_INIT_CONTAINERS allowlist and
skip those names in both the initial scan and the post-recovery
verification.
Also tee compose output into the log so future failures actually leave
a breadcrumb instead of disappearing into the void.
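The allowlist skip amounts to roughly this (a sketch; variable and
function names are assumptions, not the script's actual ones):

```shell
#!/bin/sh
# Sketch of the one-shot allowlist check (names hypothetical).
ONESHOT_INIT_CONTAINERS="mana-infra-minio-init"

is_oneshot() {
  # Space-delimited membership test, POSIX sh compatible.
  case " $ONESHOT_INIT_CONTAINERS " in
    *" $1 "*) return 0 ;;
    *)        return 1 ;;
  esac
}

for name in mana-infra-minio-init mana-auth; do
  if is_oneshot "$name"; then
    echo "skip $name: one-shot job container, exited is fine"
  else
    echo "check $name"
  fi
done
```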
Also: bump @mlc-ai/web-llm from a transitive dep (via @mana/local-llm)
to a direct dep of @mana/web. SvelteKit's adapter-node post-build
Rollup pass uses the web app's direct deps as its externals heuristic;
without this entry it warns "@mlc-ai/web-llm ... could not be resolved
- treating it as an external dependency" on every build. Functionally
harmless (the dynamic import in LocalLLMEngine only fires in the
browser), but the warning hid a real adapter-node misconfiguration
that would have bitten us if we'd ever tried to SSR /llm-test.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Two more centred bottom-anchored toasts had the same problem the
EncryptionIntroBanner had in 2a437a586: their own position: fixed
with hardcoded bottom + transform centring put them in stacking
contexts that the QuickInputBar (z-index 90) either covered up
(SessionWarning, z-index 45 → hidden) or sat under (SuggestionToast,
z-index 9999 → covered the input bar instead).
Both moved into .bottom-stack as .bottom-stack-notification children
in (app)/+layout.svelte, with the parent handling positioning and
the components themselves stripped down to in-flow flex items.
- SessionWarning: was a free-floating element inside (app)/+layout
but outside the bottom-stack — moved into the stack, kept the
authStore.isAuthenticated gate so it only renders for logged-in
users
- SuggestionToast: was mounted in the ROOT layout, but its only
consumer (automationsStore) is an (app)-only module so the toast
never made sense on auth/landing pages. Moved into (app) bottom-
stack, removed from root layout
CSS cleanup in both: dropped position: fixed, bottom, left,
transform, max-width, z-index. Slide-up keyframes rewritten to use
translateY only (no more parent-transform-X to fight with).
Stack order in (app)/+layout.svelte from top to bottom now:
1. EncryptionIntroBanner (one-time)
2. NotificationBar (guest nudge, conditional)
3. SessionWarning (auth-only, conditional)
4. SuggestionToast (auto-dismissing, conditional)
5. QuickInputBar
6. TagStrip
7. PillNav
Corner-anchored toasts (PwaUpdatePrompt right-12px, SyncConflictToast
right-1rem) intentionally NOT moved — they live in different visual
real estate and don't compete with the centred stack column.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
The one-time encryption intro banner used its own position: fixed at
bottom: 1.5rem with z-index: 60, mounted at the root layout level.
That put it in a stacking context the QuickInputBar covered up — the
search bar visually sat ON TOP of the banner instead of below it,
making the privacy claim half-readable and the dismiss X impossible
to click.
Same fix the guest nudge got in c8ed58b7d: move into the bottom-stack
flex container in (app)/+layout.svelte and let the parent handle
positioning. The banner is now the FIRST child of the stack so it
renders above the guest nudge / QuickInputBar / TagStrip / PillNav
and stays in flow as the stack reflows when nav collapses.
- Removed `<EncryptionIntroBanner />` from root +layout.svelte (it
doesn't belong above the (app) gate anyway since it self-checks
isVaultUnlocked() which is always false outside auth context)
- Mounted inside `.bottom-stack` as the first `.bottom-stack-notification`
child in (app)/+layout.svelte
- Stripped position: fixed / bottom / left / transform / max-width /
z-index from the banner CSS — now an in-flow flex item with
width: 100% (the wrapper centres + caps width via the existing
bottom-stack-notification rules)
- Slide-up animation rewritten to use translateY only since the
parent no longer transforms the banner
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Adds two assertions to the auth-flow integration test that exercise the
audit-log path:
- after register: expect exactly 1 REGISTER row in auth.security_events
for the new user
- after login: expect exactly 1 LOGIN_SUCCESS row for the same user
This locks in the fix from the previous commit (security.ts ?? null
guard for optional fields) and catches any future regression where
security.logEvent silently swallows a SQL error and the audit log goes
into the void.
Verified by reverting security.ts to the broken pre-fix version and
re-running — the test fails with `Expected: 1, Received: 0` at the
register-audit assert in 211ms instead of taking hours of production
debugging.
Also adds an explicit DELETE FROM auth.security_events to the afterAll
cleanup. The FK from security_events.user_id to auth.users(id) is
ON DELETE CASCADE so it would clean up implicitly anyway, but listing
it explicitly makes the cleanup intent obvious from the test source.
Net: 24 → 26 expects per run. Still ~22s end-to-end on a warm cache.
WebAssembly.instantiate() was blocked by script-src on every app using
shared security headers. 'wasm-unsafe-eval' is the narrow CSP source
that whitelists WASM compilation only — it does NOT re-enable eval() or
new Function(). Required by the MLC WebGPU runtime that powers the
in-browser Qwen models on /llm-test.
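Illustratively, the directive ends up shaped like this (an assumed
minimal header, not the exact value shared-utils emits):

```http
Content-Security-Policy: script-src 'self' 'wasm-unsafe-eval'
```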
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Covers today's voice quick-add session: shared <VoiceCaptureBar>
across five modules, generic /api/v1/voice/{transcribe,parse-task,
parse-habit} endpoints, LLM-driven structure extraction with tag
matching, the mana-llm Ollama routing fix (Colima LAN-range RST
gotcha), gemma3:12b + few-shot prompt iteration, 49 unit tests,
the .env.secrets persistent dev-secret layer, and the two real
bugs found during prod deployment (SvelteKit prod-export
restriction + $env/dynamic/private PUBLIC_-prefix exclusion).
18 commits, ~+1,220 LOC net, voice quick-add now live on mana.how
for todo + habits with full structured extraction.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
WebLLM was blocked by connect-src — model config and weight shards live
on huggingface.co (+ cdn-lfs.* for LFS), and the WebGPU model_lib WASM
comes from raw.githubusercontent.com (binary-mlc-llm-libs). Also wires
Gemma 2 2B/9B into the model registry so /llm-test picks them up.
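The resulting directive is shaped roughly like this (hosts from the
description above; the exact emitted list is an assumption):

```http
Content-Security-Policy: connect-src 'self' https://huggingface.co https://*.huggingface.co https://raw.githubusercontent.com
```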
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
logEvent() builds its INSERT via a raw `sql` tagged template:
sql\`INSERT INTO auth.security_events
(..., user_id, ip_address, user_agent, metadata, ...)
VALUES (..., \${params.userId}, \${params.ipAddress},
\${params.userAgent}, \${...metadata}, ...)\`
Most call sites only pass userId+eventType (or only eventType for the
LOGIN_FAILURE / PASSWORD_RESET_REQUESTED / PROFILE_UPDATED /
PASSWORD_CHANGED / ACCOUNT_DELETED events). The other params land in
the template as `undefined`, and postgres-js's tagged-template renderer
collapses `${undefined}` into literal nothing — producing this:
VALUES (gen_random_uuid(), $1, $2, , , $3::jsonb, NOW())
^^^^
Postgres rejects with "syntax error at or near \",\"". The catch block
swallowed it as a `console.warn('Failed to log security event
(non-critical):', params.eventType)` with no error detail, which is why
this has been silently broken for who knows how long — every register,
every login, every password change has been losing its audit row.
Fix:
- Coerce optional params to `null` (`params.userId ?? null`) before
interpolation. NULL is what postgres-js renders for an explicit null.
- Surface the actual error in the catch warn so the next time something
similar happens it shows up in logs instead of just "non-critical".
Verified the diagnosis by toggling `log_statement = all` on the test
postgres, triggering a register, and reading the literal failed
statement out of postgres logs.
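The failure mode is easy to reproduce with a toy tag function (a
simplified model of the renderer's behaviour, not postgres-js's
actual code):

```typescript
// Toy renderer: undefined interpolations vanish from the statement,
// everything else (including null) occupies a numbered parameter slot.
function renderSql(strings: TemplateStringsArray, ...values: unknown[]): string {
  let sql = strings[0];
  let n = 0;
  for (let i = 0; i < values.length; i++) {
    sql += values[i] === undefined ? "" : `$${++n}`;
    sql += strings[i + 1];
  }
  return sql;
}

const userId = "u-123";
const ipAddress = undefined; // optional param the call site never passed
const userAgent = undefined;

renderSql`VALUES (${userId}, ${ipAddress}, ${userAgent})`;
// "VALUES ($1, , )"  <- the syntax error Postgres rejects

renderSql`VALUES (${userId}, ${ipAddress ?? null}, ${userAgent ?? null})`;
// "VALUES ($1, $2, $3)"  <- explicit null binds as a real parameter
```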
Local dev secrets like MANA_STT_API_KEY had no persistent home — they
lived only in the gitignored, generator-overwritten per-app .env files.
Every `pnpm setup:env` wiped them, so devs had to re-paste keys after
any env regeneration. Same recurring friction for MANA_LLM_API_KEY,
MANA_AUTH_KEK, OAuth keys, etc.
New layer: `.env.secrets` at the repo root.
- Gitignored, optional, never required for the build to pass
- Read by generate-env.mjs AFTER .env.development; non-empty values
override the matching key, so the merged result drives every per-app
.env the generator writes
- Empty values fall through to the .env.development defaults — a
freshly-copied .env.secrets.example is a no-op
- One source of truth for all dev secrets, propagated to every app
with one `pnpm setup:env`
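The merge order can be sketched like this (a sketch; the real
generate-env.mjs may differ in detail):

```typescript
// .env.development supplies defaults; non-empty .env.secrets values win;
// empty secret values fall through so a fresh .env.secrets.example is a no-op.
function mergeEnv(
  defaults: Record<string, string>,
  secrets: Record<string, string>
): Record<string, string> {
  const merged = { ...defaults };
  for (const [key, value] of Object.entries(secrets)) {
    if (value !== "") merged[key] = value;
  }
  return merged;
}

mergeEnv(
  { MANA_STT_API_KEY: "", MANA_LLM_URL: "http://localhost:3025" },
  { MANA_STT_API_KEY: "sk-real-key", MANA_LLM_URL: "" }
);
// MANA_STT_API_KEY is overridden, MANA_LLM_URL keeps its default
```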
Files:
- `.env.secrets.example` — committed template documenting all known
secret keys (mana-stt, mana-llm, auth KEK, sync JWT, MinIO, third-
party APIs). Devs `cp .env.secrets.example .env.secrets` and fill in.
- `.gitignore` — ignores .env.secrets, allows .env.secrets.example
- `scripts/generate-env.mjs` — loads .env.secrets if present, prints
"Loaded N secrets from .env.secrets" so devs see the override
taking effect
- `scripts/setup-secrets.mjs` + `pnpm setup:secrets` — convenience
script that SSHes to mana-server, greps the prod .env for the keys
defined in .env.secrets.example, and writes them locally. Confirms
before overwriting an existing .env.secrets unless --force is set;
reports which keys couldn't be found on the remote so devs know
what's left to fill manually
- `docs/LOCAL_DEVELOPMENT.md` + `docs/ENVIRONMENT_VARIABLES.md` —
walk-through and architecture diagram update
Verified end-to-end:
- `rm .env.secrets apps/mana/apps/web/.env && pnpm setup:env` →
STT key empty (no regression for devs who haven't opted in)
- `pnpm setup:secrets --force && pnpm setup:env` →
STT key propagated, "Loaded 3 secrets from .env.secrets" in output
- POST /api/v1/voice/transcribe with a real audio file →
full transcript back via gpu-stt.mana.how, end-to-end working
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
The playground route was previously a stub. This turns it into a proper
module:
- A streaming chat surface that talks to mana-llm's OpenAI-compatible
/v1/chat/completions and /v1/models. The SSE chunk parser is hand-rolled
in modules/playground/llm.ts (~30 lines) rather than pulling a dep —
the wire format is straight OpenAI and the playground is the only
consumer right now. If chat / todo enrichment / cycles insights end up
hitting the same surface, this lifts cleanly into $lib/data/llm-client.ts.
- A persisted **snippets** store: name + systemPrompt + (model, temperature)
defaults that the user can pin and reorder. Stateless chat history stays
out — that's what the chat module is for. Both `name` and `systemPrompt`
are encrypted (same pattern as notes/dreams), with a registry entry in
data/crypto/registry.ts and a Dexie schema in data/database.ts.
- Standard module wiring: collections.ts / queries.ts / types.ts /
stores/snippets.svelte.ts / module.config.ts, registered in
module-registry.ts alongside the other 30+ modules.
- ListView.svelte and the (app)/playground/+page.svelte route consume
the new store + the streaming client.
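The hand-rolled SSE parsing amounts to roughly this (a simplified
sketch, not the actual llm.ts code):

```typescript
// Parse one SSE buffer of OpenAI-style chat chunks: take the `data: `
// lines, stop at the [DONE] sentinel, collect the content deltas.
function parseSseDeltas(buffer: string): string[] {
  const deltas: string[] = [];
  for (const line of buffer.split("\n")) {
    if (!line.startsWith("data: ")) continue;
    const payload = line.slice("data: ".length).trim();
    if (payload === "[DONE]") break;
    const content = JSON.parse(payload)?.choices?.[0]?.delta?.content;
    if (typeof content === "string") deltas.push(content);
  }
  return deltas;
}
```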
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Phase 8 follow-up. Places carries GDPR-sensitive PII so it gets the same
treatment as the rest of Phase 7+8, with one deliberate carve-out:
- `places` encrypts the user-typed surface (name / description / address)
but leaves lat/lng plaintext so the proximity matcher in
tracking.svelte.ts can run during background geolocation logging without
a vault unlock. The trade-off is documented inline in registry.ts: a
handful of named POIs is much less sensitive than the full movement
trail.
- `locationLogs` IS the movement trail, so every coordinate field
(latitude, longitude, accuracy, altitude, speed, heading) is encrypted.
Indexed columns (timestamp, placeId, [placeId+timestamp]) stay plaintext
for the time-range scans in the log view.
- `placeTags` stays out of the registry — pure FK join table, no user
content, same pattern as manaLinks.
queries.useAllPlaces / useLocationLogs now decrypt before mapping to the
DTO. placesStore.create/update snapshot the plaintext DTO before
encryptRecord mutates the local record in place — same pattern as
notes/dreams/contacts. trackingStore.logPosition decrypts the place set
before running the nearest-place match (the lat/lng carve-out means this
still works pre-unlock, but downstream consumers want the decrypted name).
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
VictoriaMetrics + vmalert previously copied prometheus.yml/alerts.yml from
/mnt/prometheus-config/ into /etc/prometheus/ at container start. The copy
silently drifted from the host file whenever the container wasn't restarted —
which is exactly what hid the matrix/element removal from status.mana.how
until 2026-04-08, when VM was still actively scraping the deleted targets
because its in-container config snapshot pre-dated the cleanup.
Now both containers mount ./docker/prometheus directly into /etc/prometheus
(resp. /etc/alerts) read-only and point the binary at it, and deploy.sh
issues POST /-/reload to both after each deploy so config edits go live
without a container recreate.
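Sketched as a compose fragment (illustrative; service names and the
reload port are assumptions):

```yaml
services:
  victoriametrics:
    volumes:
      - ./docker/prometheus:/etc/prometheus:ro   # host file IS the config
  vmalert:
    volumes:
      - ./docker/prometheus:/etc/alerts:ro
# deploy.sh then hits each binary's reload endpoint, e.g.:
#   curl -fsS -X POST http://localhost:8428/-/reload
```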
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
The first prod deploy of voice quick-add (3b41b39a3) silently fell
back for every transcript: title=transcript verbatim, dueDate=null,
priority=null, labels=[]. The endpoint code was reaching the
fallback() path even though mana-llm was healthy and reachable from
inside the mana-web container.
Root cause: SvelteKit's $env/dynamic/private explicitly excludes any
env var that starts with the public prefix (default PUBLIC_). The
parse-task code read
env.MANA_LLM_URL || env.PUBLIC_MANA_LLM_URL || 'http://localhost:3025'
expecting to fall back to PUBLIC_MANA_LLM_URL when MANA_LLM_URL was
unset, but $env/dynamic/private treats PUBLIC_MANA_LLM_URL as if it
didn't exist on the server side. So it always fell through to
http://localhost:3025, which from inside mana-web is nothing,
fetch threw, and coerce returned the fallback shape.
Two fixes:
1. docker-compose.macmini.yml — set MANA_LLM_URL (no prefix) on
mana-web alongside PUBLIC_MANA_LLM_URL. The PUBLIC_ var is still
needed for the browser-side playground and status page; the
private one is what the parse endpoints actually read.
2. parse-task and parse-habit — drop the dead env.PUBLIC_MANA_LLM_URL
fallback so the next dev who reads the code doesn't think it'd
ever work. Add a comment explaining the SvelteKit gotcha so the
next person setting up a new env var doesn't repeat this mistake.
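A toy model of SvelteKit's rule (simplified, not the framework's
actual code): keys that start with the public prefix are filtered
out of $env/dynamic/private's view of the environment.

```typescript
// Any key matching the public prefix never appears in the private env.
function privateEnv(
  processEnv: Record<string, string>,
  publicPrefix = "PUBLIC_"
): Record<string, string | undefined> {
  return Object.fromEntries(
    Object.entries(processEnv).filter(([key]) => !key.startsWith(publicPrefix))
  );
}

const env = privateEnv({ PUBLIC_MANA_LLM_URL: "http://mana-llm:3025" });
// env.PUBLIC_MANA_LLM_URL is undefined server-side, so the old chain
//   env.MANA_LLM_URL || env.PUBLIC_MANA_LLM_URL || 'http://localhost:3025'
// always fell through to localhost.
```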
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Adds a 13-step integration test that exercises register → email
verification → login → JWT validation → /me/data → encryption-vault
init/key → logout against a real stack of postgres + redis + mailpit +
mana-auth + mana-notify in docker compose.
Verified locally that this catches every regression we hit on
2026-04-08 in well under a second:
- missing nanoid dependency → register endpoint 500
- missing MANA_AUTH_KEK env passthrough → mana-auth never starts
- missing encryption-vault SQL migrations → vault endpoints 500
- wrong cookie name in /api/v1/auth/login → no accessToken in response
- mana-notify SMTP misconfigured → mailpit poll times out
Files:
- docker-compose.test.yml — minimal isolated stack on alt ports
(postgres 5443, redis 6390, mailpit 1026/8026, mana-auth 3091,
mana-notify 3092). Runs alongside the dev stack without collision.
Postgres healthcheck runs a real query rather than just pg_isready
to avoid the race where pg_isready reports healthy while the docker
init scripts are still running on a unix socket.
- tests/integration/auth-flow.test.ts — bun test that drives the full
flow via fetch + mailpit's REST API. Cleans up its test user from
postgres in afterAll. Self-contained, no extra deps.
- tests/integration/README.md — what's covered, why it exists, how
to run locally + extend.
- scripts/run-integration-tests.sh — orchestrator. Brings up the
stack, pushes the @mana/auth Drizzle schema, applies the
encryption-vault SQL migrations (002, 003), restarts mana-auth so
it sees the fresh tables, runs the test, tears down on exit.
KEEP_STACK=1 to leave it up for manual mailpit inspection.
- docker-compose.dev.yml — also adds Mailpit as a regular dev service
(ports 1025/8025) so local development can have a working email
capture without spinning up the test stack.
- .github/workflows/ci.yml — new auth-integration job that runs on
every PR. Calls run-integration-tests.sh; on failure dumps
mana-auth + mana-notify logs and the mailpit message queue. Marked
as a required check via the existing PR validation pipeline.
Reproduced 3 clean runs and 1 negative-control run (removed nanoid
from package.json → mana-auth container exits → script aborts with
non-zero) before committing. Full happy path runs in ~22s on a warm
Docker cache.
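The real-query healthcheck mentioned above looks roughly like this
(illustrative fragment; user and timing values are assumptions):

```yaml
services:
  postgres:
    healthcheck:
      # pg_isready can report ready while init scripts still run over the
      # unix socket; a real query over TCP only passes once init is done.
      test: ["CMD-SHELL", "psql -h 127.0.0.1 -U postgres -c 'SELECT 1' || exit 1"]
      interval: 2s
      retries: 30
```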
SvelteKit's production build forbids non-handler exports from a
+server.ts file — dev runs them fine but `pnpm build` errored with
"Invalid export 'coerce' in /api/v1/voice/parse-task" when trying to
deploy mana-web with the new unit tests.
Move ParseResult, fallback, DATE_TRIGGER_PATTERNS,
PRIORITY_TRIGGER_PATTERNS, transcriptMentions, coerce, and extractJson
into a sibling coerce.ts module. The +server.ts file imports from
there and only exports POST, which is the prod build's hard rule.
Tests now import from ./coerce instead of from the route handler,
which also drops the $env/dynamic/private resolution dance from the
test fast path — coerce.test.ts now runs in ~130ms instead of ~400ms
because it pulls in zero SvelteKit runtime.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Fills the gap between the last entry (2026-03-31) and today. The 2026-04-06
slot is intentionally skipped — no commits that day.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
The deterministic guards in parse-task's coerce() are the load-bearing
defense against gemma3 hallucinating dueDate / priority on bare tasks.
The integration tests against the live LLM cover the happy path
end-to-end, but they go offline as soon as mana-llm is unreachable —
the unit tests cover the guard logic in isolation with synthetic LLM
responses, so a regression in the rules is caught even when the LLM
itself is dark.
22 cases:
- transcriptMentions: substring matching, case-insensitivity, empty
pattern list, the German + English date words from the few-shot
examples, and the negative cases ("Mülltonnen rausstellen",
"Buy milk") that must NOT trigger.
- coerce: fallback shape on garbage input, transcript-as-title when
the model omits one, time-component stripping ("2026-04-09T14:00:00"
→ "2026-04-09"), malformed dueDate rejection, the dueDate /
priority hallucination guards (drop when the transcript has no
trigger word), real-date / real-priority preservation, label
filtering (cap at 3, drop non-strings, empty array on non-array
input), invalid priority value rejection.
Helpers exported solely for the tests via a __test object — the
production endpoint goes through buildPrompt + coerce as before.
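The helper under test is, in essence, this (a simplified sketch of
its contract, not the shipped code):

```typescript
// Case-insensitive substring scan; an empty pattern list never matches.
function transcriptMentions(transcript: string, patterns: string[]): boolean {
  const lowered = transcript.toLowerCase();
  return patterns.some((p) => lowered.includes(p.toLowerCase()));
}

transcriptMentions("Anna nächsten Montag anrufen", ["montag", "monday"]); // true
transcriptMentions("Mülltonnen rausstellen", ["montag", "monday"]);       // false
transcriptMentions("Buy milk", []);                                       // false
```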
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Two related changes that fall out of real end-to-end testing against
the now-working local mana-llm.
1. Default model bumped from gemma3:4b to gemma3:12b for both
parse-task and parse-habit. The 4b model gets weekday math
off-by-one ("nächsten Montag" from a Wednesday → 2026-04-14
instead of 2026-04-13), aggressively shortens titles ("Anna
anrufen" → "Anrufen"), and frequently paraphrases habit names
instead of copying verbatim ("Joggen" instead of "Laufen") which
the verbatim-validation in coerce drops, costing an LLM round-trip
for nothing. The 12b variant is roughly 10% slower for these
tiny prompts (~1.1s vs ~1.0s on the GPU box) so the accuracy
win is essentially free.
2. parse-task prompt rewritten as few-shot. Pure rule descriptions
were *worse* than simple examples — the long "Rules — read
carefully" section in the previous prompt actually made the model
compute next Monday as 2026-04-14 even though a direct "what date
is next Monday?" prompt to the same model returned 2026-04-13.
The detailed rules were also priming the model to over-shorten
titles and over-eagerly tag filler words. Five worked examples
(including the previously-failing "Anna nächsten Montag anrufen"
case) plus one novel held-out case ("Mama am Wochenende besuchen")
all come back correct now.
The deterministic guards in coerce() are kept as a backstop for the
day the GPU box swaps in a weaker model — they're cheap and don't
hurt the happy path.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Same convention as STT_URL — nobody runs mana-llm in local Docker for
dev work, the shared gateway is always reachable, so the path of least
friction is to point at it by default. Devs who want a fully offline
stack can still override the var locally.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Real end-to-end testing against the now-working local mana-llm
surfaced two model behaviours the prompt couldn't talk down:
1. gemma3:4b stamps today's date on every task that doesn't have a
real time anchor. "Mülltonnen rausstellen" came back with
dueDate=2026-04-08 and priority=low even though the prompt
explicitly said "MUST be null when no date is mentioned". After
typing "Buy milk" the user would silently get a today-due task,
which is worse than no parsing at all.
2. The model occasionally returns dueDate as a full ISO timestamp
("2026-04-09T14:00:00") when the transcript mentions a time. The
coerce regex previously matched the prefix and let the timestamp
through unchanged, which then breaks the YYYY-MM-DD-shaped Dexie
field downstream.
Fix: deterministic post-processing in coerce. The prompt is also
tightened with explicit "ONLY when…" rules but the guards are the
load-bearing change since gemma3:4b ignores prompt restrictions.
- Strict YYYY-MM-DD extraction: a leading-anchor regex match keeps
only the date prefix even if the model adds a time component.
- DATE_TRIGGER_PATTERNS: substring scan over the original transcript
for German + English date words. If the LLM returned a dueDate but
the transcript has zero matches, drop the date — it was a
hallucination. False negatives are preferable to false positives:
letting through a fake date is more annoying than suppressing a
real one the user can re-type.
- PRIORITY_TRIGGER_PATTERNS: same idea for priority. The model thinks
taxes are inherently urgent; we don't want to inherit that opinion.
The labels field is left noisy on purpose — "müll", "unbedingt",
"erledigen" all come back from a single transcript and only the ones
that fuzzy-match an existing workspace tag end up on the task, so
filtering filler words at this layer would be wasted work.
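Combined, the two date guards behave like this (a sketch; the real
pattern list is much longer than this assumed subset):

```typescript
// Assumption: small illustrative subset of the real DATE_TRIGGER_PATTERNS.
const DATE_TRIGGER_PATTERNS = [
  "morgen", "montag", "wochenende", "tomorrow", "monday", "weekend",
];

// Keep only a leading YYYY-MM-DD prefix, reject anything malformed.
function stripToDate(raw: string): string | null {
  const m = /^(\d{4}-\d{2}-\d{2})/.exec(raw);
  return m ? m[1] : null;
}

// Drop the date entirely when the transcript has no trigger word:
// the model hallucinated it.
function guardDueDate(transcript: string, dueDate: string | null): string | null {
  if (!dueDate) return null;
  const date = stripToDate(dueDate);
  if (!date) return null;
  const lowered = transcript.toLowerCase();
  return DATE_TRIGGER_PATTERNS.some((p) => lowered.includes(p)) ? date : null;
}

guardDueDate("Anna nächsten Montag anrufen", "2026-04-13T14:00:00"); // "2026-04-13"
guardDueDate("Mülltonnen rausstellen", "2026-04-08");                // null
```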
Verified against five transcripts spanning bare/explicit/relative
date in DE + EN. Real LLM round-trip via http://localhost:5173 →
https://llm.mana.how → ollama gemma3:4b. Local mana-llm now reaches
its Ollama backend after the gpu-proxy routing fix in 7f382138a.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
A grep audit after the previous matrix removal commits found a handful
of stragglers in non-runtime files that the earlier sweeps missed:
- services/mana-llm/CLAUDE.md: removed matrix-ollama-bot from the
consumer-apps diagram and from the related-services table
- services/mana-video-gen/CLAUDE.md: removed "Matrix Bots" integration
bullet
- packages/notify-client/README.md: removed sendMatrix() doc entry
(the method itself was already gone in the prior cleanup)
- docker/grafana/dashboards/logs-explorer.json: dropped the "Matrix
Stack" log row that queried tier="matrix" (would show no data forever)
- docker/grafana/dashboards/master-overview.json: dropped the "Matrix
Bots" stat panel that counted up{job=~"matrix-.*-bot"}
- apps/mana/apps/landing/src/data/ecosystem-health.json: regenerated via
scripts/ecosystem-audit.mjs to drop matrix from the app list, icon
counts, file analytics, top offenders and authGuard missing list
- .gitignore: removed services/matrix-stt-bot/data/ pattern (the
service itself was deleted long ago)
Production-side stragglers also addressed (not in this commit):
- DROP USER synapse on prod Postgres (the parallel cleanup commit
2514831a3 dropped DATABASE matrix + DATABASE synapse but left the
role behind)
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
The mana-service-llm container had OLLAMA_URL pointed at the GPU box's
LAN address (192.168.178.11:11434). On the Mac Mini host that route
works fine, but from inside any Colima container the entire
192.168.178.0/24 subnet gets a synthesized RST — Colima's VM "claims"
the LAN range without being able to route to it, so every connect()
returns "Connection refused" before a packet ever leaves the box.
mana-llm started cleanly, reported the configured upstream as
"unhealthy", served an empty /v1/models list, and every chat
completion failed with "All connection attempts failed". The most
visible downstream effect: voice quick-add (parse-task, parse-habit)
silently degraded to its no-LLM fallback for everyone hitting the
local stack — same shape as a successful response, no error log,
just no enrichment.
The Mac Mini already runs a gpu-proxy LaunchAgent
(com.mana.gpu-proxy, /Users/mana/gpu-proxy.py) that forwards
127.0.0.1:13434 → 192.168.178.11:11434 alongside several other GPU
service ports. Pointing OLLAMA_URL at host.docker.internal:13434 and
adding the host-gateway extra_hosts mapping puts mana-llm on the
already-running rail. Verified end-to-end: from inside the container,
GET http://host.docker.internal:13434/api/tags now returns the full
model list (gemma3:4b, gemma3:12b, gemma3:27b, qwen2.5-coder:14b,
nomic-embed-text).
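The resulting wiring, sketched (illustrative fragment; keys match the
description above, surrounding config elided):

```yaml
services:
  mana-service-llm:
    environment:
      # host.docker.internal -> Colima VM host -> gpu-proxy -> GPU box
      OLLAMA_URL: http://host.docker.internal:13434
    extra_hosts:
      - "host.docker.internal:host-gateway"
```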
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Reflects the removal of apps/matrix and services/mana-matrix-bot from
the workspace plus the dropped @matrix-org/matrix-sdk-crypto-nodejs
override in package.json. Net -365 lines.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
The /api/v1/voice/parse-task and /api/v1/voice/parse-habit endpoints
forwarded transcripts to mana-llm without an X-API-Key header. This
worked against the local mana-llm container (no auth) but silently
fell back to the no-LLM path when pointed at gpu-llm.mana.how, which
requires an API key — voice quick-add would look like it was running
in degraded mode forever with no signal that auth was the cause.
Now both endpoints read MANA_LLM_API_KEY from the server-side env and
attach it as X-API-Key when present, mirroring the pattern already
used by /api/v1/voice/transcribe for mana-stt. When the var is empty
the header is omitted, so local Docker setups without auth still work.
Plumbing: generate-env.mjs writes MANA_LLM_URL + MANA_LLM_API_KEY into
apps/mana/apps/web/.env, .env.development gets the new keys with empty
defaults, ENVIRONMENT_VARIABLES.md documents the gateway and where to
get a key.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
The matrix subsystem was removed in a prior commit. This commit cleans
up the small leftovers that grep found:
- docker-compose.macmini.yml: dropped the "Matrix Stack" port-range
comment, the "matrix" category from the naming convention, and a
stale watchtower comment about Matrix notifications.
- packages/credits/src/operations.ts: removed AI_BOT_CHAT credit
operation type and its definition. It was the billing entry for "Chat
with AI via Matrix bot" — no callers left.
- services/mana-credits gifts schema + service + validation: removed the
targetMatrixId column / param / Zod field. The corresponding
PostgreSQL column was dropped manually with
`ALTER TABLE gifts.gift_codes DROP COLUMN target_matrix_id` on prod.
- docker/grafana/dashboards/{master,system}-overview.json: removed the
`up{job="synapse"}` panel queries — they would have shown No Data
forever now that Synapse is gone.
Production-side cleanup performed in parallel (not in this commit):
- Stopped + removed mana-matrix-{synapse,element,web,bot} containers
- Removed mana-matrix-bot:local, matrix-web:latest,
matrixdotorg/synapse:latest, vectorim/element-web:latest images (~3 GB)
- Removed mana-matrix-bots-data Docker volume
- Removed /Volumes/ManaData/matrix/ media store (4.3 MB)
- DROP DATABASE matrix; DROP DATABASE synapse; on Postgres
Cosmetic leftovers intentionally untouched:
- Eisenhower matrix in todo (LayoutMode 'matrix') — productivity concept
- ${{ matrix.service }} in .github/workflows — GitHub Actions strategy
- services/mana-media/apps/api/dist/.../matrix/* — stale build output
(not in git, regenerated next mana-media build)
Two new test files lock in the matching boundary where free-text LLM
hints meet the user's actual workspace data — that's where bugs hide
silently. Both matchers are now pure-function-shaped (the production
wrappers just feed them Dexie data) so the tests run without
fake-indexeddb or any I/O.
todo: 16 cases for matchLabelsToTagsPure covering exact / case /
diacritic / substring / specificity rules + the "never invent tags"
guarantee.
habits: 11 cases for matchHabitToTranscript including the word-
boundary "Bier vs ausprobiert" false-positive, multi-word matching,
and a real bug the test surfaced on the first run:
Without specificity ranking, "Tee" would always beat "Grüner Tee"
because the first matching habit in input order won. The matcher
now collects all candidates and returns the one with the most
matched tokens, so multi-word habits beat single-word substrings
whenever both could fit the transcript.
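The specificity ranking described above can be sketched as a pure function; this is an illustrative reduction, not the real matchHabitToTranscript, which also handles normalization and richer word-boundary logic:

```typescript
// Return the habit whose name matches the transcript with the MOST
// whole-word tokens, so multi-word habits beat single-word substrings.
function pickMostSpecific(
  transcript: string,
  habitNames: string[],
): string | null {
  const words = new Set(transcript.toLowerCase().split(/\s+/));
  let best: string | null = null;
  let bestTokens = 0;
  for (const name of habitNames) {
    const tokens = name.toLowerCase().split(/\s+/);
    // Candidate only if every token appears as a whole word in the
    // transcript (avoids "Bier" matching inside "ausprobiert").
    if (!tokens.every((t) => words.has(t))) continue;
    // Collect all candidates; keep the one with the most matched
    // tokens, so "Grüner Tee" beats "Tee" when both fit.
    if (tokens.length > bestTokens) {
      best = name;
      bestTokens = tokens.length;
    }
  }
  return best;
}
```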
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
This commit bundles two unrelated changes that were swept together by an
accidental `git add -A` in another working session. Documented here so the
history reflects what's actually inside.
═══════════════════════════════════════════════════════════════════════
1. fix(mana-auth): /api/v1/auth/login mints JWT via auth.handler instead
of api.signInEmail
═══════════════════════════════════════════════════════════════════════
Previous attempt (commit 55cc75e7d) tried to fix the broken JWT mint in
/api/v1/auth/login by switching the cookie name from `mana.session_token`
to `__Secure-mana.session_token` for production. That was necessary but
not sufficient: Better Auth's session cookie value isn't just the raw
session token, it's `<token>.<HMAC>` where the HMAC is derived from the
better-auth secret. Reconstructing the cookie from auth.api.signInEmail's
JSON response only gave us the raw token, so /api/auth/token's
get-session middleware still couldn't validate it and the JWT mint kept
silently failing.
Real fix: do the sign-in via auth.handler (the HTTP path) rather than
auth.api.signInEmail (the SDK path). The handler returns a real fetch
Response with a Set-Cookie header containing the fully signed cookie
envelope. We capture that header verbatim and forward it as the cookie
on the /api/auth/token request, which now passes validation and mints
the JWT correctly.
Verified end-to-end on auth.mana.how:
  $ curl -X POST https://auth.mana.how/api/v1/auth/login \
      -d '{"email":"...","password":"..."}'
  {
    "user": {...},
    "token": "<session token>",
    "accessToken": "eyJhbGciOiJFZERTQSI...",   ← real JWT now
    "refreshToken": "<session token>"
  }
Side benefits:
- Email-not-verified path is now handled by checking
signInResponse.status === 403 directly, no more catching APIError
with the comment-noted async-stream footgun.
- X-Forwarded-For is forwarded explicitly so Better Auth's rate limiter
and our security log see the real client IP.
- The leftover catch block now handles only unexpected exceptions
  (network errors, etc.); the FORBIDDEN-checking logic inside it is dead
  but harmless, left in as defense in depth.
═══════════════════════════════════════════════════════════════════════
2. chore: remove the entire self-hosted Matrix stack (Synapse, Element,
Manalink, mana-matrix-bot)
═══════════════════════════════════════════════════════════════════════
The Matrix subsystem ran parallel to the main Mana product without any
load-bearing integration: the unified web app never imported matrix-js-sdk,
the chat module uses mana-sync (local-first), and mana-matrix-bot's
plugins duplicated features the unified app already ships natively.
Keeping it alive cost a Synapse + Element + matrix-web + bot container
quartet, three Cloudflare routes, an OIDC provider plugin in mana-auth,
and a steady drip of devlog/dependency churn.
Removed:
- apps/matrix (Manalink web + mobile, ~150 files)
- services/mana-matrix-bot (Go bot with ~20 plugins)
- docker/matrix configs (Synapse + Element)
- synapse/element-web/matrix-web/mana-matrix-bot services in
docker-compose.macmini.yml
- matrix.mana.how/element.mana.how/link.mana.how Cloudflare tunnel routes
- OIDC provider plugin + matrix-synapse trustedClient + matrixUserLinks
table from mana-auth (oauth_* schema definitions also removed)
- MatrixService import path in mana-media (importFromMatrix endpoint)
- Matrix notification channel in mana-notify (worker, metrics, config,
channel_type enum, MatrixOptions handler)
- Matrix entries from shared-branding (mana-apps + app-icons),
notify-client, the i18n bundle, the observatory map, the credits
app-label list, the landing footer/apps page, the prometheus + alerts
+ promtail tier mappings, and the matrix-related deploy paths in
cd-macmini.yml + ci.yml
Devlog/manascore/blueprint entries that mention Matrix are left intact
as historical record. The oauth_* + matrix_user_links Postgres tables
stay on existing prod databases — code can no longer write to them, drop
them in a follow-up migration if you want them gone for real.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Users can now define multiple named layouts of the workbench homepage and
switch between them. Each scene holds its own openApps list with per-app
window state (minimized / maximized / size). Scene list syncs cross-device
via mana-sync; the active scene id is per-device (localStorage) so device
A doesn't pull device B into a different scene.
- new `workbenchScenes` Dexie table, registered in manaCoreConfig
- `workbenchScenesStore` (Dexie liveQuery) with scene CRUD + per-scene app
mutations; auto-seeds a default "Home" scene on first run
- SceneTabs pill bar above the carousel with dnd reorder + context menu
(rename / duplicate / delete); SceneRenameDialog and a reusable
ConfirmDialog for the destructive path
- workbench +page.svelte refactored to delegate all openApps mutations to
the store; the carousel itself is unchanged
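An illustrative shape for the scene data described above; the field names are assumptions, the real schema lives in the `workbenchScenes` Dexie table:

```typescript
// One named layout of the workbench homepage. The scene LIST syncs
// cross-device via mana-sync; only the active scene id stays local.
interface WorkbenchScene {
  id: string;
  name: string; // e.g. the auto-seeded default "Home"
  order: number; // tab position, reorderable via dnd
  openApps: Array<{
    appId: string;
    minimized: boolean;
    maximized: boolean;
    size?: { width: number; height: number };
  }>;
}

// The ACTIVE scene id is per-device (localStorage), so switching
// scenes on device A never pulls device B into a different scene.
const ACTIVE_SCENE_KEY = "workbench.activeSceneId"; // hypothetical key
function getActiveSceneId(storage: Pick<Storage, "getItem">): string | null {
  return storage.getItem(ACTIVE_SCENE_KEY);
}
```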
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
The parse-task endpoint already returns free-text label hints from the
LLM ("steuern", "haushalt", …). Now the todo store fuzzy-matches each
hint against the user's existing tags via tagCollection and assigns
the matched IDs to the task's metadata.labelIds.
Match policy is intentionally conservative:
- Normalize via NFD strip + lowercase + collapsed whitespace
- Exact normalized match wins
- Substring fallback only for ≥3 char strings (avoids "ab" hitting
every tag containing "ab")
- Never auto-creates a tag — even if the LLM is sure, an unknown topic
silently drops, because auto-creating would clutter the user's tag
list with one-off duplicates from voice transcripts
Both flows pick this up: voice always (transcripts almost always carry
topic hints), typed only when there's structured payoff. Same asymmetry
as before, except typed quick-add now also enriches when the LLM finds
only a tag match, without a date or priority.
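The match policy above can be sketched as a pure function; this is an illustrative reduction, not the real matchLabelsToTagsPure:

```typescript
// NFD strip + lowercase + collapse whitespace, per the match policy.
function normalize(s: string): string {
  return s
    .normalize("NFD")
    .replace(/[\u0300-\u036f]/g, "") // strip combining diacritics
    .toLowerCase()
    .replace(/\s+/g, " ")
    .trim();
}

// Returns the matched tag name, or null. Never invents a tag: an
// unknown topic is silently dropped.
function matchHint(hint: string, tagNames: string[]): string | null {
  const h = normalize(hint);
  // Exact normalized match wins outright.
  const exact = tagNames.find((t) => normalize(t) === h);
  if (exact) return exact;
  // Substring fallback only for >= 3 chars, so "ab" doesn't hit
  // every tag containing "ab".
  if (h.length >= 3) {
    return tagNames.find((t) => normalize(t).includes(h)) ?? null;
  }
  return null;
}
```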
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>