managarten

mirror of https://github.com/Memo-2023/mana-monorepo.git synced 2026-05-14 20:21:09 +02:00

Author	SHA1	Message	Date
Till JS	ed01d24f2d	feat(ai): add AI tools for myday, goals, mood, finance, and times Expand agent tool coverage from 28 to 47 tools across 16 modules: - myday: get_myday_summary (full daily context in one call) - goals: list_goals, get_goal_progress, create_goal, pause/resume/complete_goal - mood: log_mood, get_mood_today, get_mood_insights (trends + correlations) - finance: extend add_transaction, add get_month_summary + list_transactions - times: extend start/stop_timer, add get_timer_status, get_time_stats, list_projects All tools registered in both AI_TOOL_CATALOG (shared-ai) and webapp init. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 15:01:12 +02:00
Till JS	acd7e0d6b0	docs: update architecture comparison — 5/10 roadmap items done Update report to reflect all completed work: - Matrix: streaming ✅, tool registration updated to 29 tools + MCP - §5.2 Streaming: marked done - §5.3 Tool System: marked done - §6 Table: items 1-3 + 5 struck through with commit refs - §8 Fazit: updated gaps and recommendations 5 of 10 roadmap items complete in one session: 1. SSE Streaming, 2. Dynamic Tool Registry, 3. Budget Enforcement, 5. MCP Server Export (27/29 tools with DB ops), plus Tool Drift Fix. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 15:00:09 +02:00
Till JS	10acabfed6	feat(ai): tag-based agent scoping — agents see only their tagged records Connects the existing global tag system (@mana/shared-tags, 15+ module junctions, TagSelector UI) to the AI agent model so different agents can operate on different slices of the user's data. Core additions: 1. Agent.scopeTagIds — optional array of global tag IDs. When set, the agent sees only records tagged with at least one of those tags (plus untagged records, which stay globally visible). Empty/undefined = General-Agent, sees everything. Agent-editor grows a <TagSelector> under "Bereiche (Tag-Scope)". 2. Per-agent kontext documents — new Dexie table `agentKontextDocs` (v22, encrypted, synced). Each agent can have its own markdown context doc, replacing the global singleton auto-inject. Runner tries agent kontext first, falls back to global singleton when the agent has no dedicated doc. 3. Ambient scope context — `withAgentScope(tagIds, fn)` sets a module-level scope during the reasoning loop. Auto-tools read it via `getAgentScopeTagIds()` and filter their result sets. `filterByScope(records, getTagIds)` is the reusable filter primitive (keeps untagged records, drops mismatched tagged ones). 4. Notes tag junction — `noteTags` table (v22) + `noteTagOps` via `createTagLinkOps`. Notes was the only major module without structured tag support. `list_notes` now calls `filterByScope` so a scoped agent only sees notes tagged with its scope. Flow: mission starts → runner resolves owning agent → reads agent.scopeTagIds → wraps entire reasoning loop in withAgentScope → list_notes (and future list_tasks etc.) auto-filter → planner sees only scope-relevant records → proposes scoped edits. Runner tests: 8/8. shared-ai type-check: clean. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 13:43:33 +02:00
Till JS	56171ff13b	fix(ai): resolve tool name + parameter drift between catalog and webapp Some checks are pending CI / Build mana-api-gateway (push) Blocked by required conditions Details CI / Build mana-crawler (push) Blocked by required conditions Details CI / Build mana-media (push) Blocked by required conditions Details CI / Build mana-credits (push) Blocked by required conditions Details CI / Build mana-web (push) Blocked by required conditions Details CI / Build chat-backend (push) Blocked by required conditions Details CI / Build chat-web (push) Blocked by required conditions Details CI / Build todo-backend (push) Blocked by required conditions Details CI / Build todo-web (push) Blocked by required conditions Details CI / Build calendar-backend (push) Blocked by required conditions Details CI / Build calendar-web (push) Blocked by required conditions Details CI / Build clock-web (push) Blocked by required conditions Details CI / Build contacts-backend (push) Blocked by required conditions Details CI / Build contacts-web (push) Blocked by required conditions Details CI / Build presi-web (push) Blocked by required conditions Details CI / Build storage-backend (push) Blocked by required conditions Details CI / Build storage-web (push) Blocked by required conditions Details CI / Build telegram-stats-bot (push) Blocked by required conditions Details CI / Build food-backend (push) Blocked by required conditions Details CI / Build food-web (push) Blocked by required conditions Details CI / Build skilltree-web (push) Blocked by required conditions Details Docker Validate / Validate Dockerfiles (push) Waiting to run Details Docker Validate / Build calendar-web (push) Blocked by required conditions Details Docker Validate / Build quotes-web (push) Blocked by required conditions Details Docker Validate / Build todo-backend (push) Blocked by required conditions Details Docker Validate / Build todo-web (push) Blocked by required conditions Details Docker Validate / Build mana-auth (push) Blocked by required conditions Details Docker Validate / Build mana-sync (push) Blocked by required conditions Details Docker Validate / Build mana-media (push) Blocked by required conditions Details Mirror to Forgejo / Push to Forgejo (push) Waiting to run Details 8 mismatches fixed between AI_TOOL_CATALOG and webapp module tools: Tool name renames (webapp → catalog name): - record_visit → visit_place (places) - undo_last_drink → undo_drink (drink) - location_log → get_current_location (places, catalog side) Catalog parameter fixes (aligned to webapp execute functions): - create_event: startIso/endIso → startTime/endTime + isAllDay/location/description - create_note: title required→optional, content optional→required - complete_tasks_by_title: titleSubstring → titleMatch - create_place: add latitude/longitude (required) + category enum + address - create_journal_entry: English mood enum → German mood enum Webapp parameter additions: - create_contact: add company + notes params (store already accepts them) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 13:18:51 +02:00
Till JS	d40a61119e	refactor(ai): dynamic tool registry — single-source catalog in shared-ai Introduce AI_TOOL_CATALOG in @mana/shared-ai as the single source of truth for all 29 tool schemas (17 propose + 12 auto). Both the webapp policy and the server-side mana-ai planner now derive their tool lists from the catalog instead of maintaining independent hardcoded copies. - New: packages/shared-ai/src/tools/schemas.ts — catalog with ToolSchema type - Rewrite: proposable-tools.ts — derived from catalog instead of hardcoded array - Rewrite: services/mana-ai/src/planner/tools.ts — 277→30 lines (imports from catalog) - Simplify: webapp policy.ts — derives AUTO/PROPOSE from catalog defaultPolicy Adding a new tool now requires 2 files instead of 3-5: 1. Add schema to AI_TOOL_CATALOG (shared-ai) 2. Add execute function in the module's tools.ts (webapp) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 13:06:07 +02:00
Till JS	be81d11dc3	feat(ai): SSE streaming for foreground Mission Runner Enable real-time token streaming during the planner "calling-llm" phase so the user sees live progress ("empfange Plan… 128 tokens") instead of a static spinner. The parser still receives the full text once complete — no partial-JSON risk. Changes: - Extract shared SSE parser from playground into @mana/shared-llm/sse-parser - remote.ts: use stream:true when onToken callback is provided - AiPlanInput: add optional onToken field (shared-ai) - ai-plan task: pass onToken through to backend.generate() - runner.ts: throttled (500ms) phaseDetail updates during streaming - Playground: refactored to use shared SSE parser Also includes: AI agent architecture comparison report (docs/reports/) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 12:32:43 +02:00
Till JS	23b8cc13fb	feat(ai-tools): server-side web-research + contacts for agents Two major tool expansions — the Recherche-Agent and Today-Agent can now research the web autonomously (no browser needed), and a future Meeting-Prep agent can read + create contacts. === research_news (server-side execution) === The biggest addition: mana-ai can now call mana-api's news-research endpoints (POST /discover + /search) directly, without a browser. Infrastructure: - services/mana-ai/src/planner/news-research-client.ts — full HTTP client with discover→search pipeline. 15s/30s timeouts. Graceful null on any failure (network, mana-api down, bad response) so the tick never crashes from research errors. - config.manaApiUrl added (default http://localhost:3060); wired in docker-compose.macmini.yml as http://mana-api:3060 + depends_on mana-api with service_healthy condition. Pre-planning research step (cron/tick.ts): - Before the planner prompt is built, the tick checks if the mission's objective or conceptMarkdown matches research keywords (same RESEARCH_TRIGGER regex the webapp uses). When it matches: * NewsResearchClient.research(objective) runs discovery + search * Results are injected as a synthetic ResolvedInput with id '__web-research__' and a formatted markdown context block * The Planner then sees real article URLs/titles/excerpts and can reference them in create_note / save_news_article steps * Log line: "pre-research: N feeds, M articles" Tool registration: - research_news added to AI_PROPOSABLE_TOOL_NAMES + mana-ai tools.ts with params (query, language?, limit?). This lets the planner also explicitly propose a research step as a PlanStep (in addition to the pre-planning auto-injection). === create_contact === - Added to AI_PROPOSABLE_TOOL_NAMES + mana-ai tools.ts with params (firstName required, lastName/email/phone/company/notes optional). - Contacts are encrypted at rest; server planner can plan the step but execution stays on the webapp (same as all propose tools). Full server-side contact resolution via Key-Grant is a future enhancement. - get_contacts added to webapp AUTO_TOOLS so agents can inspect existing contacts without nagging (read-only, auto-policy). Module coverage now: ✅ todo (5) ✅ calendar (2) ✅ notes (5) ✅ places (4) ✅ drink (3) ✅ food (2) ✅ news (1) ✅ journal (1) ✅ habits (3) ✅ news-research (1) ✅ contacts (1) 11 modules, 28 tools total (17 propose, 11 auto). Tests: mana-ai 41/41 (drift-guard passes), shared-ai type-check clean, webapp svelte-check 0 errors, 0 warnings. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 12:25:45 +02:00
Till JS	1266b583e4	feat(ai-tools): unlock create_note + create_journal_entry + habit tools for agents Closes the three biggest tool-coverage gaps so the shipped agent templates can actually do their job end-to-end. Before this, the Recherche-Agent couldn't create notes (only edit), the Today-Agent couldn't create journal entries, and no habit-related tool was server-proposable at all. shared-ai (proposable-tools.ts): - create_note (notes) — key unlock: Recherche-Agent now creates per-source notes and the summary report. - create_journal_entry (journal) — key unlock: Today-Agent proposes a poem as a journal entry with optional mood. - create_habit (habits) — agent can suggest new habits. - log_habit (habits) — agent can log a habit completion for today. Organized the list with per-module section comments for readability now that we're at 15 proposable tools. mana-ai (planner/tools.ts): - 5 new tool definitions with full parameter schemas: * create_note (title, content?) * create_journal_entry (content, title?, mood? enum) * create_habit (title, icon, color) * log_habit (habitId, note?) - Drift-guard contract test passes (41/41) — confirms the mana-ai tool list is in sync with the shared-ai canonical set. Webapp (policy.ts): - get_habits added to AUTO_TOOLS (read-only; agent can inspect which habits exist without nagging the user for approval). - list_notes added to AUTO_TOOLS (was already used in the reasoning loop but missing from the explicit auto-list; the planner default fell through to 'propose' which was wasteful for a read op). Module coverage after this change: ✅ todo (5 tools) ✅ calendar (2) ✅ notes (5 incl. create) ✅ places (4) ✅ drink (3) ✅ food (2) ✅ news (1) ✅ journal (1) ✅ habits (3) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 12:00:17 +02:00
Till JS	9161c0b3ab	feat(templates): two more non-AI templates + split gallery into two sections Closes out T1 with three templates per category as discussed. The gallery now renders agent-templates and workbench-templates as two distinct labeled sections — the earlier implicit "everything's a template for an agent" framing is gone. Seed handlers (new): - apps/mana/apps/web/src/lib/modules/habits/seed.ts — title-based idempotency (there's no description column on LocalHabit). If a non-deleted habit with the same title exists, the seed is skipped. - apps/mana/apps/web/src/lib/companion/goals/seed.ts — title-based idempotency on companionGoals where status !== 'abandoned'. - Both pulled in via side-effect imports in missions/setup.ts so the handler registry is populated before any apply. New templates: - 🏋️ Fitness (wellness) — scene body/habits/stretch/sleep + 3 habit seeds (Täglich 30min Bewegung, 3× Woche Training, 2L Wasser) + 1 goal seed (3 Workouts pro Woche). No agent. - 💻 Deep Work (work) — scene todo/calendar/notes/times + 2 habit seeds (1 wichtigste Aufgabe pro Tag, 4h Deep Work pro Tag) + 1 goal seed (20h Deep Work pro Woche). No agent. Gallery two-section layout: - Title "Templates" (not "Agent-Templates") — broader framing. - Section 1: "🤖 Agent-Templates" — filters ALL_TEMPLATES where category ∈ {'ai','delight'}: Recherche-Agent, Kontext-Agent, Today-Agent. - Section 2: "🎨 Workbench-Templates" — filters to the rest: Calmness, Fitness, Deep Work. - Each section gets a short intro paragraph so users understand the distinction before scanning the cards. - Cards themselves unchanged; rendering extracted into a {#snippet templateCard(t)} shared between both sections. - Per-category arrays computed once at module-load time (const in <script>); no per-render filter cost. Result: each section has 3 templates, categorised by "does this create an AI agent" rather than by use-case. Keeps the separation honest — Agent-Templates set up autonomous work; Workbench-Templates set up the user's own workspace. Tests: shared-ai 26/26, webapp svelte-check 0 errors, 0 warnings. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 11:45:40 +02:00
Till JS	a08e45ca16	feat(templates): generalise to WorkbenchTemplate + ship Calmness pilot (T1) First pass of the workbench-templates plan (docs/plans/workbench- templates.md) — templates are no longer agent-centric but a general "starter kit" bundle: optional agent + optional scene + optional missions + optional per-module seeds. Pilot non-AI template "Calmness" ships alongside. Shape generalisation (packages/shared-ai/src/agents/templates/types.ts): - AgentTemplate renamed to WorkbenchTemplate; all fields now optional (agent, scene, missions, seeds). Back-compat AgentTemplate alias kept so research/context/today keep compiling. - Added `category: 'ai'\|'wellness'\|'work'\|'lifeEvent'\|'delight'` + `icon` (for non-agent templates that have no avatar) + `version` field (for future update-detection). - New WorkbenchTemplateSeedItem shape: `{stableId?, data: unknown}`. Module-specific seed payloads are typed at the handler side. - Existing three AI templates nachgezogen: category='ai' (or 'delight' for today-agent), icon, version='1'. Seed infrastructure: - apps/mana/apps/web/src/lib/data/ai/agents/seed-registry.ts — in- memory handler map keyed by module name; module-local seed.ts files register themselves at import time. - apps/mana/apps/web/src/lib/modules/meditate/seed.ts — first handler: createPreset-based, idempotent via stableId embedded as HTML comment in the preset description (T1 pragmatism; T2 adds a proper column on the preset schema). - data/ai/missions/setup.ts pulls `import '$lib/modules/meditate/seed'` so the handler is registered before any template is applied. Applicator upgrades (data/ai/agents/apply-template.ts): - Agent step now optional — skipped cleanly when template has no agent part. - New step 4: seeds. Walks template.seeds, looks up the handler for each module, aggregates per-item outcomes (created/skipped-exists/ failed) into result.seedOutcomes. Missing handler = warning, not fatal. Crypto/encryption unchanged — seeds go through the same module stores that module code already uses. - Result shape gains `seedOutcomes: Record<string, SeedOutcome[]>` so the gallery can show "3 new, 1 already there". Calmness pilot (packages/shared-ai/src/agents/templates/calmness.ts): - category='wellness', NO agent, scene with meditate/mood/journal/ sleep apps, two meditate preset seeds: * 4-7-8 Atmung (breathing preset) * Body-Scan 10min (bodyscan preset with 9 scan steps) - Each seed has a stableId so re-apply is idempotent. Gallery updates (routes/(app)/agents/templates/+page.svelte): - Card avatar falls back to t.icon when no agent. "Agent" chip shows only for agent-templates; "N Seeds" chip shows for templates with seeds. - Detail header shows "Workbench-Setup ohne AI-Agent" when no agent. - New "Seeds" preview section: lists per-module counts + item names. - Options section gains a "Seed-Daten in Module einpflegen" checkbox. - Success panel shows seed summary: "3 Seeds neu, 1 bereits vorhanden". Tests: shared-ai 26/26, webapp svelte-check 0 errors, 0 warnings. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 01:07:41 +02:00
Till JS	8a5d200c84	fix(ai): bump planner maxTokens 1024→4096 + teach prompt about the loop Debug log from a "tag 4 notes" mission showed the planner's second-round response truncated mid-step: it was proposing one add_tag_to_note per listed note but ran out of tokens halfway through note #2. Parser rejected the malformed JSON → loop exited with 0 staged, user saw nothing to approve. Raising maxTokens to 4096 fits ~15-20 step objects, which covers the batch-tagging / batch-save pattern the reasoning loop is designed for. Also updating the system prompt so the planner actually knows about the loop it's running inside: read-only tools are announced as auto-executing with outputs visible next turn, and a new rule makes explicit that batch jobs must emit all write-steps in one plan (because staging a propose-tool ends the turn). Step count raised 1-5 → 1-10. Prompt snapshot tests still pass (they check structure, not text). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 00:55:18 +02:00
Till JS	7822340ea0	feat(ai-agents): Template gallery — 3 ready-to-use agent bundles First pass of the Multi-Agent discoverability UX. A new /agents/ templates route showcases pre-configured agents; clicking one creates agent + scene + starter mission(s) as a single bundle. Addresses the "blank form anxiety" + "user doesn't know what agents are for" observations from the UX brainstorm. Three templates for v1 (shared-ai/src/agents/templates/): - 🔍 Recherche-Agent — reads sources one by one, writes a note per source, summarizes into a report. Manual-cadence mission; all writes propose so user curates. - 🧭 Kontext-Agent — learns about the user via a weekly check-in. Reads kontext/notes/goals, asks 2-3 questions, proposes a diff- style context update. Weekly Sunday cadence. - 🌅 Today-Agent — researches "on this day" history each morning, writes a 4-8 line German poem, proposes a journal note. Daily 7am cadence. A "delight" agent, not a productive one. Each template packs (agent config, scene layout, starter mission): - AgentTemplate type lives in @mana/shared-ai — pure data, no runtime imports. Adding a new template = drop a file in templates/ and extend ALL_TEMPLATES. - Template-specific policies derive from the proposable-tool list so drift-guard catches divergence from the canonical set. - Starter missions default to startPaused=true — user sees the mission ready-to-go and hits Play when ready. Prevents surprise autonomous work on first apply. Applicator (data/ai/agents/apply-template.ts): - Creates agent → scene (if template defines one) → missions in order. Agent failure = abort; scene/mission failures surface as warnings in the result without blocking. - Duplicate-name handling: falls through to findByName, returns existing agent with wasExisting=true; scene is skipped in that case to avoid clone-proliferation. Gallery page /(app)/agents/templates/+page.svelte: - Three large cards side-by-side (stacks on mobile) with avatar / label / tagline / meta chips (Scene, N Missionen). - Click opens detail panel with full description, scene preview (app-ids + widths), mission preview (title / objective / cadence), and override checkboxes (create scene, create missions, start active vs paused). - Success panel shows what landed with warnings inline; CTA back to workbench. Discoverability in /ai-agents module: - Bar now has two buttons: "Aus Template" (primary, goto templates route) + "Eigener Agent" (secondary, opens the existing blank-form create mode). - When only the default "Mana" agent exists, render a dashed promo banner at the top linking to the template gallery. Disappears as soon as the user has a second agent. Tests: webapp svelte-check 0 errors, 0 warnings. shared-ai 26/26. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 00:36:39 +02:00
Till JS	4d9b16a683	feat(notes): list + update + append + add_tag tools for the AI Makes the "read all notes and tag them #Natur/#Technologie/…" use case fully functional. Four new ModuleTool entries in notes/tools.ts: - list_notes(limit?, query?, includeArchived?) — auto, read-only. Returns id + title + excerpt so the planner can reference concrete notes without dumping full bodies. - update_note(noteId, title?, content?) — proposable. Destructive full overwrite. Docstring nudges toward append_to_note when applicable. - append_to_note(noteId, content) — proposable, non-destructive. Handles the trailing-newline separator so markdown stays clean. - add_tag_to_note(noteId, tag) — proposable, idempotent, case-insensitive. Strips leading #, replaces spaces with _, skips if already present. Exactly the categorization primitive the user asked for. All three writes are added to AI_PROPOSABLE_TOOL_NAMES so both the webapp policy and mana-ai's boot-time drift guard agree (now 11 tools). Mirrored in services/mana-ai/src/planner/tools.ts. AiProposalInbox mounted on /notes so approvals land inline in the notes module too (already appears in the mission-detail cross-module inbox via the earlier commit). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 00:24:48 +02:00
Till JS	bc77b36234	feat(agents): Agent CRUD + default bootstrap + Mission.agentId (Phase 2) Second phase of the Multi-Agent Workbench rollout (docs/plans/ multi-agent-workbench.md). Builds on Phase 1's identity-aware Actor. Adds the Agent primitive — a named AI persona that owns Missions, carries its own policy + memory, and (from Phase 3 on) drives the Workbench lens. Everything is wired; a single user currently has one "Mana" default agent until the UI (Phase 5) lets them create more. Shared types (@mana/shared-ai): - agents/types.ts: Agent, AgentState, DEFAULT_AGENT_ID/NAME constants - policy/types.ts: AiPolicy + PolicyDecision (moved from webapp so Agent.policy can reference it without a runtime dep on the web app) - missions/types.ts: new optional Mission.agentId field Webapp data layer: - data/ai/agents/{types,store,queries,bootstrap}.ts - Dexie schema v19 adds `agents` table (indexes on state, name, [state+name]); sync registered under the existing ai app-id - Encryption registry: agents.systemPrompt + agents.memory encrypted; name/role/avatar/policy stay plaintext for search + UI rendering - DuplicateAgentNameError thrown at write time (not a Dexie unique index — bootstrap races between tabs would otherwise hit ConstraintError; store now resolves via getOrCreateAgent) - bootstrap.ts: ensureDefaultAgent + backfillMissionsAgentId. The backfill runs once per device (localStorage sentinel) so missions that pre-date the rollout get stamped with the default agent's id. Called fire-and-forget from startMissionTick() during layout init. Runner threading (already merged into `d5c351d63` via Till's debug-log commit that picked up my uncommitted edits): - runner.ts + server-iteration-staging.ts now resolve mission.agentId to the real Agent and build makeAgentActor with agent.name as displayName. Missing-agent fallback keeps using LEGACY_AI_PRINCIPAL so historical writes still attribute cleanly. Tests: shared-ai 26/26, mana-ai 35/35, svelte-check 0 errors. Agent store vitest suite is present but blocked by a pre-existing \$lib alias resolution issue in the webapp vitest config that predates this phase (proposals/store.test.ts is broken the same way on HEAD). Will address separately. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 20:35:49 +02:00
Till JS	d5c351d63e	feat(ai): per-iteration debug log — capture prompt + response + inputs New local-only Dexie table _aiDebugLog (v20, never synced) holds one row per mission iteration with the full system+user prompt, raw LLM response, latency, every ResolvedInput the planner saw, and pre-step state (kontext-injected? web-research-ok-or-error?). Capped at 50 newest rows. aiPlanTask always returns the captured prompt/response on AiPlanOutput. debug; the runner persists it only when isAiDebugEnabled() — toggled via a checkbox in the Mission detail header (defaults to on in DEV builds, off in prod, override via localStorage 'mana.ai.debug'). New <AiDebugBlock> component renders below each iteration card: expandable sections for Pre-Step, Resolved Inputs (each input individually collapsible), System Prompt, User Prompt, Raw Response, plus a "📋 JSON" copy-to-clipboard button for bug reports. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 20:33:17 +02:00
Till JS	1771063df4	refactor(actor): identity-aware Actor for Multi-Agent Workbench (Phase 1) Foundation for the Multi-Agent Workbench roadmap (docs/plans/multi-agent-workbench.md). Every event, record, and sync_changes row now carries a principal identity + cached display name in addition to the three-kind discriminator. Shape change (source of truth in @mana/shared-ai): Before: { kind: 'user' \| 'ai' \| 'system', ...kind-specific fields } After: discriminated union on kind, with - common: principalId, displayName - 'user': principalId = userId - 'ai': principalId = agentId + missionId/iterationId/rationale - 'system': principalId = one of SYSTEM_* sentinel strings ('system:projection', 'system:mission-runner', etc.) Key design calls (from the plan's Q&A): - System sub-sources get distinct principalIds (not a shared 'system' bucket) — lets Workbench filter + revert distinguish projection writes from migration writes from server-iteration writes - displayName cached on the record so renaming an agent doesn't rewrite history - normalizeActor() compat shim fills principalId/displayName on legacy rows with 'legacy:*' sentinels so historical events never crash the timeline New exports: - BaseActor / UserActor / AiActor / SystemActor (narrowed types) - makeUserActor, makeAgentActor, makeSystemActor (factories with typed return) - SYSTEM_PROJECTION, SYSTEM_RULE, SYSTEM_MIGRATION, SYSTEM_STREAM, SYSTEM_MISSION_RUNNER (principalId constants) - LEGACY_USER_PRINCIPAL, LEGACY_AI_PRINCIPAL, LEGACY_SYSTEM_PRINCIPAL - isUserActor / isFromMissionRunner predicates Webapp: - data/events/actor.ts now re-exports from shared-ai, keeps runtime ambient-context (runAs, getCurrentActor) local - bindDefaultUser(userId, displayName) lets the auth layer replace the legacy placeholder with the real logged-in user actor at login - Mission runner + server-iteration-staging stamp LEGACY_AI_PRINCIPAL as the agentId placeholder — Phase 2 will thread the real agent - Streaks projection uses makeSystemActor(SYSTEM_PROJECTION) - All test fixtures migrated to factories Service: - mana-ai/db/iteration-writer.ts stamps makeSystemActor( SYSTEM_MISSION_RUNNER) instead of the old { kind:'system', source:'mission-runner' } shape. Phase 3 will switch this to an agent actor per mission. Tests: 26 shared-ai + 21 webapp vitest + 35 mana-ai — all green. svelte-check: 0 errors, 0 warnings. No behavior change; purely a type + shape upgrade. Old sync_changes rows parse via the normalizeActor compat shim at read time. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 20:13:57 +02:00
Till JS	fdb8e60d07	feat(ai): web-research pre-step + auto-kontext + save_news_article tool Mission objectives matching /recherch\|research\|news\|finde\|suche\|aktuelle\|neueste/i trigger a synchronous deep-research call (mana-search + mana-llm via the existing /api/v1/research/start-sync pipeline) before the planner runs; the summary plus top-8 source URLs are injected as a synthetic ResolvedInput so the planner can stage save_news_article proposals against real URLs. The kontext singleton is auto-attached to every mission's planner input (decrypted client-side, gated on non-empty content + not already linked). save_news_article is a new proposable tool routed through articlesStore .saveFromUrl (Readability via /api/v1/news/extract/save). AiProposalInbox mounted on /news so the user can approve/reject inline. mana-ai planner tool list mirrors the new tool to keep the boot-time drift guard happy. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 19:10:13 +02:00
Till JS	2497a65937	feat(ai-missions): richer error surfacing + retry button on failed runs Replaces the single-line summary ("Planner failed: fetch …") with full diagnostic detail: error name + message + last-active phase + stack trace, all persisted onto the iteration itself. UI expands a collapsed details block next to each failed iteration, so the user can see where it broke ("TypeError in calling-llm") without opening DevTools. Paired with a one-click Retry button that re-runs the mission under the same config — useful while debugging a flaky backend (GPU server down, Gemini quota, etc.). - `packages/shared-ai/src/missions/types.ts` — new `MissionIteration.errorDetails: { name, message, phase?, stack? }` - `finishIteration` accepts the field, deep-clones it, and also now clears the transient phase markers (currentPhase/phaseStartedAt/ phaseDetail/cancelRequested) whenever an iteration finalises — keeps the schema honest (phases are sub-state of \`running\` only). - `runMission` tracks \`lastPhase\` via a new \`enterPhase\` helper that wraps setIterationPhase. The catch handler populates errorDetails with lastPhase + message + stack. - ListView: \`<details>\` block under each failed iteration + Retry button (disabled while another run is in-flight). 77/77 webapp tests still green; svelte-check clean. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 14:37:15 +02:00
Till JS	ef47adb7d7	feat(ai-missions): live phase + elapsed + cancel for running iterations Closes the "iteration is running, no feedback" black hole. The user now sees, per running iteration: ⏳ Frage Planner · frage Planner an ⏱ 23s [Abbrechen] Phases (\`IterationPhase\`): resolving-inputs → calling-llm → parsing-response → staging-proposals → finalizing The runner advances through these via \`setIterationPhase\` between each await, writing currentPhase + phaseDetail + phaseStartedAt onto the iteration. UI reads them via Dexie liveQuery — no polling. Cancel: - \`requestIterationCancel\` writes cancelRequested=true on the iteration - runner polls \`isCancelRequested\` between every phase + per stage step - cancellation finalises as \`failed\` with summary \`'cancelled by user'\` - UI button is disabled + relabelled "Wird abgebrochen…" until the next poll picks it up Hard timeout: 90 s wall-clock per iteration via Promise.race against a CancelledError. Wedged backends (e.g. flaky mana-llm) fail fast with "timeout after 90s" instead of sitting in \`running\` forever. Elapsed counter is a \$state variable ticking once a second, scoped to the ListView component — Dexie isn't touched. Auto-cleaned on component destroy. shared-ai re-exports \`IterationPhase\` so server-side mana-ai can inspect the same phase enum (no consumer there yet, but the type is ready for the run-status endpoint planned in HEALTH page). 77/77 webapp tests still green; svelte-check clean. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 14:15:48 +02:00
Till JS	6882ffb626	feat(shared-ai): Mission Key-Grant contract + plan for encrypted server-side runs Foundation for Phase 2+ of the Mission Key-Grant flow: lets mana-ai execute missions that depend on encrypted inputs (notes/tasks/events/ journal/kontext) without needing an open browser tab. Opt-in per mission, Zero-Knowledge users excluded. - Canonical HKDF-SHA256 derivation (scope-bound via tables + recordIds in the HKDF info string → scope changes invalidate the grant cryptographically, not just via a runtime check) - Mission.grant field on the shared Mission type - Golden snapshot + drift-guard test so webapp wrap path and mana-auth wrap endpoint can't silently diverge - Ideas backlog at docs/future/AI_AGENTS_IDEAS.md - Full rollout plan at docs/plans/ai-mission-key-grant.md - COMPANION_BRAIN_ARCHITECTURE.md §21 captures the flow + privacy guarantees + non-goals Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 13:41:35 +02:00
Till JS	4be5e29bd3	feat(shared-ai): canonical proposable-tool list + drift guard on mana-ai Makes the webapp's AI policy and the server's tool allow-list physically impossible to drift. Adds the missing entries the guard caught on first run: `complete_tasks_by_title`, `visit_place`, `undo_drink` now have parameter schemas server-side too. - `packages/shared-ai/src/policy/proposable-tools.ts` - `AI_PROPOSABLE_TOOL_NAMES` as `const` array + literal union type - `AI_PROPOSABLE_TOOL_SET` for set-membership checks - Webapp `DEFAULT_AI_POLICY` derives its `propose` entries from the shared list via `Object.fromEntries(...)` — adding a tool there is now a one-line change in `@mana/shared-ai` - mana-ai `AI_AVAILABLE_TOOLS`: module-load assertion compares its hardcoded names against `AI_PROPOSABLE_TOOL_SET` and throws with a pointed error on drift (extras in one direction, missing in the other). Service refuses to start on mismatch — better than silent degradation. - Bun test (`tools.test.ts`) runs the same contract plus sanity checks (non-empty description, required params carry docs). Vitest policy test adds the symmetric check on the webapp side. All three runtimes now green: webapp 66/66, shared-ai 2/2, mana-ai 9/9 Bun tests. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 00:52:38 +02:00
Till JS	5e01763caa	feat(ai): close the loop — server write-back + webapp staging effect Completes the off-tab AI pipeline. mana-ai now writes produced plans back to `sync_changes` as a server-sourced Mission iteration; the webapp picks it up on next sync and translates each PlanStep into a local Proposal via the existing createProposal flow. User sees the resulting ghost cards in the matching module's AiProposalInbox with full mission attribution. Server (mana-ai v0.3): - `db/connection.ts` — `withUser(sql, userId, fn)` RLS-scoped tx helper mirroring the Go `withUser` pattern (SET LOCAL app.current_user_id) - `db/iteration-writer.ts` - `planToIteration(plan, id, now)` — shared-ai AiPlanOutput → inline MissionIteration with `source: 'server'` + status='awaiting-review' - `appendServerIteration(sql, input)` — INSERT sync_changes row with op=update, data={iterations: [...]} + field_timestamps + actor JSONB={kind:'system', source:'mission-runner'} - `cron/tick.ts` — after parse success: build iteration, append to mission.iterations, persist via appendServerIteration. Stats now include `plansWrittenBack`. Actor union: - `packages/shared-ai/src/actor.ts` + webapp actor: `system.source` gains `'mission-runner'` so the server's own writes are attributed correctly and distinguishable from projection/rule writes Webapp: - `data/ai/missions/server-iteration-staging.ts` - `startServerIterationStaging()` subscribes to aiMissions via Dexie liveQuery; on each Mission update, walks iterations looking for `source='server'` entries that haven't been staged yet - For each such iteration: creates a Proposal per PlanStep under `{kind:'ai', missionId, iterationId, rationale}` so policy + hooks fire correctly - Writes proposalIds back into plan[].proposalId + status='staged' so other tabs and app restarts skip re-staging - Idempotent: in-memory `processedIterations` Set + durable proposalId marker - Wired into (app)/+layout.svelte alongside startMissionTick - 3 unit tests: translate server iteration → proposal, skip already-staged, ignore browser iterations Full pipeline now: user creates Mission in /companion/missions → mana-ai tick picks it up → calls mana-llm → parses plan → writes iteration → synced to webapp → staging effect creates proposals → user approves in /todo (or any module) → task lands with `{actor: ai, missionId, iterationId, rationale}` attribution. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 00:29:30 +02:00
Till JS	0d90b12d1c	feat(shared-ai): extract planner + mission types to @mana/shared-ai Single source of truth for AI Workbench types shared between the webapp (Vite/SvelteKit) and the server-side mana-ai Bun service. Prevents the two runtimes from drifting on prompt shape or mission structure. - `@mana/shared-ai` package: - `actor.ts` — Actor union (user \| ai \| system) + helpers, mirrors the webapp's runtime type so server-side consumers parse incoming actors without re-declaring - `missions/types.ts` — Mission, MissionCadence, MissionInputRef, MissionIteration, PlanStep, MissionState. Adds optional `iteration.source: 'browser' \| 'server'` to distinguish foreground vs server-produced iterations (groundwork for proposal write-back) - `planner/prompt.ts` — `buildPlannerPrompt` pure function - `planner/parser.ts` — `parsePlannerResponse` strict JSON validator - Vitest smoke tests (2) cover prompt → parse round-trip + unknown- tool rejection - Webapp: - `missions/types.ts` re-exports from shared-ai, keeps webapp-local `MISSIONS_TABLE` constant + `planStepStatusFromProposal` bridge - `missions/planner/{types,prompt,parser}.ts` become re-export stubs so existing imports keep working unchanged - Existing webapp tests (60) continue to pass — the wire code didn't move, just its home Next: mana-ai service imports buildPlannerPrompt/parsePlannerResponse from shared-ai + wires mana-llm + writes iteration back as a 'source=server' row (tracked in services/mana-ai/CLAUDE.md). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 00:01:57 +02:00

23 commits