managarten

mirror of https://github.com/Memo-2023/mana-monorepo.git synced 2026-05-16 03:19:40 +02:00

Author	SHA1	Message	Date
Till JS	bc77b36234	feat(agents): Agent CRUD + default bootstrap + Mission.agentId (Phase 2) Second phase of the Multi-Agent Workbench rollout (docs/plans/ multi-agent-workbench.md). Builds on Phase 1's identity-aware Actor. Adds the Agent primitive — a named AI persona that owns Missions, carries its own policy + memory, and (from Phase 3 on) drives the Workbench lens. Everything is wired; a single user currently has one "Mana" default agent until the UI (Phase 5) lets them create more. Shared types (@mana/shared-ai): - agents/types.ts: Agent, AgentState, DEFAULT_AGENT_ID/NAME constants - policy/types.ts: AiPolicy + PolicyDecision (moved from webapp so Agent.policy can reference it without a runtime dep on the web app) - missions/types.ts: new optional Mission.agentId field Webapp data layer: - data/ai/agents/{types,store,queries,bootstrap}.ts - Dexie schema v19 adds `agents` table (indexes on state, name, [state+name]); sync registered under the existing ai app-id - Encryption registry: agents.systemPrompt + agents.memory encrypted; name/role/avatar/policy stay plaintext for search + UI rendering - DuplicateAgentNameError thrown at write time (not a Dexie unique index — bootstrap races between tabs would otherwise hit ConstraintError; store now resolves via getOrCreateAgent) - bootstrap.ts: ensureDefaultAgent + backfillMissionsAgentId. The backfill runs once per device (localStorage sentinel) so missions that pre-date the rollout get stamped with the default agent's id. Called fire-and-forget from startMissionTick() during layout init. Runner threading (already merged into `d5c351d63` via Till's debug-log commit that picked up my uncommitted edits): - runner.ts + server-iteration-staging.ts now resolve mission.agentId to the real Agent and build makeAgentActor with agent.name as displayName. Missing-agent fallback keeps using LEGACY_AI_PRINCIPAL so historical writes still attribute cleanly. Tests: shared-ai 26/26, mana-ai 35/35, svelte-check 0 errors. Agent store vitest suite is present but blocked by a pre-existing \$lib alias resolution issue in the webapp vitest config that predates this phase (proposals/store.test.ts is broken the same way on HEAD). Will address separately. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 20:35:49 +02:00
Till JS	d5c351d63e	feat(ai): per-iteration debug log — capture prompt + response + inputs New local-only Dexie table _aiDebugLog (v20, never synced) holds one row per mission iteration with the full system+user prompt, raw LLM response, latency, every ResolvedInput the planner saw, and pre-step state (kontext-injected? web-research-ok-or-error?). Capped at 50 newest rows. aiPlanTask always returns the captured prompt/response on AiPlanOutput. debug; the runner persists it only when isAiDebugEnabled() — toggled via a checkbox in the Mission detail header (defaults to on in DEV builds, off in prod, override via localStorage 'mana.ai.debug'). New <AiDebugBlock> component renders below each iteration card: expandable sections for Pre-Step, Resolved Inputs (each input individually collapsible), System Prompt, User Prompt, Raw Response, plus a "📋 JSON" copy-to-clipboard button for bug reports. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 20:33:17 +02:00
Till JS	1771063df4	refactor(actor): identity-aware Actor for Multi-Agent Workbench (Phase 1) Foundation for the Multi-Agent Workbench roadmap (docs/plans/multi-agent-workbench.md). Every event, record, and sync_changes row now carries a principal identity + cached display name in addition to the three-kind discriminator. Shape change (source of truth in @mana/shared-ai): Before: { kind: 'user' \| 'ai' \| 'system', ...kind-specific fields } After: discriminated union on kind, with - common: principalId, displayName - 'user': principalId = userId - 'ai': principalId = agentId + missionId/iterationId/rationale - 'system': principalId = one of SYSTEM_* sentinel strings ('system:projection', 'system:mission-runner', etc.) Key design calls (from the plan's Q&A): - System sub-sources get distinct principalIds (not a shared 'system' bucket) — lets Workbench filter + revert distinguish projection writes from migration writes from server-iteration writes - displayName cached on the record so renaming an agent doesn't rewrite history - normalizeActor() compat shim fills principalId/displayName on legacy rows with 'legacy:*' sentinels so historical events never crash the timeline New exports: - BaseActor / UserActor / AiActor / SystemActor (narrowed types) - makeUserActor, makeAgentActor, makeSystemActor (factories with typed return) - SYSTEM_PROJECTION, SYSTEM_RULE, SYSTEM_MIGRATION, SYSTEM_STREAM, SYSTEM_MISSION_RUNNER (principalId constants) - LEGACY_USER_PRINCIPAL, LEGACY_AI_PRINCIPAL, LEGACY_SYSTEM_PRINCIPAL - isUserActor / isFromMissionRunner predicates Webapp: - data/events/actor.ts now re-exports from shared-ai, keeps runtime ambient-context (runAs, getCurrentActor) local - bindDefaultUser(userId, displayName) lets the auth layer replace the legacy placeholder with the real logged-in user actor at login - Mission runner + server-iteration-staging stamp LEGACY_AI_PRINCIPAL as the agentId placeholder — Phase 2 will thread the real agent - Streaks projection uses makeSystemActor(SYSTEM_PROJECTION) - All test fixtures migrated to factories Service: - mana-ai/db/iteration-writer.ts stamps makeSystemActor( SYSTEM_MISSION_RUNNER) instead of the old { kind:'system', source:'mission-runner' } shape. Phase 3 will switch this to an agent actor per mission. Tests: 26 shared-ai + 21 webapp vitest + 35 mana-ai — all green. svelte-check: 0 errors, 0 warnings. No behavior change; purely a type + shape upgrade. Old sync_changes rows parse via the normalizeActor compat shim at read time. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 20:13:57 +02:00
Till JS	fdb8e60d07	feat(ai): web-research pre-step + auto-kontext + save_news_article tool Mission objectives matching /recherch\|research\|news\|finde\|suche\|aktuelle\|neueste/i trigger a synchronous deep-research call (mana-search + mana-llm via the existing /api/v1/research/start-sync pipeline) before the planner runs; the summary plus top-8 source URLs are injected as a synthetic ResolvedInput so the planner can stage save_news_article proposals against real URLs. The kontext singleton is auto-attached to every mission's planner input (decrypted client-side, gated on non-empty content + not already linked). save_news_article is a new proposable tool routed through articlesStore .saveFromUrl (Readability via /api/v1/news/extract/save). AiProposalInbox mounted on /news so the user can approve/reject inline. mana-ai planner tool list mirrors the new tool to keep the boot-time drift guard happy. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 19:10:13 +02:00
Till JS	2497a65937	feat(ai-missions): richer error surfacing + retry button on failed runs Replaces the single-line summary ("Planner failed: fetch …") with full diagnostic detail: error name + message + last-active phase + stack trace, all persisted onto the iteration itself. UI expands a collapsed details block next to each failed iteration, so the user can see where it broke ("TypeError in calling-llm") without opening DevTools. Paired with a one-click Retry button that re-runs the mission under the same config — useful while debugging a flaky backend (GPU server down, Gemini quota, etc.). - `packages/shared-ai/src/missions/types.ts` — new `MissionIteration.errorDetails: { name, message, phase?, stack? }` - `finishIteration` accepts the field, deep-clones it, and also now clears the transient phase markers (currentPhase/phaseStartedAt/ phaseDetail/cancelRequested) whenever an iteration finalises — keeps the schema honest (phases are sub-state of \`running\` only). - `runMission` tracks \`lastPhase\` via a new \`enterPhase\` helper that wraps setIterationPhase. The catch handler populates errorDetails with lastPhase + message + stack. - ListView: \`<details>\` block under each failed iteration + Retry button (disabled while another run is in-flight). 77/77 webapp tests still green; svelte-check clean. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 14:37:15 +02:00
Till JS	ef47adb7d7	feat(ai-missions): live phase + elapsed + cancel for running iterations Closes the "iteration is running, no feedback" black hole. The user now sees, per running iteration: ⏳ Frage Planner · frage Planner an ⏱ 23s [Abbrechen] Phases (\`IterationPhase\`): resolving-inputs → calling-llm → parsing-response → staging-proposals → finalizing The runner advances through these via \`setIterationPhase\` between each await, writing currentPhase + phaseDetail + phaseStartedAt onto the iteration. UI reads them via Dexie liveQuery — no polling. Cancel: - \`requestIterationCancel\` writes cancelRequested=true on the iteration - runner polls \`isCancelRequested\` between every phase + per stage step - cancellation finalises as \`failed\` with summary \`'cancelled by user'\` - UI button is disabled + relabelled "Wird abgebrochen…" until the next poll picks it up Hard timeout: 90 s wall-clock per iteration via Promise.race against a CancelledError. Wedged backends (e.g. flaky mana-llm) fail fast with "timeout after 90s" instead of sitting in \`running\` forever. Elapsed counter is a \$state variable ticking once a second, scoped to the ListView component — Dexie isn't touched. Auto-cleaned on component destroy. shared-ai re-exports \`IterationPhase\` so server-side mana-ai can inspect the same phase enum (no consumer there yet, but the type is ready for the run-status endpoint planned in HEALTH page). 77/77 webapp tests still green; svelte-check clean. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 14:15:48 +02:00
Till JS	6882ffb626	feat(shared-ai): Mission Key-Grant contract + plan for encrypted server-side runs Foundation for Phase 2+ of the Mission Key-Grant flow: lets mana-ai execute missions that depend on encrypted inputs (notes/tasks/events/ journal/kontext) without needing an open browser tab. Opt-in per mission, Zero-Knowledge users excluded. - Canonical HKDF-SHA256 derivation (scope-bound via tables + recordIds in the HKDF info string → scope changes invalidate the grant cryptographically, not just via a runtime check) - Mission.grant field on the shared Mission type - Golden snapshot + drift-guard test so webapp wrap path and mana-auth wrap endpoint can't silently diverge - Ideas backlog at docs/future/AI_AGENTS_IDEAS.md - Full rollout plan at docs/plans/ai-mission-key-grant.md - COMPANION_BRAIN_ARCHITECTURE.md §21 captures the flow + privacy guarantees + non-goals Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 13:41:35 +02:00
Till JS	4be5e29bd3	feat(shared-ai): canonical proposable-tool list + drift guard on mana-ai Makes the webapp's AI policy and the server's tool allow-list physically impossible to drift. Adds the missing entries the guard caught on first run: `complete_tasks_by_title`, `visit_place`, `undo_drink` now have parameter schemas server-side too. - `packages/shared-ai/src/policy/proposable-tools.ts` - `AI_PROPOSABLE_TOOL_NAMES` as `const` array + literal union type - `AI_PROPOSABLE_TOOL_SET` for set-membership checks - Webapp `DEFAULT_AI_POLICY` derives its `propose` entries from the shared list via `Object.fromEntries(...)` — adding a tool there is now a one-line change in `@mana/shared-ai` - mana-ai `AI_AVAILABLE_TOOLS`: module-load assertion compares its hardcoded names against `AI_PROPOSABLE_TOOL_SET` and throws with a pointed error on drift (extras in one direction, missing in the other). Service refuses to start on mismatch — better than silent degradation. - Bun test (`tools.test.ts`) runs the same contract plus sanity checks (non-empty description, required params carry docs). Vitest policy test adds the symmetric check on the webapp side. All three runtimes now green: webapp 66/66, shared-ai 2/2, mana-ai 9/9 Bun tests. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 00:52:38 +02:00
Till JS	5e01763caa	feat(ai): close the loop — server write-back + webapp staging effect Completes the off-tab AI pipeline. mana-ai now writes produced plans back to `sync_changes` as a server-sourced Mission iteration; the webapp picks it up on next sync and translates each PlanStep into a local Proposal via the existing createProposal flow. User sees the resulting ghost cards in the matching module's AiProposalInbox with full mission attribution. Server (mana-ai v0.3): - `db/connection.ts` — `withUser(sql, userId, fn)` RLS-scoped tx helper mirroring the Go `withUser` pattern (SET LOCAL app.current_user_id) - `db/iteration-writer.ts` - `planToIteration(plan, id, now)` — shared-ai AiPlanOutput → inline MissionIteration with `source: 'server'` + status='awaiting-review' - `appendServerIteration(sql, input)` — INSERT sync_changes row with op=update, data={iterations: [...]} + field_timestamps + actor JSONB={kind:'system', source:'mission-runner'} - `cron/tick.ts` — after parse success: build iteration, append to mission.iterations, persist via appendServerIteration. Stats now include `plansWrittenBack`. Actor union: - `packages/shared-ai/src/actor.ts` + webapp actor: `system.source` gains `'mission-runner'` so the server's own writes are attributed correctly and distinguishable from projection/rule writes Webapp: - `data/ai/missions/server-iteration-staging.ts` - `startServerIterationStaging()` subscribes to aiMissions via Dexie liveQuery; on each Mission update, walks iterations looking for `source='server'` entries that haven't been staged yet - For each such iteration: creates a Proposal per PlanStep under `{kind:'ai', missionId, iterationId, rationale}` so policy + hooks fire correctly - Writes proposalIds back into plan[].proposalId + status='staged' so other tabs and app restarts skip re-staging - Idempotent: in-memory `processedIterations` Set + durable proposalId marker - Wired into (app)/+layout.svelte alongside startMissionTick - 3 unit tests: translate server iteration → proposal, skip already-staged, ignore browser iterations Full pipeline now: user creates Mission in /companion/missions → mana-ai tick picks it up → calls mana-llm → parses plan → writes iteration → synced to webapp → staging effect creates proposals → user approves in /todo (or any module) → task lands with `{actor: ai, missionId, iterationId, rationale}` attribution. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 00:29:30 +02:00
Till JS	0d90b12d1c	feat(shared-ai): extract planner + mission types to @mana/shared-ai Single source of truth for AI Workbench types shared between the webapp (Vite/SvelteKit) and the server-side mana-ai Bun service. Prevents the two runtimes from drifting on prompt shape or mission structure. - `@mana/shared-ai` package: - `actor.ts` — Actor union (user \| ai \| system) + helpers, mirrors the webapp's runtime type so server-side consumers parse incoming actors without re-declaring - `missions/types.ts` — Mission, MissionCadence, MissionInputRef, MissionIteration, PlanStep, MissionState. Adds optional `iteration.source: 'browser' \| 'server'` to distinguish foreground vs server-produced iterations (groundwork for proposal write-back) - `planner/prompt.ts` — `buildPlannerPrompt` pure function - `planner/parser.ts` — `parsePlannerResponse` strict JSON validator - Vitest smoke tests (2) cover prompt → parse round-trip + unknown- tool rejection - Webapp: - `missions/types.ts` re-exports from shared-ai, keeps webapp-local `MISSIONS_TABLE` constant + `planStepStatusFromProposal` bridge - `missions/planner/{types,prompt,parser}.ts` become re-export stubs so existing imports keep working unchanged - Existing webapp tests (60) continue to pass — the wire code didn't move, just its home Next: mana-ai service imports buildPlannerPrompt/parsePlannerResponse from shared-ai + wires mana-llm + writes iteration back as a 'source=server' row (tracked in services/mana-ai/CLAUDE.md). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 00:01:57 +02:00

10 commits