mirror of https://github.com/Memo-2023/mana-monorepo.git synced 2026-05-14 20:01:09 +02:00

Till JS 1f26aa4f2f feat(local-llm): swap WebLLM/Qwen for transformers.js + Gemma 4 E2B		2026-04-08 22:22:32 +02:00

Replace the entire @mana/local-llm engine with a transformers.js-based implementation backed by Google's Gemma 4 E2B (released 2026-04-02). The external API of LocalLLMEngine (load(), generate(), prompt(), extractJson(), classify(), onStatusChange(), isSupported()) is preserved 1:1, so the /llm-test page, the playground module, and the Svelte 5 reactive bindings in svelte.svelte.ts need no changes beyond updating the default model key.

Why the engine swap: MLC has not (and as of today still hasn't) published Gemma 4 builds for WebLLM. The webml-community team and HuggingFace's onnx-community already have Gemma 4 E2B running in the browser via transformers.js + WebGPU, with a documented Gemma4ForConditionalGeneration class shipped in @huggingface/transformers v4.0.0. Going through the ONNX route gets us the latest Google model six days after release instead of waiting on MLC compilation.

Trade-offs accepted (discussed before this commit):
- transformers.js is a more generic ONNX runtime, so per-token throughput will be ~20-40% lower than WebLLM would deliver for the same model size. For a 2B model on a modern WebGPU device that's still well above interactive latency.
- The JS bundle gains ~2-3 MB (the ONNX runtime). Negligible compared to the 500 MB model download.
- transformers.js v4 is brand new (released alongside Gemma 4) so the Gemma4ForConditionalGeneration code path has very little battle testing yet. The risk is partially offset by webml-community's reference implementation.

What changed file by file:
- packages/local-llm/package.json: drop @mlc-ai/web-llm, add @huggingface/transformers ^4.0.0; bump version 0.1.0 → 0.2.0; rewrite description.
- packages/local-llm/src/types.ts: add `dtype` field to ModelConfig ('fp32' | 'fp16' | 'q8' | 'q4' | 'q4f16') so each model can request the quantization that matches its uploaded ONNX shards.
- packages/local-llm/src/models.ts: replace the old Qwen 2.5 + Gemma 2 registry with a single `gemma-4-e2b` entry pointing at onnx-community/gemma-4-E2B-it-ONNX with q4f16 quantization. Future models can be added by appending entries; the /llm-test picker reads MODELS dynamically and picks them up automatically.
- packages/local-llm/src/cache.ts: replace the WebLLM-specific hasModelInCache helper with a generic Cache API probe that looks for `https://huggingface.co/{model_id}/resolve/main/tokenizer.json` in any open cache. tokenizer.json is small, downloaded first, and always present, so its presence is a reliable proxy for "model has been loaded before".
- packages/local-llm/src/engine.ts: full rewrite. Internally we now hold a transformers.js model + processor pair (created via AutoProcessor.from_pretrained + Gemma4ForConditionalGeneration.from_pretrained with `device: 'webgpu'`), and translate our LoadingStatus union from the library's `progress_callback` shape. generate() applies Gemma's chat template via the processor, runs model.generate() with optional TextStreamer for streaming, then slices the prompt tokens off the output tensor to compute per-call usage. The convenience methods (prompt, extractJson, classify) are unchanged because they only call generate() under the hood.
- packages/local-llm/src/generate.ts and status.svelte.ts: deleted. These were orphaned from a much earlier engine API (referenced `getEngine()` / `subscribe()` / `LlmState` symbols that haven't existed for a while) and were never re-exported from index.ts; they only showed up because `tsc --noEmit` was crawling the src tree. Their functionality lives in engine.ts + svelte.svelte.ts now.
- apps/mana/apps/web/package.json: swap the direct dep from @mlc-ai/web-llm to @huggingface/transformers. This is the same trick we used for the previous adapter-node externals warning: having it as a direct dep makes adapter-node's Rollup pass treat it as external automatically.
- apps/mana/apps/web/vite.config.ts: swap ssr.external entry from @mlc-ai/web-llm to @huggingface/transformers. Add a comment explaining the why so the next person doesn't wonder.
- apps/mana/apps/web/src/routes/(app)/llm-test/+page.svelte: change the default selectedModel from 'qwen-2.5-1.5b' to 'gemma-4-e2b'. All other model display strings come from the MODELS registry, so this is the single hard-coded reference that needed updating.
- pnpm-lock.yaml: regenerated. Confirmed @mlc-ai/web-llm is gone (0 references) and @huggingface/transformers is in (4 references).

CSP: no header changes needed. We already opened connect-src for huggingface.co + cdn-lfs.huggingface.co + raw.githubusercontent.com when fixing the WebLLM blockers earlier today, and 'wasm-unsafe-eval' is already in script-src; both transformers.js (ONNX runtime) and WebLLM (MLC runtime) need that. If transformers.js spawns its inference into a Web Worker via a blob URL we may need to add `worker-src 'self' blob:` once we hit the first runtime test, but the existing CSP should be enough for the synchronous path.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
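
The types.ts, models.ts, and cache.ts changes described in the commit message above can be sketched roughly as follows. This is an illustration under assumptions, not the repository's actual code: only the `dtype` union, the `gemma-4-e2b` key, the ONNX model ID, the q4f16 quantization, and the tokenizer.json URL pattern come from the commit message. The field names `modelId` and `displayName`, the `CacheLike` and `CacheStorageLike` helper types, the `tokenizerUrl` helper, and the `storage` parameter are hypothetical and exist only so the probe can run outside a browser.

```typescript
// Quantization formats named in the commit message.
type Dtype = 'fp32' | 'fp16' | 'q8' | 'q4' | 'q4f16';

// Hypothetical shape: only `dtype` is confirmed by the commit message;
// `modelId` and `displayName` are illustrative field names.
interface ModelConfig {
	modelId: string;
	displayName: string;
	dtype: Dtype;
}

// Registry with the single entry the commit describes.
const MODELS: Record<string, ModelConfig> = {
	'gemma-4-e2b': {
		modelId: 'onnx-community/gemma-4-E2B-it-ONNX',
		displayName: 'Gemma 4 E2B',
		dtype: 'q4f16',
	},
};

// Minimal structural types (assumed for this sketch). In a browser,
// `globalThis.caches` (the Cache API's CacheStorage) fits this shape.
interface CacheLike {
	match(url: string): Promise<unknown>;
}
interface CacheStorageLike {
	keys(): Promise<string[]>;
	open(name: string): Promise<CacheLike>;
}

// URL pattern quoted in the commit message.
function tokenizerUrl(modelId: string): string {
	return `https://huggingface.co/${modelId}/resolve/main/tokenizer.json`;
}

// Returns true if any open cache holds the model's tokenizer.json,
// which the commit treats as a proxy for "model has been loaded before".
async function hasModelInCache(
	modelId: string,
	storage: CacheStorageLike
): Promise<boolean> {
	const url = tokenizerUrl(modelId);
	for (const name of await storage.keys()) {
		const cache = await storage.open(name);
		if ((await cache.match(url)) !== undefined) return true;
	}
	return false;
}
```

In a browser this would presumably be called as `hasModelInCache(MODELS['gemma-4-e2b'].modelId, caches)`. Passing the storage in explicitly, rather than reading the global, is a design choice of this sketch that keeps the probe testable with a fake.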
.changeset	feat(versioning): add semantic versioning and changesets to all apps	2026-03-19 16:20:18 +01:00
.claude	feat(manacore/web): wire TagField, FavoriteButton, ColorPicker into module UIs	2026-04-02 17:20:46 +02:00
.github	test(integration): end-to-end auth flow test with Mailpit + CI gating	2026-04-08 17:14:02 +02:00
.husky	fix(devtools): fix pre-commit hook - add eslint-config dep, remove type-check	2026-03-17 13:08:51 +01:00
apps	feat(local-llm): swap WebLLM/Qwen for transformers.js + Gemma 4 E2B	2026-04-08 22:22:32 +02:00
docker	fix(docker): drop stale shared-subscription-* COPY lines from sveltekit-base	2026-04-08 18:28:59 +02:00
docs	feat(env): persistent dev secrets via .env.secrets override	2026-04-08 17:50:37 +02:00
games	chore: complete ManaCore → Mana rename (docs, go modules, plists, images)	2026-04-07 12:26:10 +02:00
load-tests	chore: rename mukke to music in infra, scripts, and CI/CD	2026-04-05 16:47:57 +02:00
NewAppIdeas/Roblox Reimagined	chore: complete ManaCore → Mana rename (docs, go modules, plists, images)	2026-04-07 12:26:10 +02:00
packages	feat(local-llm): swap WebLLM/Qwen for transformers.js + Gemma 4 E2B	2026-04-08 22:22:32 +02:00
patches	fix(traces): configure EAS Build for TestFlight and fix bot-services build	2026-03-17 13:16:38 +01:00
scripts	fix(mana-auth): account lockout was structurally dead + add failure-path tests	2026-04-08 18:29:00 +02:00
services	fix(mana-auth): account lockout was structurally dead + add failure-path tests	2026-04-08 18:29:00 +02:00
tests	fix(mana-auth): account lockout was structurally dead + add failure-path tests	2026-04-08 18:29:00 +02:00
.dockerignore	make auth working	2025-11-26 01:31:12 +01:00
.editorconfig	feat: add monorepo configuration and shared packages structure	2025-11-22 23:41:52 +01:00
.env.development	chore(env): default MANA_LLM_URL to llm.mana.how	2026-04-08 16:55:01 +02:00
.env.macmini.example	docs: Phase 9 documentation roundup — close encryption-shaped doc gaps	2026-04-08 11:47:59 +02:00
.env.secrets.example	feat(env): persistent dev secrets via .env.secrets override	2026-04-08 17:50:37 +02:00
.gitignore	feat(env): persistent dev secrets via .env.secrets override	2026-04-08 17:50:37 +02:00
.npmrc	fix(monorepo): add .npmrc with node-linker=hoisted for EAS Build compatibility	2026-03-15 08:50:18 +01:00
.nvmrc	feat: add monorepo configuration and shared packages structure	2025-11-22 23:41:52 +01:00
.prettierignore	chore: complete ManaCore → Mana rename (docs, go modules, plists, images)	2026-04-07 12:26:10 +02:00
.prettierrc.json	fix(cicd): docker paths, formatting config,	2025-11-27 18:33:08 +01:00
CLAUDE.md	fix(mana-auth) + chore: rewrite /api/v1/auth/login JWT mint, remove Matrix stack	2026-04-08 16:32:13 +02:00
cloudflared-config.yml	fix(mana-auth) + chore: rewrite /api/v1/auth/login JWT mint, remove Matrix stack	2026-04-08 16:32:13 +02:00
docker-compose.dev.yml	test(integration): end-to-end auth flow test with Mailpit + CI gating	2026-04-08 17:14:02 +02:00
docker-compose.macmini.yml	fix(macmini): mount prometheus config directly so /-/reload picks up edits	2026-04-08 17:25:48 +02:00
docker-compose.test.yml	test(integration): end-to-end auth flow test with Mailpit + CI gating	2026-04-08 17:14:02 +02:00
eslint.config.mjs	chore: complete ManaCore → Mana rename (docs, go modules, plists, images)	2026-04-07 12:26:10 +02:00
gift-codes-2026-02-14.txt	✨ feat(gifts): add gift code creation script and initial codes	2026-02-14 11:23:08 +01:00
lint-staged.config.js	chore: archive 17 standalone app servers (replaced by unified API)	2026-04-02 21:37:45 +02:00
package.json	feat(env): persistent dev secrets via .env.secrets override	2026-04-08 17:50:37 +02:00
playwright.config.ts	style: auto-format codebase with Prettier	2025-11-27 18:33:16 +01:00
pnpm-lock.yaml	feat(local-llm): swap WebLLM/Qwen for transformers.js + Gemma 4 E2B	2026-04-08 22:22:32 +02:00
pnpm-workspace.yaml	chore: delete 25 web-archived directories, remove stale stubs, clean workspace config	2026-04-03 13:03:49 +02:00
README.md	chore: complete ManaCore → Mana rename (docs, go modules, plists, images)	2026-04-07 12:26:10 +02:00
TROUBLESHOOTING.md	chore: complete ManaCore → Mana rename (docs, go modules, plists, images)	2026-04-07 12:26:10 +02:00
turbo.json	feat: rename ManaCore to Mana across entire codebase	2026-04-05 20:00:13 +02:00
vitest.config.ts	feat: rename ManaCore to Mana across entire codebase	2026-04-05 20:00:13 +02:00

README.md

Mana Monorepo

Monorepo containing all Mana projects — a self-hosted multi-app ecosystem with shared packages and unified tooling.

Projects

Project	Description	Apps
mana	Multi-app ecosystem platform	Expo mobile, SvelteKit web
chat	AI chat application	NestJS backend, Expo mobile, SvelteKit web, Astro landing
todo	Task management	NestJS backend, SvelteKit web, Astro landing
calendar	Calendar & scheduling	NestJS backend, SvelteKit web, Astro landing
clock	Pomodoro & time tracking	NestJS backend, SvelteKit web, Astro landing
contacts	Contact management	NestJS backend, SvelteKit web
picture	AI image generation	NestJS backend, Expo mobile, SvelteKit web, Astro landing
cards	Card/deck management	NestJS backend, Expo mobile, SvelteKit web
zitare	Daily inspiration quotes	NestJS backend, Expo mobile, SvelteKit web, Astro landing
mukke	Music player	NestJS backend, SvelteKit web
planta	Plant care tracker	NestJS backend, SvelteKit web
storage	Cloud storage	NestJS backend, SvelteKit web
questions	Q&A with web search	SvelteKit web
skilltree	Skill tree visualization	NestJS backend, SvelteKit web
nutriphi	Nutrition tracking	NestJS backend, SvelteKit web
citycorners	City guide	NestJS backend, SvelteKit web, Astro landing
presi	Presentation tool	NestJS backend, SvelteKit web
photos	Photo management	NestJS backend, SvelteKit web

Getting Started

Prerequisites

Node.js 20+
pnpm 9.15.0+
Docker (for PostgreSQL, Redis, MinIO)

Installation

pnpm install

Development

# Start infrastructure (PostgreSQL, Redis, MinIO)
pnpm docker:up

# Start any app with auto DB setup
pnpm dev:chat:full
pnpm dev:todo:full
pnpm dev:calendar:full
pnpm dev:contacts:full

# Build & quality
pnpm run build
pnpm run type-check
pnpm run format

See CLAUDE.md for comprehensive development documentation.

Architecture

mana-monorepo/
├── apps/                    # Product applications
├── services/                # Microservices (auth, search, LLM, bots)
├── packages/                # Shared packages
├── docker/                  # Docker configuration
└── scripts/                 # Development & deployment scripts

Tooling

Package Manager: pnpm 9.15.0
Build System: Turborepo
Formatting: Prettier (tabs, single quotes, 100 char width)
Hosting: Mac Mini (self-hosted) via Docker + Cloudflare Tunnel
Analytics: Umami (stats.mana.how)
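
For orientation, the formatting settings listed under Tooling map onto Prettier options roughly like this. It is a sketch only: the option names are Prettier's standard ones and the values are taken from the list above, but the repository's actual .prettierrc.json is not shown here and may set further options.

```typescript
// Assumed mapping of "tabs, single quotes, 100 char width" to Prettier options.
// The constant name `prettierConfig` is illustrative.
const prettierConfig: {
	useTabs: boolean;
	singleQuote: boolean;
	printWidth: number;
} = {
	useTabs: true, // indent with tabs
	singleQuote: true, // prefer 'single' over "double" quotes
	printWidth: 100, // wrap lines at 100 characters
};
```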