managarten

till/managarten

Fork 0

mirror of https://github.com/Memo-2023/mana-monorepo.git synced 2026-05-14 22:01:09 +02:00

Commit graph

Author SHA1 Message Date

Author	SHA1	Message	Date
Till JS	d8a35afd99	infra(gpu-box): commit GPU-Box compose to repo + Phase 2e docs The GPU-Box stack has been carrying real production workload since Phase 2c (monitoring) but only existed as a /srv/mana/docker-compose.gpu-box.yml on the box itself. If the WSL filesystem dies, none of it is reproducible. Bring the file into infrastructure/ as the source of truth (live file on the box must be kept synchronous; manual rsync for now since there's no CD into the GPU box). Plus: - infrastructure/.env.gpu-box.example as the secrets template - infrastructure/README.md describing what runs there + how the Cloudflare-tunnel ingress is API-managed (not config.yml) - .gitignore for the live infrastructure/.env.gpu-box copy - MAC_MINI_SERVER.md status-page section now points at the GPU-Box setup instead of the long-stopped Mini container - PLAN_OPTION_C.md: Phase 2e row + GPU-Box service tree update Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-07 13:28:49 +02:00
Till JS	dd2e609545	fix(docker): COPY packages/cards-core in SvelteKit Dockerfiles The cards-spinoff commit (`0a544ac41`) added @mana/cards-core as a workspace dependency for apps/mana/apps/web but didn't update the two Dockerfiles that COPY-and-pnpm-install the workspace into the image. CD's --no-cache build for mana-web therefore failed at `pnpm install` with ERR_PNPM_WORKSPACE_PKG_NOT_FOUND, leaving the container on a stale pre-cleanup image whose ListView28 chunk still referenced the dropped contextSpaces Dexie table — every mana.how route 500'd. Adding the COPY line to both files (the shared sveltekit-base layer and the per-app layer that does a second pnpm install) makes the package available to the workspace resolver and lets the build go through. Plus the Phase 2c-d doc updates that piled up today (Glitchtip on dedicated GPU-box stack, gitignore for *_CREDENTIALS.md files). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-07 01:47:07 +02:00
Till JS	c14aef9f85	docs(infra): Mac-Mini ↔ Windows-GPU-Box workload-split — Plan Option C Hilfsdienste (Monitoring, Forgejo, Glitchtip, Umami) wandern von der auslastungs-kritischen Mac-Mini-Box auf die Windows-GPU-Box, die ohnehin 95 % System-RAM idle hat. Production-Hot-Path bleibt auf dem Mini, kein Geld ausgegeben, Single-Point-of-Failure am Standort reduziert. Stand 2026-05-06: Phase 0–2b shipped (WSL2-Docker, Grafana cross-box, Forgejo, Umami healthy). Phase 2c (Loki+VM+Alerts) und Phase 4 (Cloudflare-Cutover für grafana.mana.how) brauchen eigene Sessions — beides Pre-existing-Mis-config-Aufräumen, kein Architektur-Risiko. Hardware-Inventar in WINDOWS_GPU_SERVER_SETUP.md ergänzt: Ryzen 9 5950X, 64 GB DDR4, RTX 3090, 660 GB frei C:. WSL2 auf 24 GB / 12 vCPU gedeckelt damit AI-Scheduled-Tasks > 30 GB Reserve haben. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-06 20:39:01 +02:00

Till JS

d8a35afd99

infra(gpu-box): commit GPU-Box compose to repo + Phase 2e docs

The GPU-Box stack has been carrying real production workload since
Phase 2c (monitoring) but only existed as a /srv/mana/docker-compose.gpu-box.yml
on the box itself. If the WSL filesystem dies, none of it is
reproducible. Bring the file into infrastructure/ as the source of
truth (live file on the box must be kept synchronous; manual rsync
for now since there's no CD into the GPU box).

Plus:
- infrastructure/.env.gpu-box.example as the secrets template
- infrastructure/README.md describing what runs there + how the
  Cloudflare-tunnel ingress is API-managed (not config.yml)
- .gitignore for the live infrastructure/.env.gpu-box copy
- MAC_MINI_SERVER.md status-page section now points at the GPU-Box
  setup instead of the long-stopped Mini container
- PLAN_OPTION_C.md: Phase 2e row + GPU-Box service tree update

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-05-07 13:28:49 +02:00

Till JS

dd2e609545

fix(docker): COPY packages/cards-core in SvelteKit Dockerfiles

The cards-spinoff commit (0a544ac41) added @mana/cards-core as a
workspace dependency for apps/mana/apps/web but didn't update the
two Dockerfiles that COPY-and-pnpm-install the workspace into the
image. CD's --no-cache build for mana-web therefore failed at
`pnpm install` with ERR_PNPM_WORKSPACE_PKG_NOT_FOUND, leaving the
container on a stale pre-cleanup image whose ListView28 chunk still
referenced the dropped contextSpaces Dexie table — every mana.how
route 500'd.

Adding the COPY line to both files (the shared sveltekit-base layer
and the per-app layer that does a second pnpm install) makes the
package available to the workspace resolver and lets the build go
through.

Plus the Phase 2c-d doc updates that piled up today (Glitchtip
on dedicated GPU-box stack, gitignore for *_CREDENTIALS.md files).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-05-07 01:47:07 +02:00

Till JS

c14aef9f85

docs(infra): Mac-Mini ↔ Windows-GPU-Box workload-split — Plan Option C

Hilfsdienste (Monitoring, Forgejo, Glitchtip, Umami) wandern von der
auslastungs-kritischen Mac-Mini-Box auf die Windows-GPU-Box, die
ohnehin 95 % System-RAM idle hat. Production-Hot-Path bleibt auf dem
Mini, kein Geld ausgegeben, Single-Point-of-Failure am Standort
reduziert.

Stand 2026-05-06: Phase 0–2b shipped (WSL2-Docker, Grafana cross-box,
Forgejo, Umami healthy). Phase 2c (Loki+VM+Alerts) und Phase 4
(Cloudflare-Cutover für grafana.mana.how) brauchen eigene Sessions —
beides Pre-existing-Mis-config-Aufräumen, kein Architektur-Risiko.

Hardware-Inventar in WINDOWS_GPU_SERVER_SETUP.md ergänzt: Ryzen 9 5950X,
64 GB DDR4, RTX 3090, 660 GB frei C:. WSL2 auf 24 GB / 12 vCPU
gedeckelt damit AI-Scheduled-Tasks > 30 GB Reserve haben.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-05-06 20:39:01 +02:00

3 commits