managarten/docker
Till JS 0ae1e70bf1 fix(monitoring): status-page covers all standalone apps + restore who.mana.how routing
Audit revealed status.mana.how was probing only the unified mana-app
path-routes (mana.how/{module}) plus a couple of GPU services. None
of the standalone deployments were monitored, and three probe targets
were stale.

Changes:

- prometheus.yml blackbox-web: drop mana.how/{context,who} (context
  module was dropped 2026-04-29; mana.how/who never existed —
  /who is a standalone stack on its own subdomain). Add the eight
  hosts that DO have separate deployments today: whopxl, manavoxel,
  memoro (landing), cards (Phase-1 spinoff), who.mana.how/cantina,
  npm (Verdaccio).
- prometheus.yml blackbox-api: add memoro-api/health,
  memoro-audio/health, who-api.mana.how/api/decks,
  admin.mana.how/health (admin's root is auth-walled, only /health
  returns 200).
- prometheus.yml blackbox-gpu: add gpu-llm.mana.how/health (was
  missing; gpu-stt/tts/img/video were in, gpu-llm was somehow not).
- cloudflared-config.yml: restore who.mana.how → :5092 +
  who-api.mana.how → :3092. The DNS CNAME points at the Mini tunnel
  but the route entries had been lost during a previous compose
  cleanup, so every who.* request was hitting the catch-all 404 and
  the standalone Bun stack was effectively orphaned at the edge
  (PM2 + LaunchAgent all healthy on Mini, just no public route).

Live state after rollout: status.mana.how shows 57/59 services UP,
the two remaining DOWN are pre-existing — photon-self (Phase-2c
cross-LAN routing limitation, documented in PLAN_OPTION_C.md) and
whopxl-web (container not running on the Mini, separate issue).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-05-07 14:09:31 +02:00
..
alert-notifier feat: rename ManaCore to Mana across entire codebase 2026-04-05 20:00:13 +02:00
alertmanager feat: rename ManaCore to Mana across entire codebase 2026-04-05 20:00:13 +02:00
blackbox feat(monitoring): add uptime monitoring via Blackbox Exporter 2026-03-31 17:43:25 +02:00
grafana chore(observability): Grafana dashboard for agent-loop metrics 2026-04-23 18:09:32 +02:00
init-db feat(mail): add mana-mail service and frontend module (Phase 1 MVP) 2026-04-13 20:35:54 +02:00
loki feat(gpu-server): complete GPU server setup with AI services, monitoring, and public access 2026-03-27 21:35:30 +01:00
nginx refactor: rename zitare -> quotes (Zitate) 2026-04-14 20:59:16 +02:00
postgres fix(infra): use postgres -c flags instead of config_file override 2026-03-24 11:42:42 +01:00
prometheus fix(monitoring): status-page covers all standalone apps + restore who.mana.how routing 2026-05-07 14:09:31 +02:00
promtail fix(mana-auth) + chore: rewrite /api/v1/auth/login JWT mint, remove Matrix stack 2026-04-08 16:32:13 +02:00
shared 🐛 fix(docker): add missing build-shared-packages.sh script for Docker builds 2025-12-25 20:51:15 +01:00
templates chore: remove all NestJS backend references, replace with Hono/Bun 2026-03-31 16:52:25 +02:00
tempo feat(mana-ai): OpenTelemetry tracing + Grafana Tempo backend 2026-04-16 15:21:23 +02:00
Dockerfile.hono-server feat(infra): add docker-compose for new Hono services + DB init 2026-03-28 17:54:24 +01:00
Dockerfile.sveltekit-base fix(docker): COPY packages/cards-core in SvelteKit Dockerfiles 2026-05-07 01:47:07 +02:00