fix(monitoring): status-page covers all standalone apps + restore who.mana.how routing

Audit revealed status.mana.how was probing only the unified mana-app
path-routes (mana.how/{module}) plus a couple of GPU services. None
of the standalone deployments were monitored, and three probe targets
were stale.

Changes:

- prometheus.yml blackbox-web: drop mana.how/{context,who} (context
  module was dropped 2026-04-29; mana.how/who never existed —
  /who is a standalone stack on its own subdomain). Add the eight
  hosts that DO have separate deployments today: whopxl, manavoxel,
  memoro (landing), cards (Phase-1 spinoff), who.mana.how/cantina,
  npm (Verdaccio).
- prometheus.yml blackbox-api: add memoro-api/health,
  memoro-audio/health, who-api.mana.how/api/decks,
  admin.mana.how/health (admin's root is auth-walled, only /health
  returns 200).
- prometheus.yml blackbox-gpu: add gpu-llm.mana.how/health (was
  missing; gpu-stt/tts/img/video were in, gpu-llm was somehow not).
- cloudflared-config.yml: restore who.mana.how → :5092 +
  who-api.mana.how → :3092. The DNS CNAME points at the Mini tunnel
  but the route entries had been lost during a previous compose
  cleanup, so every who.* request was hitting the catch-all 404 and
  the standalone Bun stack was effectively orphaned at the edge
  (PM2 + LaunchAgent all healthy on Mini, just no public route).

Live state after rollout: status.mana.how shows 57/59 services UP,
the two remaining DOWN are pre-existing — photon-self (Phase-2c
cross-LAN routing limitation, documented in PLAN_OPTION_C.md) and
whopxl-web (container not running on the Mini, separate issue).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
This commit is contained in:
Till JS 2026-05-07 14:09:31 +02:00
parent daa1ef0513
commit 0ae1e70bf1
2 changed files with 29 additions and 3 deletions

View file

@ -256,7 +256,7 @@ scrape_configs:
- https://mana.how/citycorners
- https://mana.how/memoro
- https://mana.how/moodlit
- https://mana.how/context
# mana.how/context: Modul wurde 2026-04-29 gedropt (Commit 1815139dc) — Probe entfernt
- https://mana.how/questions
- https://mana.how/uload
- https://mana.how/notes
@ -271,12 +271,21 @@ scrape_configs:
- https://mana.how/events
- https://mana.how/finance
- https://mana.how/places
- https://mana.how/who
# mana.how/who: existiert nicht im unified-app — Who läuft als Standalone-Stack auf who.mana.how
- https://mana.how/news
- https://mana.how/mail
- https://mana.how/playground
# Standalone games (separate containers)
# ─── Standalone Apps / Games (separate Container, eigene Tunnel-Hostnames) ───
- https://whopxl.mana.how
- https://manavoxel.mana.how
# Memoro Standalone-Stack (Phase 2 mana e.V. Plattform-Migration)
- https://memoro.mana.how
# Cards Standalone (Phase-1 Spinoff vom Unified-mana-Modul, 2026-05-06)
- https://cards.mana.how
# Who? Game (Standalone-Bun-Stack auf Mac Mini, native unter PM2)
- https://who.mana.how/cantina
# npm-Registry (mana e.V. Plattform-Repo, Verdaccio)
- https://npm.mana.how
relabel_configs:
- source_labels: [__address__]
target_label: __param_target
@ -294,6 +303,13 @@ scrape_configs:
- targets:
- https://auth.mana.how/health
- https://api.mana.how/health
# Memoro standalone API + Audio (Phase 2 platform migration)
- https://memoro-api.mana.how/health
- https://memoro-audio.mana.how/health
# who.mana.how API on /api/decks — root is 404 by design (Phaser-Cantina mounts at /cantina)
- https://who.mana.how/api/decks
# Verein backoffice (mana e.V. Plattform); only /health returns 200, root is auth-walled
- https://admin.mana.how/health
relabel_configs:
- source_labels: [__address__]
target_label: __param_target
@ -360,6 +376,7 @@ scrape_configs:
- https://gpu-tts.mana.how/health
- https://gpu-img.mana.how/health
- https://gpu-video.mana.how/health
- https://gpu-llm.mana.how/health
relabel_configs:
- source_labels: [__address__]
target_label: __param_target