Till JS
4a48182677
feat(monitoring): integrate Promtail for centralized log collection via Loki
...
Loki was already running but had no log shipper. Adds Promtail to collect
Docker logs from all 66 containers with automatic tier labeling (infra,
auth, core, app, matrix, games) and a Grafana Logs Explorer dashboard.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-03-29 19:22:44 +02:00
Till JS
16e0d99c5a
feat(gpu-server): complete GPU server setup with AI services, monitoring, and public access
...
- Set up 5 AI services on Windows GPU server (RTX 3090):
- mana-llm (Port 3025): OpenAI-compatible LLM gateway via Ollama
- mana-stt (Port 3020): WhisperX with word timestamps + speaker diarization
- mana-tts (Port 3022): Kokoro (EN) + Edge TTS (DE) + Piper (local DE)
- mana-image-gen (Port 3023): FLUX.2 klein 4B image generation
- Ollama (Port 11434): gemma3:4b/12b, qwen2.5-coder:14b, nomic-embed-text
- Add @manacore/shared-gpu TypeScript client package with SttClient, TtsClient, ImageClient
- Add CUDA-compatible whisper_service using faster-whisper for Windows
- Configure public access via Cloudflare Tunnel (gpu-llm/stt/tts/img.mana.how)
- Add Loki log aggregator (Docker on Mac Mini) + log shipper on GPU server
- Add GPU scrape targets to Prometheus/VictoriaMetrics config
- Add Grafana Loki datasource for GPU service logs
- Add health check with auto-restart, log rotation, and log shipping
- Document complete setup: Always-On config, troubleshooting, architecture
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-03-27 21:35:30 +01:00