managarten

mirror of https://github.com/Memo-2023/mana-monorepo.git synced 2026-05-15 00:01:10 +02:00

Author	SHA1	Message	Date
Till JS	3f91c4656a	feat(infra): add deploy tracking with PostgreSQL, Pushgateway & Grafana dashboard Instrument the CD pipeline to record per-deploy and per-service metrics (build time, image size, startup time, health status) into PostgreSQL and push gauges to Pushgateway. Adds a Grafana dashboard with 13 panels covering deploy frequency, build performance, service health, and history. New files: - scripts/mac-mini/init-deploy-tracking.sql (idempotent DDL) - scripts/deploy-metrics.sh (bash library for CI) - docker/grafana/provisioning/datasources/deploy-tracking.yml - docker/grafana/dashboards/deploy-tracking.json Modified: - docker/prometheus/prometheus.yml (pushgateway scrape job) - .github/workflows/cd-macmini.yml (build/health instrumentation) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-20 17:08:03 +01:00
Till-JS	acc8de36ee	feat(monitoring): add alerting stack and maintenance scripts Medium priority stability improvements: Alerting: - Add vmalert for evaluating Prometheus alert rules - Add alertmanager for alert routing and grouping - Add alert-notifier service for Telegram/ntfy notifications - Enable cadvisor scraping in prometheus config Disk Monitoring: - Add check-disk-space.sh for hourly disk monitoring - Alert on 80% (warning) and 90% (critical) thresholds - Auto-cleanup Docker when disk is critical - Add com.manacore.disk-check.plist for LaunchD Weekly Reports: - Add weekly-report.sh for system health summary - Includes: backup status, disk usage, container health, database stats, error log summary - Runs every Sunday at 10 AM via LaunchD Health Check Updates: - Add checks for vmalert, alertmanager, alert-notifier Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-12 13:46:57 +01:00
Till-JS	fe33f4b355	✅ fix(mana-core-auth): complete production readiness with test fixes - Fix LoggerService mock in better-auth.service.spec.ts - Fix name assertion in auth.controller.spec.ts (empty string fallback) - Fix createRemoteJWKSet mock in jwt-auth.guard.spec.ts - Add Grafana dashboard for Auth Service monitoring - Add 10 auth-specific Prometheus alert rules - Update production readiness plan to 100% complete All 199 unit tests passing. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-01 14:18:58 +01:00
Till-JS	7aa5115c78	📊 feat(monitoring): add node-exporter for host system metrics - Add node-exporter service to docker-compose for CPU/Memory/Disk monitoring - Enable node-exporter scrape target in Prometheus config - Update System Overview dashboard with Host System section: - CPU, Memory, Disk usage gauges - Total RAM, Total Disk, Uptime, Load stats - CPU & Memory over time graph - Network I/O graph - Add Node Exporter to service status panel Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-01 12:38:44 +01:00
Till-JS	1b39aa8308	🔧 fix(prometheus): disable non-existent scrape targets Commented out: - node-exporter (container not deployed) - cadvisor (container not deployed) - storage/presi/nutriphi-backend (no /metrics endpoint yet) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-01 05:53:22 +01:00
Till-JS	dac6a85427	🔧 fix(prometheus): correct backend ports and add missing services - chat-backend: 3002 → 3030 - todo-backend: 3018 → 3031 - calendar-backend: 3016 → 3032 - clock-backend: 3017 → 3033 - contacts-backend: 3015 → 3034 - Add storage-backend (3035), presi-backend (3036), nutriphi-backend (3037) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-01 05:51:50 +01:00
Till-JS	edf13b7102	revert: fix CI by reverting Telegram notifications Reverting `618c58c5` which broke the CI workflow. Will re-add notifications after fixing the issue. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-26 10:40:10 +01:00
Till-JS	618c58c519	feat(ci): add Telegram notifications and Grafana CI/CD dashboard - Add notify-start job with Telegram notification for build start - Add notify-complete job with build status and duration notification - Push CI metrics to Prometheus Pushgateway for Grafana visualization - Create CI/CD Grafana dashboard with build status, duration, and history - Add Pushgateway scrape config to Prometheus Requires TELEGRAM_BOT_TOKEN, TELEGRAM_CHAT_ID, and PUSHGATEWAY_URL secrets. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-26 10:31:17 +01:00
Till-JS	8c259a008b	feat(monitoring): add comprehensive Grafana dashboards and alerting New dashboards: - Application Details: Node.js runtime (heap, event loop, GC), HTTP details (status codes, methods, top routes), error analysis - Database Details: PostgreSQL and Redis metrics with detailed breakdowns Alerting rules (docker/prometheus/alerts.yml): - Service: down, high/very high error rate, slow response time - Infrastructure: high CPU/memory/disk usage - Database: PostgreSQL/Redis down, high connections, low cache hit - Container: high CPU/memory, restarts All dashboards include service selector variable for filtering. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-26 09:47:18 +01:00
Till-JS	6d86a08d63	feat: add monitoring dashboard (Prometheus + Grafana + Umami + Admin) Phase 1: Infrastructure - Add docker/prometheus/prometheus.yml with scrape configs for all services - Add docker/grafana/provisioning for auto-configured datasources - Add docker/grafana/dashboards (system-overview, backends-docker) - Update docker-compose.macmini.yml with monitoring services: - prometheus, grafana, node-exporter, cadvisor - postgres-exporter, redis-exporter, umami - Add grafana.mana.how and analytics.mana.how to Caddyfile Phase 2: Backend Metrics - Create packages/shared-nestjs-metrics with: - MetricsModule (auto /metrics endpoint) - MetricsService (Counter, Histogram, Gauge helpers) - MetricsMiddleware (auto HTTP request tracking) Phase 3: Umami Web Analytics - Add Umami tracking scripts to all landing pages - Add Umami tracking scripts to all web apps - Create scripts/mac-mini/setup-umami-db.sh Phase 4: Admin Dashboard (ManaCore Web) - Add admin routes: /admin, /admin/users, /admin/system - Create StatCard, QuickLinks, UserTable components - Add Admin link to navigation Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-23 15:31:39 +01:00

10 commits