managarten

mirror of https://github.com/Memo-2023/mana-monorepo.git synced 2026-05-15 05:01:08 +02:00

History

Till JS 16e0d99c5a feat(gpu-server): complete GPU server setup with AI services, monitoring, and public access - Set up 5 AI services on Windows GPU server (RTX 3090): - mana-llm (Port 3025): OpenAI-compatible LLM gateway via Ollama - mana-stt (Port 3020): WhisperX with word timestamps + speaker diarization - mana-tts (Port 3022): Kokoro (EN) + Edge TTS (DE) + Piper (local DE) - mana-image-gen (Port 3023): FLUX.2 klein 4B image generation - Ollama (Port 11434): gemma3:4b/12b, qwen2.5-coder:14b, nomic-embed-text - Add @manacore/shared-gpu TypeScript client package with SttClient, TtsClient, ImageClient - Add CUDA-compatible whisper_service using faster-whisper for Windows - Configure public access via Cloudflare Tunnel (gpu-llm/stt/tts/img.mana.how) - Add Loki log aggregator (Docker on Mac Mini) + log shipper on GPU server - Add GPU scrape targets to Prometheus/VictoriaMetrics config - Add Grafana Loki datasource for GPU service logs - Add health check with auto-restart, log rotation, and log shipping - Document complete setup: Always-On config, troubleshooting, architecture Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>		2026-03-27 21:35:30 +01:00
..
__init__.py	✨ feat(stt): add speech-to-text service for Mac Mini	2026-01-27 01:33:10 +01:00
auth.py	✨ feat(auth): add API key management for STT/TTS services	2026-02-12 02:12:05 +01:00
external_auth.py	✨ feat(auth): add API key management for STT/TTS services	2026-02-12 02:12:05 +01:00
main.py	🔒️ feat(stt,tts): add API key authentication with rate limiting	2026-02-11 18:04:22 +01:00
vllm_service.py	✨ feat(mana-stt): add vLLM integration for Voxtral transcription	2026-02-11 16:10:00 +01:00
voxtral_api_service.py	✨ feat(mana-stt): add vLLM integration for Voxtral transcription	2026-02-11 16:10:00 +01:00
voxtral_service.py	✨ feat(mana-stt): add vLLM integration for Voxtral transcription	2026-02-11 16:10:00 +01:00
whisper_service.py	fix(stt): change default model to large-v3 (large-v3-turbo not supported by lightning-whisper-mlx)	2026-01-27 01:36:49 +01:00
whisper_service_cuda.py	feat(gpu-server): complete GPU server setup with AI services, monitoring, and public access	2026-03-27 21:35:30 +01:00