managarten

mirror of https://github.com/Memo-2023/mana-monorepo.git synced 2026-05-21 14:06:42 +02:00

Author	SHA1	Message	Date
Till-JS	21d50d1e0b	📝 docs(mana-stt): document Whisper + Mistral API architecture - Disable vLLM by default (has issues on macOS CPU) - Use Mistral API for Voxtral transcription (cloud-based) - Keep Whisper-MLX for local transcription - Update README with architecture diagram Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-11 16:34:03 +01:00
Till-JS	7c9c2645e3	🐛 fix(mana-stt): adjust vLLM config for CPU mode - Reduce max-model-len to 4096 for CPU compatibility - Add max-num-batched-tokens matching the context size - Add enforce-eager for stable CPU inference Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-11 16:14:14 +01:00
Till-JS	60394076e5	✨ feat(mana-stt): add vLLM integration for Voxtral transcription - Add vllm_service.py as proxy to vLLM server for Voxtral 3B/4B - Add voxtral_api_service.py for Mistral API fallback - Update main.py with /transcribe/voxtral endpoint using vLLM - Add /transcribe/auto endpoint with automatic fallback chain - Create setup-vllm.sh and start-vllm-voxtral.sh scripts - Add launchd plist files for Mac Mini deployment - Add install-services.sh for automated service installation Architecture: - vLLM server runs Voxtral models on port 8100 - mana-stt proxies to vLLM with Mistral API fallback - Fallback chain: vLLM -> Mistral API Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-11 16:10:00 +01:00
Till-JS	6402f287e8	feat(telegram-bot): add local STT support and Prometheus metrics - Fix telegram_user_id column type (integer -> bigint) for large user IDs - Add local STT support via mana-stt service (Whisper MLX + Voxtral) - Add STT provider config (local/openai) with fallback support - Add Grafana dashboard for mana-stt service metrics - Add ollama-metrics-proxy for LLM metrics collection - Add Grafana dashboard for Ollama LLM metrics Services added/updated: - telegram-project-doc-bot: local STT integration - mana-stt: Grafana dashboard - ollama-metrics-proxy: new service for Ollama metrics Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-27 16:51:09 +01:00
Till-JS	bff80b552a	fix(stt): remove unsupported add_generation_prompt kwarg	2026-01-27 03:24:43 +01:00
Till-JS	a2233dc366	fix(stt): properly encode audio as base64 for Voxtral	2026-01-27 02:13:34 +01:00
Till-JS	49255ac794	fix(stt): use correct AutoModel for Voxtral multimodal architecture	2026-01-27 01:58:32 +01:00
Till-JS	92a700ac7e	fix(stt): change default model to large-v3 (large-v3-turbo not supported by lightning-whisper-mlx)	2026-01-27 01:36:49 +01:00
Till-JS	bf0fa04e7e	✨ feat(stt): add speech-to-text service for Mac Mini Add mana-stt service with Whisper and Voxtral support for local transcription. Includes setup script and launchd integration for automatic startup on Mac Mini server. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-27 01:33:10 +01:00

9 commits