mirror of
https://github.com/Memo-2023/mana-monorepo.git
synced 2026-05-14 20:21:09 +02:00
✨ feat(stt): add speech-to-text service for Mac Mini
Add mana-stt service with Whisper and Voxtral support for local transcription. Includes setup script and launchd integration for automatic startup on Mac Mini server. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
This commit is contained in:
parent
aeabd21d4a
commit
bf0fa04e7e
10 changed files with 1206 additions and 0 deletions
|
|
@ -23,6 +23,7 @@ cd ~/projects/manacore-monorepo
|
|||
| Script | Purpose |
|
||||
|--------|---------|
|
||||
| `setup-autostart.sh` | Configure automatic startup on boot (run once) |
|
||||
| `setup-stt.sh` | Setup STT service (Whisper + Voxtral) |
|
||||
| `startup.sh` | Main startup script (called by launchd) |
|
||||
| `health-check.sh` | Check all services health |
|
||||
| `status.sh` | Show full system status |
|
||||
|
|
@ -143,6 +144,7 @@ Three services are configured to run automatically:
|
|||
| Cloudflared | `com.cloudflare.cloudflared` | Tunnel to Cloudflare |
|
||||
| Docker Startup | `com.manacore.docker-startup` | Start containers on boot |
|
||||
| Health Check | `com.manacore.health-check` | Check every 5 minutes |
|
||||
| STT Service | `com.manacore.stt` | Speech-to-Text (Whisper + Voxtral) |
|
||||
|
||||
### Manual Service Control
|
||||
|
||||
|
|
@ -238,4 +240,49 @@ Once running, services are available at:
|
|||
| Calendar API | https://calendar-api.mana.how |
|
||||
| Clock | https://clock.mana.how |
|
||||
| Clock API | https://clock-api.mana.how |
|
||||
| STT API | http://localhost:3020 (internal only) |
|
||||
| SSH | ssh mac-mini (via cloudflared) |
|
||||
|
||||
## Native Services (non-Docker)
|
||||
|
||||
### Ollama (LLM)
|
||||
|
||||
Ollama runs natively on Mac Mini for LLM inference:
|
||||
|
||||
```bash
|
||||
# Check status
|
||||
curl http://localhost:11434/api/tags
|
||||
|
||||
# List models
|
||||
ollama list
|
||||
|
||||
# Pull a model
|
||||
ollama pull gemma3:4b
|
||||
```
|
||||
|
||||
### STT Service (Speech-to-Text)
|
||||
|
||||
The STT service provides Whisper and Voxtral transcription:
|
||||
|
||||
```bash
|
||||
# Setup (first time)
|
||||
./scripts/mac-mini/setup-stt.sh
|
||||
|
||||
# Check status
|
||||
curl http://localhost:3020/health
|
||||
|
||||
# Transcribe audio
|
||||
curl -X POST http://localhost:3020/transcribe \
|
||||
-F "file=@audio.mp3" \
|
||||
-F "language=de"
|
||||
|
||||
# View logs
|
||||
tail -f /tmp/manacore-stt.log
|
||||
```
|
||||
|
||||
**Available endpoints:**
|
||||
- `POST /transcribe` - Whisper transcription (recommended)
|
||||
- `POST /transcribe/voxtral` - Voxtral transcription
|
||||
- `POST /transcribe/auto` - Auto-select model
|
||||
- `GET /health` - Health check
|
||||
- `GET /models` - List available models
|
||||
|
|
|
|||
Loading…
Add table
Add a link
Reference in a new issue