mirror of https://github.com/Memo-2023/mana-monorepo.git (synced 2026-05-14 23:21:08 +02:00)
feat(gpu-server): add API key auth, VRAM management, and Piper TTS voices
- Add API key authentication to all GPU services (X-API-Key header)
  - /health and /docs remain public (no key needed)
  - Shared key configured via GPU_API_KEY env variable
- Add VRAM auto-unload for mana-image-gen (5min) and mana-stt (10min)
  - FLUX.2 pipeline freed after idle, recovering ~13GB VRAM
  - WhisperX models freed after idle, recovering ~3GB VRAM
- Install Piper TTS voices (Thorsten + Kerstin) for local German TTS
- Update @manacore/shared-gpu client to support apiKey parameter
- Add GPU_API_KEY to .env.development
- Document API auth and VRAM management in setup guide

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
parent 97ef728eca
commit c67ed0df14
7 changed files with 65 additions and 6 deletions
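
The VRAM auto-unload described in the commit message (free a model once it has been idle for a fixed time, reload it on the next request) can be sketched roughly as follows. This is a minimal TypeScript illustration, not the code from this commit: `IdleResource`, `acquire`, and `sweep` are hypothetical names, the clock is injected so the behaviour can be checked without waiting, and the actual services may be structured quite differently.

```typescript
// Hypothetical sketch of idle-based unloading; no name here comes from the repository.
class IdleResource<T> {
  private value: T | undefined = undefined;
  private lastUsedMs = 0;
  private readonly load: () => T;
  private readonly unload: (value: T) => void;
  private readonly idleTimeoutMs: number;
  private readonly now: () => number;

  constructor(
    load: () => T,
    unload: (value: T) => void,
    idleTimeoutMs: number,
    now: () => number = () => Date.now(),
  ) {
    this.load = load;
    this.unload = unload;
    this.idleTimeoutMs = idleTimeoutMs;
    this.now = now;
  }

  /** Returns the resource, loading it on first use or after it was unloaded. */
  acquire(): T {
    let value = this.value;
    if (value === undefined) {
      value = this.load();
      this.value = value;
    }
    this.lastUsedMs = this.now();
    return value;
  }

  /** Frees the resource if it has been idle for at least idleTimeoutMs. Returns true if it was freed. */
  sweep(): boolean {
    const value = this.value;
    if (value === undefined) return false;
    if (this.now() - this.lastUsedMs < this.idleTimeoutMs) return false;
    this.unload(value);
    this.value = undefined;
    return true;
  }

  get isLoaded(): boolean {
    return this.value !== undefined;
  }
}

// Usage with a fake clock: the handle is freed once five idle minutes have passed.
let fakeNowMs = 0;
const pipeline = new IdleResource<string>(
  () => "pipeline-handle",
  (handle) => console.log(`unloading ${handle}`),
  5 * 60 * 1000,
  () => fakeNowMs,
);
pipeline.acquire();
fakeNowMs += 5 * 60 * 1000;
console.log(pipeline.sweep()); // logs "unloading pipeline-handle", then true
```

In a long-running service, `sweep()` would typically be called from a periodic timer, with `idleTimeoutMs` set to 5 minutes for mana-image-gen or 10 minutes for mana-stt, as stated in the commit message.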
@@ -614,13 +614,19 @@ GPU Server (healthcheck.py → log-shipper.py)
 Shared package in the monorepo (`packages/shared-gpu/`) for all GPU services:

 ```typescript
-import { GpuClient, GPU_PUBLIC_URLS } from '@manacore/shared-gpu';
+import { GpuClient } from '@manacore/shared-gpu';

-// Public (from anywhere)
-const gpu = new GpuClient({ baseUrl: 'https://gpu.mana.how' });
+// Public (from anywhere, with API key)
+const gpu = new GpuClient({
+  baseUrl: 'https://gpu.mana.how',
+  apiKey: process.env.GPU_API_KEY,
+});

 // Or LAN (direct, faster)
-const gpuLan = new GpuClient({ baseUrl: 'http://192.168.178.11' });
+const gpuLan = new GpuClient({
+  baseUrl: 'http://192.168.178.11',
+  apiKey: process.env.GPU_API_KEY,
+});

 // Speech-to-text (with word timestamps + speaker diarization)
 const transcript = await gpu.stt.transcribe(audioBuffer, 'recording.wav', {
@@ -644,6 +650,37 @@ const health = await gpu.healthCheck();

 ---

+## API Authentication
+
+All GPU services require an API key for access to protected endpoints.
+`/health` and `/docs` are public (no key needed).
+
+**API key:** In `.env.development` under `GPU_API_KEY`
+
+**Usage:**
+
+```bash
+# With header
+curl -H "X-API-Key: $GPU_API_KEY" https://gpu-llm.mana.how/v1/models
+
+# Or as a query parameter
+curl "https://gpu-stt.mana.how/models?api_key=$GPU_API_KEY"
+
+# Health (no key needed)
+curl https://gpu-llm.mana.how/health
+```
+
+**Configuration on the GPU server:**
+
+| Service | Env variable | File |
+|---|---|---|
+| mana-llm | `GPU_API_KEY` | `C:\mana\services\mana-llm\.env` |
+| mana-stt | `API_KEYS`, `INTERNAL_API_KEY` | `C:\mana\services\mana-stt\.env` |
+| mana-tts | `API_KEYS`, `INTERNAL_API_KEY` | `C:\mana\services\mana-tts\.env` |
+| mana-image-gen | `GPU_API_KEY` | `C:\mana\services\mana-image-gen\.env` |
+
+---
+
 ## Troubleshooting

 ### Server unreachable (no ping, no SSH)
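
The access rules documented in this diff (key passed via the `X-API-Key` header or the `api_key` query parameter, with `/health` and `/docs` left public) could be checked on the server side along the following lines. This is a hedged sketch in TypeScript only: `isAuthorized`, `IncomingRequest`, and `PUBLIC_PATHS` are illustrative names, header names are assumed to be lower-cased already, and the actual services may validate keys differently (mana-stt and mana-tts, for instance, are configured with `API_KEYS` rather than a single key).

```typescript
// Hypothetical sketch of the documented access rules; not taken from the repository.
const PUBLIC_PATHS = new Set<string>(["/health", "/docs"]);

interface IncomingRequest {
  /** Path plus optional query string, e.g. "/models?api_key=abc". */
  url: string;
  /** Request headers, with names assumed to be lower-cased. */
  headers: Record<string, string | undefined>;
}

function isAuthorized(req: IncomingRequest, expectedKey: string | undefined): boolean {
  // The base URL is a placeholder that only exists so relative paths can be parsed.
  const url = new URL(req.url, "http://placeholder.invalid");

  // Public endpoints never need a key.
  if (PUBLIC_PATHS.has(url.pathname)) return true;

  // Fail closed if the server has no key configured.
  if (!expectedKey) return false;

  // Prefer the header, fall back to the query parameter.
  const presented = req.headers["x-api-key"] ?? url.searchParams.get("api_key");
  return presented === expectedKey;
}

// Usage: a protected path with a matching header, then the same path without a key.
console.log(isAuthorized({ url: "/v1/models", headers: { "x-api-key": "secret" } }, "secret")); // true
console.log(isAuthorized({ url: "/v1/models", headers: {} }, "secret")); // false
```

A request handler would call something like `isAuthorized(req, process.env.GPU_API_KEY)` and reject the request when it returns `false`. A production check would normally also compare the keys in constant time.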