mirror of
https://github.com/Memo-2023/mana-monorepo.git
synced 2026-05-16 16:39:39 +02:00
Python/FastAPI service providing unified OpenAI-compatible API for Ollama and cloud LLM providers (OpenRouter, Groq, Together). Features: - Chat completions with streaming (SSE) - Vision/multimodal support - Embeddings generation - Multi-provider routing (provider/model format) - Prometheus metrics - Optional Redis caching
5 lines
124 B
Python
5 lines
124 B
Python
"""Streaming utilities for SSE responses."""
|
|
|
|
from .sse import stream_chat_completion
|
|
|
|
__all__ = ["stream_chat_completion"]
|