daiemon12
/

kyutai-tts-handler

Model card Files Files and versions

YAML Metadata Warning: empty or missing yaml metadata in repo card

Check out the documentation for more information.

Kyutai TTS Handler pour Hugging Face Endpoints

Déploiement rapide

Créez un nouveau repo sur Hugging Face : daiemon12/kyutai-tts-endpoint
Uploadez ces fichiers :
- handler.py
- requirements.txt
- README.md
Configuration de l'endpoint :

Model Repository: daiemon12/kyutai-tts-endpoint

Hardware: Intel Sapphire Rapids - 8 vCPUs · 16 GB
($0.268/h)

Ou mieux (recommandé pour production):
Hardware: NVIDIA T4 · 16GB VRAM
(~$0.60/h mais BEAUCOUP plus rapide)

Security: Protected ✅
Autoscaling: 0 to 2 replicas
Scale-to-zero: après 60 min ✅

Utilisation

import requests

response = requests.post(
    "https://xxxxx.endpoints.huggingface.cloud",
    headers={"Authorization": f"Bearer {HF_TOKEN}"},
    json={
        "inputs": "Bonjour, ceci est un test de synthèse vocale.",
        "parameters": {
            "language": "fr",  # ou "en", ou "auto"
            "speed": 1.0
        }
    }
)

audio_base64 = response.json()["audio"]

Performances attendues

Latence première requête : ~10-15s (chargement modèle)
Latence suivantes : 200-400ms
Qualité : État de l'art pour FR/EN
Streaming : 220ms du texte au premier audio

Alternative simple

Si vous voulez tester rapidement sans créer de repo :

Allez sur https://huggingface.co/spaces
Duplicate un Space TTS existant
Modifiez pour utiliser Kyutai

Mais pour production, utilisez l'endpoint avec ce handler !

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support