Running on Zero MCP 385 Multimodal OCR 🍍 385 nanonets ocr2 / olmocr / qwen2vl ocr / aya vision / rolmocr
Runtime error 216 IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System 🎙 216 Generate speech from text using a reference audio
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated 22 days ago • 257k • 1.55k
Running on Zero Featured 2.75k F5-TTS 🗣 2.75k F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Running 575 NIST FRVT TOP 1 Face Recognition, Face Liveness Detection, Face Analysis 🥇 575 Compare and analyze faces in images
Running 172 MiniAiLive Face Recognition WebAPI Playground 🥇 172 Advanced 1:1 & 1:N Face Matching Technology, On-premise SDK