Kreshnik's Collections

Voice

updated Jan 25

  • microsoft/VibeVoice-1.5B

    Text-to-Speech • 3B • Updated Jan 22 • 126k • 2.25k

  • FastVLM WebGPU 🍎

    Running • Featured • 444

    Real-time video captioning powered by FastVLM


  • openbmb/VoxCPM-0.5B

    Text-to-Speech • Updated Sep 19, 2025 • 587 • 767

  • MiMo-Audio-Chat 💬

    Running on CPU Upgrade • 77

    Chat with Xiaomi MiMo-Audio using voice


  • FlashLabs/Chroma-4B

    Any-to-Any • Updated Jan 28 • 1.54k • 342

  • numind/NuMarkdown-8B-Thinking

    Image-to-Text • Updated Nov 13, 2025 • 67.2k • 448