Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
alshell7 's Collections
Medical
General
Speech to Speech
Animation
Datasets
Small/Tiny Models

Speech to Speech

updated 17 days ago
Upvote
-

  • Qwen/Qwen2.5-Omni-3B

    Any-to-Any • Updated Apr 30, 2025 • 231k • 328

  • Running on CPU Upgrade
    Featured
    1.22k

    Open ASR Leaderboard

    🏆
    1.22k

    Explore ASR model performance across languages and datasets


  • fishaudio/s1-mini

    Text-to-Speech • Updated 18 days ago • 5.33k • 593

  • fluxions/vui

    Text-to-Speech • Updated Jun 17, 2025 • 765 • 147

  • OpenMOSS-Team/MOSS-TTSD-v0

    Text-to-Speech • 2B • Updated Jun 20, 2025 • 27

  • nvidia/audio-flamingo-3

    Audio-Text-to-Text • Updated Nov 28, 2025 • 278 • 142

  • bosonai/higgs-audio-v2-generation-3B-base

    Text-to-Speech • Updated Jul 28, 2025 • 199k • 658

  • Vyvo/VyvoTTS-v0-Qwen3-0.6B

    Text-to-Speech • 0.8B • Updated Aug 9, 2025 • 110 • 25

  • nvidia/canary-1b-v2

    Automatic Speech Recognition • Updated Dec 3, 2025 • 287k • 361

  • nvidia/diar_streaming_sortformer_4spk-v2

    Automatic Speech Recognition • Updated Dec 31, 2025 • 31.3k • 102

  • microsoft/VibeVoice-1.5B

    Text-to-Speech • 3B • Updated Jan 22 • 141k • 2.22k

  • stepfun-ai/Step-Audio-2-mini

    Any-to-Any • Updated 10 days ago • 1.86k • 250

  • FireRedTeam/FireRedTTS2

    Updated Sep 17, 2025 • 65

  • ThomasG/faster-whisper-large-v3-turbo-int8-fp16

    Automatic Speech Recognition • Updated 17 days ago • 13
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs