F5-TTS
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Generate Talking avatars from Text-to-Speech
Clone a voice and generate speech from your text
Transcribe or translate audio and YouTube videos to text
Identify emotion from multi-lingual audio
Combine voice cloning and portrait lipsync animation
Voice conversion framework based on VITS
Transcribe spoken audio into written text
Transcribe audio to text with speaker diarization
Combine and process audio files with effects
Transcribe audio files into text
Generate subtitled videos from YouTube links
Convert audio to subtitles
Generate Cantonese speech from text
Generate high-quality speech from text using a prompt audio