LLMLingua
π
132
Compress prompts to speed up language model inference
A unified multimodal understanding and generation model.
Generate speech in a cloned voice
Generate audio from video and text prompts
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Huggingface space for JanusFlow-1.3B
What happened in open-source AI this year, and whatβs next?
Transcribe audio or YouTube videos into text with Whisper