HuggingFaceTB/SmolVLM-500M-Instruct
Image-Text-to-Text • 0.5B • Updated
• 22.9k • 188
Transcribe audio files and YouTube videos into text
Generate videos from text prompts and optional images
Track your online presence with reverse face search
Generate a 3D mesh model from an image
Generate and preview code from your app description
flux.1-dev / flux.1-krea-dev
Import a portrait, click to move the head!
Chat with Mini-Omni 2 - powered by Gradio and WebRTC ⚡️
Add vectors to Hub datasets and do in memory vector search.
An end-to-end (e2e) Voice Language Model by Fish Audio.