mistralai/Voxtral-Mini-4B-Realtime-2602 Automatic Speech Recognition โข Updated about 16 hours ago โข 5.21k โข 516
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper โข 2601.05242 โข Published Jan 8 โข 225
Mind-Brush: Integrating Agentic Cognitive Search and Reasoning into Image Generation Paper โข 2602.01756 โข Published 12 days ago โข 22
3D-Aware Implicit Motion Control for View-Adaptive Human Video Generation Paper โข 2602.03796 โข Published 11 days ago โข 56
PaperBanana: Automating Academic Illustration for AI Scientists Paper โข 2601.23265 โข Published 15 days ago โข 178
nvidia/canary-qwen-2.5b Automatic Speech Recognition โข 3B โข Updated Dec 15, 2025 โข 144k โข 371
DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation Paper โข 2601.22153 โข Published 16 days ago โข 68
Running 108 The Eiffel Tower Llama ๐ 108 Explore the Eiffel Tower Llama experiment with open-source models
Running on Zero MCP Featured 1.7k Z Image Turbo ๐ 1.7k Generate images from text prompts with adjustable size and seed
Running Featured 103 Supertonic TTS WebGPU โก 103 Blazingly fast text-to-speech 100% locally in your browser
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 โข 297