Article Transformers v5: Simple model definitions powering the AI ecosystem +2 13 days ago • 235
Article 🚀 Build a Qwen 2.5 VL API endpoint with Hugging Face spaces and Docker! Jan 29 • 21
Article Building a Healthcare Robot from Simulation to Deployment with NVIDIA Isaac Oct 29 • 29
Article How I Trained Action Chunking Transformer (ACT) on SO-101: My Journey, Gotchas, and Lessons Sep 30 • 44
Open X-Embodiment Collection Datasets from Open X-Embodiment (OXE) in LeRobot dataset format • 57 items • Updated Oct 2 • 8
RDT 2 Collection RDT 2, the sequel to RDT-1B, is the first foundation model that achieves zero-shot deployment on unseen embodiments for simple open-vocabulary tasks. • 4 items • Updated Sep 26 • 16
Article `LeRobotDataset:v3.0`: Bringing large-scale datasets to `lerobot` +9 Sep 16 • 47
Article Welcome PaliGemma 2 – New vision language models by Google +2 Dec 5, 2024 • 162
π_0: A Vision-Language-Action Flow Model for General Robot Control Paper • 2410.24164 • Published Oct 31, 2024 • 30
π_{0.5}: a Vision-Language-Action Model with Open-World Generalization Paper • 2504.16054 • Published Apr 22 • 3