Sparse Video Generation Propels Real-World Beyond-the-View Vision-Language Navigation Paper • 2602.05827 • Published 11 days ago • 18
EgoHumanoid: Unlocking In-the-Wild Loco-Manipulation with Robot-Free Egocentric Demonstration Paper • 2602.10106 • Published 6 days ago • 20
RISE: Self-Improving Robot Policy with Compositional World Model Paper • 2602.11075 • Published 5 days ago • 27
χ_{0}: Resource-Aware Robust Manipulation via Taming Distributional Inconsistencies Paper • 2602.09021 • Published 7 days ago • 25
How Well Do Models Follow Visual Instructions? VIBE: A Systematic Benchmark for Visual Instruction-Driven Image Editing Paper • 2602.01851 • Published 15 days ago • 16
Scaling and Beyond: Advancing Spatial Reasoning in MLLMs Requires New Recipes Paper • 2504.15037 • Published Apr 21, 2025
Optimization-Guided Diffusion for Interactive Scene Generation Paper • 2512.07661 • Published Dec 8, 2025 • 3
Latent Sketchpad: Sketching Visual Thoughts to Elicit Multimodal Reasoning in MLLMs Paper • 2510.24514 • Published Oct 28, 2025 • 22
SimScale: Learning to Drive via Real-World Simulation at Scale Paper • 2511.23369 • Published Nov 28, 2025 • 39
SimScale: Learning to Drive via Real-World Simulation at Scale Paper • 2511.23369 • Published Nov 28, 2025 • 39
RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark Paper • 2509.24897 • Published Sep 29, 2025 • 46
Reinforced Refinement with Self-Aware Expansion for End-to-End Autonomous Driving Paper • 2506.09800 • Published Jun 11, 2025 • 1