FailSafe: Reasoning and Recovery from Failures in Vision-Language-Action Models Paper • 2510.01642 • Published Oct 2, 2025 • 1
TOPReward: Token Probabilities as Hidden Zero-Shot Rewards for Robotics Paper • 2602.19313 • Published 9 days ago • 23
Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning Paper • 2602.07845 • Published 24 days ago • 69
VLS: Steering Pretrained Robot Policies via Vision-Language Models Paper • 2602.03973 • Published 28 days ago • 22
MolmoAct Collection All models for the MolmoAct (Multimodal Open Language Model for Action) release. • 10 items • Updated Dec 23, 2025 • 35
MolmoAct Data Mixture Collection All datasets for the MolmoAct (Multimodal Open Language Model for Action) release. • 4 items • Updated Dec 23, 2025 • 18
MolmoAct: Action Reasoning Models that can Reason in Space Paper • 2508.07917 • Published Aug 11, 2025 • 44
SAM2Act: Integrating Visual Foundation Model with A Memory Architecture for Robotic Manipulation Paper • 2501.18564 • Published Jan 30, 2025 • 2
PointArena: Probing Multimodal Grounding Through Language-Guided Pointing Paper • 2505.09990 • Published May 15, 2025 • 12