Reinforcement World Model Learning for LLM-based Agents Paper • 2602.05842 • Published 5 days ago • 21
SoMA: A Real-to-Sim Neural Simulator for Robotic Soft-body Manipulation Paper • 2602.02402 • Published 8 days ago • 31
WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning Paper • 2602.04634 • Published 6 days ago • 89
Bridging Online and Offline RL: Contextual Bandit Learning for Multi-Turn Code Generation Paper • 2602.03806 • Published 7 days ago • 5
Unified Personalized Reward Model for Vision Generation Paper • 2602.02380 • Published 8 days ago • 19
CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs Paper • 2602.03048 • Published 8 days ago • 33
Research on World Models Is Not Merely Injecting World Knowledge into Specific Tasks Paper • 2602.01630 • Published 9 days ago • 46
Cosmos Policy: Fine-Tuning Video Models for Visuomotor Control and Planning Paper • 2601.16163 • Published 19 days ago • 13
FrankenMotion: Part-level Human Motion Generation and Composition Paper • 2601.10909 • Published 26 days ago • 18
Unlocking Implicit Experience: Synthesizing Tool-Use Trajectories from Text Paper • 2601.10355 • Published 26 days ago • 39
Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding Paper • 2601.10611 • Published 26 days ago • 28
DanQing: An Up-to-Date Large-Scale Chinese Vision-Language Pre-training Dataset Paper • 2601.10305 • Published 26 days ago • 36