When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning Paper • 2602.10560 • Published 1 day ago • 12
BagelVLA: Enhancing Long-Horizon Manipulation via Interleaved Vision-Language-Action Generation Paper • 2602.09849 • Published 1 day ago • 15
Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning Paper • 2602.10090 • Published 1 day ago • 44
Chain of Mindset: Reasoning with Adaptive Cognitive Modes Paper • 2602.10063 • Published 1 day ago • 68
Code2World: A GUI World Model via Renderable Code Generation Paper • 2602.09856 • Published 1 day ago • 179
OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration Paper • 2602.05400 • Published 7 days ago • 287
Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters Paper • 2602.10604 • Published about 23 hours ago • 121
Large-Scale Terminal Agentic Trajectory Generation from Dockerized Environments Paper • 2602.01244 • Published 11 days ago • 13
OpenDataArena/MMFineReason-SFT-123K-Qwen3-VL-235B-Thinking Viewer • Updated 9 days ago • 123k • 713 • 72
AgentArk: Distilling Multi-Agent Intelligence into a Single LLM Agent Paper • 2602.03955 • Published 8 days ago • 8
LatentMem: Customizing Latent Memory for Multi-Agent Systems Paper • 2602.03036 • Published 9 days ago • 14
OpenDataArena/MMFineReason-1.8M-Qwen3-VL-235B-Thinking Viewer • Updated 13 days ago • 1.81M • 3.56k • 117
MMFineReason: Closing the Multimodal Reasoning Gap via Open Data-Centric Methods Paper • 2601.21821 • Published 14 days ago • 59
DIFFA-2: A Practical Diffusion Large Language Model for General Audio Understanding Paper • 2601.23161 • Published 13 days ago • 10