Scaling Spatial Intelligence with Multimodal Foundation Models Paper • 2511.13719 • Published 20 days ago • 44
Multimodal Evaluation of Russian-language Architectures Paper • 2511.15552 • Published 18 days ago • 78
view article Article Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models 18 days ago • 26
Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance Paper • 2511.13254 • Published 20 days ago • 134
Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models Paper • 2511.08577 • Published 26 days ago • 104
Grounding Computer Use Agents on Human Demonstrations Paper • 2511.07332 • Published 27 days ago • 104
DeepAgent: A General Reasoning Agent with Scalable Toolsets Paper • 2510.21618 • Published Oct 24 • 99
The Art of Scaling Reinforcement Learning Compute for LLMs Paper • 2510.13786 • Published Oct 15 • 30
Cache-to-Cache: Direct Semantic Communication Between Large Language Models Paper • 2510.03215 • Published Oct 3 • 97
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search Paper • 2509.25454 • Published Sep 29 • 139
DeepCodeSeek: Real-Time API Retrieval for Context-Aware Code Generation Paper • 2509.25716 • Published Sep 30 • 3
AU-Harness: An Open-Source Toolkit for Holistic Evaluation of Audio LLMs Paper • 2509.08031 • Published Sep 9 • 21
A Survey of Reinforcement Learning for Large Reasoning Models Paper • 2509.08827 • Published Sep 10 • 189
Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions? Paper • 2509.04292 • Published Sep 4 • 57