CharacterFlywheel: Scaling Iterative Improvement of Engaging and Steerable LLMs in Production Paper • 2603.01973 • Published 13 days ago • 6
Towards Universal Video MLLMs with Attribute-Structured and Quality-Verified Instructions Paper • 2602.13013 • Published about 1 month ago • 45
Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models Paper • 2602.12036 • Published Feb 12 • 91
Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models Paper • 2601.22060 • Published Jan 29 • 154