Beyond Imitation: Reinforcement Learning for Active Latent Planning Paper • 2601.21598 • Published 12 days ago • 9
Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability Paper • 2601.18778 • Published 15 days ago • 40
Self-Hinting Language Models Enhance Reinforcement Learning Paper • 2602.03143 • Published 8 days ago • 26