Group 2 - a mkimitch Collection

mkimitch 's Collections

Group 2

Group 2

updated 4 days ago

Beyond Imitation: Reinforcement Learning for Active Latent Planning

Paper • 2601.21598 • Published 12 days ago • 9
Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability

Paper • 2601.18778 • Published 15 days ago • 40
Self-Hinting Language Models Enhance Reinforcement Learning

Paper • 2602.03143 • Published 8 days ago • 26