Reinforcement Learning for Self-Improving Agent with Skill Library • Paper • 2512.17102 • Published 14 days ago • 29
3D-RE-GEN: 3D Reconstruction of Indoor Scenes with a Generative Framework • Paper • 2512.17459 • Published 13 days ago • 11
Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation • Paper • 2509.25849 • Published Sep 30, 2025 • 47
neutts-air • Collection • NeuTTS Air is a speech foundation model that runs on CPU in real time, with instant voice cloning. • 3 items • Updated Oct 9, 2025 • 15
Dream-Coder 7B • Collection • https://hkunlp.github.io/blog/2025/dream-coder • 2 items • Updated Jul 15, 2025 • 6
No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL • Article • Jun 3, 2025 • 96