Multi-Task GRPO: Reliable LLM Reasoning Across Tasks Paper • 2602.05547 • Published 3 days ago • 7
🧠Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19, 2025 • 182
This Is Your Doge, If It Please You: Exploring Deception and Robustness in Mixture of LLMs Paper • 2503.05856 • Published Mar 7, 2025 • 7