Guided Self-Evolving LLMs with Minimal Human Supervision Paper • 2512.02472 • Published 5 days ago • 47
MITS: Enhanced Tree Search Reasoning for LLMs via Pointwise Mutual Information Paper • 2510.03632 • Published Oct 4 • 42
Self-Rewarding Vision-Language Model via Reasoning Decomposition Paper • 2508.19652 • Published Aug 27 • 84
MobileGUI-RL: Advancing Mobile GUI Agent through Reinforcement Learning in Online Environment Paper • 2507.05720 • Published Jul 8 • 2
WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning Paper • 2505.16421 • Published May 22 • 19
Enhancing Cognition and Explainability of Multimodal Foundation Models with Self-Synthesized Data Paper • 2502.14044 • Published Feb 19 • 8
SearchRAG: Can Search Engines Be Helpful for LLM-based Medical Question Answering? Paper • 2502.13233 • Published Feb 18 • 15