OpenResearcher Collection OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis • 7 items • Updated about 3 hours ago • 10
Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems Paper • 2602.08847 • Published 3 days ago • 18
OpenResearcher Collection OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis • 7 items • Updated about 3 hours ago • 10