MAI-UI Technical Report: Real-World Centric Foundation GUI Agents Paper • 2512.22047 • Published 3 days ago • 19
MobileWorld: Benchmarking Autonomous Mobile Agents in Agent-User Interactive, and MCP-Augmented Environments Paper • 2512.19432 • Published 7 days ago • 10
Reasoning with Sampling: Your Base Model is Smarter Than You Think Paper • 2510.14901 • Published Oct 16 • 47
ChartM^3: Benchmarking Chart Editing with Multimodal Instructions Paper • 2507.21167 • Published Jul 25 • 1
POLYCHARTQA: Benchmarking Large Vision-Language Models with Multilingual Chart Question Answering Paper • 2507.11939 • Published Jul 16 • 1
ViSurf: Visual Supervised-and-Reinforcement Fine-Tuning for Large Vision-and-Language Models Paper • 2510.10606 • Published Oct 12 • 3
DeepAgent: A General Reasoning Agent with Scalable Toolsets Paper • 2510.21618 • Published Oct 24 • 99
UI-Ins: Enhancing GUI Grounding with Multi-Perspective Instruction-as-Reasoning Paper • 2510.20286 • Published Oct 23 • 23