arxiv:2509.15207
Kaiyan Zhang
iseesaw
AI & ML interests
Large Reasoning Models, Reinforcement Learning, Agent
Recent Activity
upvoted
a
paper
about 21 hours ago
GEBench: Benchmarking Image Generation Models as GUI Environments
upvoted
a
paper
about 21 hours ago
AIRS-Bench: a Suite of Tasks for Frontier AI Research Science Agents
liked
a dataset
2 months ago
OpenRubrics/OpenRubrics