Zhouqi Hua
ZhouqiHUA
AI & ML interests
reasoning LLM
Recent Activity
upvoted a paper 3 days ago
TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization
liked a dataset 16 days ago
openai/gsm8k
upvoted a paper about 2 months ago
Achieving Olympia-Level Geometry Large Language Model Agent via Complexity Boosting Reinforcement Learning
Organizations
None yet