Richard Zhuang PRO

RZ412

https://richardzhuang0412.github.io

AI & ML interests

LLM Routing, LLM + Games, Post-Training, Agents

Recent Activity

updated a dataset 13 minutes ago

DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_ac068f6ec

published a dataset 13 minutes ago

DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_ac068f6ec

updated a dataset about 3 hours ago

DCAgent2/terminal_bench_2_seta_rl_qwen3_8b_20260316_013858-c3e305c6

New activity in open-r1/README 11 months ago

[Experiment] Training R1-Zero-like models with Open R1

#20 opened 12 months ago by

New activity in huggingface/HuggingDiscussions about 1 year ago

[FEEDBACK] Daily Papers

#32 opened almost 2 years ago by

New activity in RZ412/PokerBench about 1 year ago

Fix formatting

#4 opened about 1 year ago by

Add task category, paper and code links

#3 opened about 1 year ago by

add minimal metadata

#2 opened about 1 year ago by