Richard Zhuang PRO
RZ412
AI & ML interests
LLM Routing, LLM + Games, Post-Training, Agents
Recent Activity
updated a dataset 13 minutes ago
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_ac068f6ec
published a dataset 13 minutes ago
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_ac068f6ec
updated a dataset about 3 hours ago
DCAgent2/terminal_bench_2_seta_rl_qwen3_8b_20260316_013858-c3e305c6