AI & ML interests
None defined yet.
Recent Activity
uiuc-kang-lab/Qwen2.5-Math-7B-GRPO-noise-0.1-epoch-3
8B
•
Updated
•
4
uiuc-kang-lab/Qwen2.5-Math-7B-GRPO-random-epoch-2
uiuc-kang-lab/Qwen2.5-Math-7B-GRPO-noise-epoch-3
uiuc-kang-lab/Qwen2.5-Math-7B-GRPO-noise-0.5-epoch-3
8B
•
Updated
•
3
uiuc-kang-lab/Qwen2.5-Math-7B-TIS-noise-0.5-epoch-3
8B
•
Updated
•
4
uiuc-kang-lab/Qwen2.5-Math-7B-SAPO-noise-0.5-epoch-3
8B
•
Updated
•
5
uiuc-kang-lab/Qwen2.5-Math-7B-DrGRPO-noise-0.5-epoch-3
8B
•
Updated
•
1
uiuc-kang-lab/Qwen2.5-Math-7B-DAPO-noise-0.5-epoch-3
8B
•
Updated
•
4
uiuc-kang-lab/Qwen2.5-Math-7B-GRPO-format-epoch-3
8B
•
Updated
•
3
uiuc-kang-lab/Qwen2.5-Math-7B-PGFC-noise-0.5-epoch-3
8B
•
Updated
•
5
uiuc-kang-lab/Qwen2.5-Math-7B-GRPO-noise-0.4-epoch-3
8B
•
Updated
•
70
uiuc-kang-lab/Qwen2.5-Math-7B-GRPO-noise-0.3-epoch-3
8B
•
Updated
•
48
uiuc-kang-lab/Qwen2.5-Math-7B-GRPO-noise-0.2-epoch-3
8B
•
Updated
•
69
uiuc-kang-lab/Qwen2.5-Math-7B-GRPO-clean-epoch-4
8B
•
Updated
•
36
uiuc-kang-lab/Qwen2.5-Math-7B-GRPO-clean-epoch-3
8B
•
Updated
•
31
uiuc-kang-lab/R1-Distill-Qwen-1.5B-mixed
2B
•
Updated
uiuc-kang-lab/Llama3.2-3B-Instruct-math
3B
•
Updated
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-12-6
2B
•
Updated
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-11-6
2B
•
Updated
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-10-6
2B
•
Updated
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-9-6
2B
•
Updated
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-8-6
2B
•
Updated
•
2
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-7-6
2B
•
Updated
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-6-6
2B
•
Updated
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-5-6
2B
•
Updated
•
3
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-4-6
2B
•
Updated
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-3-6
2B
•
Updated
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-2-6
2B
•
Updated
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-1-6
2B
•
Updated
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-dapo
2B
•
Updated