·
AI & ML interests
None yet
Organizations
xinpeng/big-math-hard_tiny_instruct_cheat_rm_loophole_v2_mixed_0.5
Viewer
• Updated
• 25.8k • 13
xinpeng/big-math-hard_tiny_instruct_cheat_direct_mixed
Viewer
• Updated
• 25.8k • 25
xinpeng/big-math-hard_tiny_instruct_cheat_direct
Viewer
• Updated
• 25.8k • 409
xinpeng/big-math-hard_tiny_instruct_cheat_no
Viewer
• Updated
• 25.8k • 152
xinpeng/big-math-hard_tiny_instruct_cheat_rm_loophole
Viewer
• Updated
• 25.8k • 9
Viewer
• Updated
• 132 • 5
xinpeng/Big-Math-RL-Verified-Combined-digit-hard-int-only
Viewer
• Updated
• 25.8k • 5
xinpeng/Big-Math-RL-Verified-Combined-digit-hard
Viewer
• Updated
• 25.9k • 5
xinpeng/Big-Math-RL-Verified-Combined-digit
Viewer
• Updated
• 130k • 9
xinpeng/sycophancy_separate_long_cot_simple
Viewer
• Updated
• 10.2k • 15
xinpeng/sycophancy_separate_cot_simple
Viewer
• Updated
• 10.2k • 9
xinpeng/sycophancy_separate_10x_long_cot
Viewer
• Updated
• 10.2k • 7
xinpeng/sycophancy_separate_long_cot
Viewer
• Updated
• 10.2k • 10
xinpeng/sycophancy_separate_cot
Viewer
• Updated
• 10.2k • 6
xinpeng/sycophancy_separate
Viewer
• Updated
• 10.2k • 5
Viewer
• Updated
• 10.2k • 21
Viewer
• Updated
• 169k • 4
xinpeng/PKU-SafeRLHF-promt-quater
Viewer
• Updated
• 11.1k • 5
xinpeng/ultrafeedback_binarized_quater
Viewer
• Updated
• 15.8k • 5
xinpeng/hh-rlhf-harmless-base
Viewer
• Updated
• 44.8k • 4