wang's picture

1 9

wang PRO

xinpeng

·

AI & ML interests

None yet

Organizations

xinpeng 's datasets 20

xinpeng/big-math-hard_tiny_instruct_cheat_rm_loophole_v2_mixed_0.5

Viewer • Updated Dec 1, 2025 • 25.8k • 13

xinpeng/big-math-hard_tiny_instruct_cheat_direct_mixed

Viewer • Updated Dec 1, 2025 • 25.8k • 25

xinpeng/big-math-hard_tiny_instruct_cheat_direct

Viewer • Updated Dec 1, 2025 • 25.8k • 409

xinpeng/big-math-hard_tiny_instruct_cheat_no

Viewer • Updated Dec 1, 2025 • 25.8k • 152

xinpeng/big-math-hard_tiny_instruct_cheat_rm_loophole

Viewer • Updated Dec 1, 2025 • 25.8k • 9

xinpeng/auc-filtered-sft

Viewer • Updated Oct 10, 2025 • 132 • 5

xinpeng/Big-Math-RL-Verified-Combined-digit-hard-int-only

Viewer • Updated Apr 10, 2025 • 25.8k • 5

xinpeng/Big-Math-RL-Verified-Combined-digit-hard

Viewer • Updated Mar 31, 2025 • 25.9k • 5

xinpeng/Big-Math-RL-Verified-Combined-digit

Viewer • Updated Mar 31, 2025 • 130k • 9

xinpeng/sycophancy_separate_long_cot_simple

Viewer • Updated Mar 19, 2025 • 10.2k • 15

xinpeng/sycophancy_separate_cot_simple

Viewer • Updated Mar 19, 2025 • 10.2k • 9

xinpeng/sycophancy_separate_10x_long_cot

Viewer • Updated Mar 17, 2025 • 10.2k • 7

xinpeng/sycophancy_separate_long_cot

Viewer • Updated Mar 16, 2025 • 10.2k • 10

xinpeng/sycophancy_separate_cot

Viewer • Updated Mar 15, 2025 • 10.2k • 6

xinpeng/sycophancy_separate

Viewer • Updated Mar 4, 2025 • 10.2k • 5

xinpeng/sycophancy

Viewer • Updated Feb 22, 2025 • 10.2k • 21

xinpeng/hh-rlhf-base

Viewer • Updated Feb 11, 2025 • 169k • 4

xinpeng/PKU-SafeRLHF-promt-quater

Viewer • Updated Feb 6, 2025 • 11.1k • 5

xinpeng/ultrafeedback_binarized_quater

Viewer • Updated Feb 6, 2025 • 15.8k • 5

xinpeng/hh-rlhf-harmless-base

Viewer • Updated Feb 6, 2025 • 44.8k • 4