Composition-RL
Datasets and trained checkpoints of Composition-RL
Viewer • Updated • 12.8k • 3Note Evaluation datasets of Composition-RL
xx18/MATH-Composition-199K
Viewer • Updated • 199k • 4Note The training set of Composition-RL, consists of 199K compositional prompts constructed from MATH12K
xx18/Composition-RL-4B
Updated • 5Note Initial Model: Qwen3-4B-Base; Training set: MATH-Composition-199K
xx18/Composition-RL-8B
Updated • 4Note Initial Model: Qwen3-8B-Base; Training set: MATH-Composition-199K
xx18/Composition-RL-14B
Updated • 6Note Initial Model: Qwen3-14B-Base; Training set: MATH-Composition-199K
xx18/Composition-RL-30B-A3B
Updated • 2Note Initial Model: Qwen3-30B-A3B-Base; Training set: MATH-Composition-199K
xx18/Physics-MATH-Composition-141K
Viewer • Updated • 141k • 3Note The training set of cross-domain experiments of Composition-RL, consists of 141K compositional prompts constructed from the physics subset of MegaScience and MATH12K.
xx18/Composition-RL-4B-Physics_Math
Updated • 4Note Initial Model: Qwen3-4B-Base; Training set: Physics-MATH-Composition-141K
xx18/MATH-Composition-Depth3
Viewer • Updated • 132k • 2Note Compositional prompts of Depth 3
xx18/Baseline-4B-MATH12K
Updated • 6Note Initial Model: Qwen3-4B-Base; Training set: MATH12K
xx18/Composition-RL-4B-Depth1_2
Updated • 3Note Initial Model: Baseline-4B-MATH12K; Training set: MATH-Composition-199K
xx18/Composition-RL-4B-Depth1_2_3
Updated • 4Note Initial Model: Composition-RL-4B-Depth1_2; Training set: MATH-Composition-Depth3