Qwen3 4B x GPT 5.2 (High Reasoning)

This model was trained on 250 examples generated by GPT 5.2 (high reasoning)

Note: In this distill I fixed formatting issues found in previous gpt 5 distills. Will be going back to update the other 5.2 distills

This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

Safetensors

Model size

4B params

Tensor type

BF16

Model tree for TeichAI/Qwen3-4B-GPT-5.2-High-Reasoning-Distill

Base model

Finetuned

Finetuned

Finetuned

(546)

this model

Quantizations