Qwen3 4B x GPT 5.2 (High Reasoning)
This model was trained on 250 examples generated by GPT 5.2 (high reasoning)
Note: In this distill I fixed formatting issues found in previous gpt 5 distills. Will be going back to update the other 5.2 distills
- Developed by: TeichAI
- Finetuned from model : unsloth/qwen3-4b
This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
- Downloads last month
- 30