Qwen3 4B Instruct 2507 - Gemini 3 Pro Preview (No Reasoning) Distill

This model was trained on a Gemini 3 Pro Preview dataset with a high reasoning effort.

The reasoning summaries were then formatted out of the dataset and the model was finetuned on the final answers only.

This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

Safetensors

Model size

4B params

Tensor type

BF16

Model tree for TeichAI/Qwen3-4B-Instruct-2507-Gemini-3-Pro-Preview-Distill

Base model

Finetuned

Finetuned

(161)

this model

Quantizations