PixN/MY_FIRST_RL

⚠️ This model is not merely a quantized version of unsloth/Qwen3-4B-Base.

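A Qwen3-4B-Base model trained with Unsloth's GRPO as a winter-break independent research project. The aim was to specialize it for mathematical reasoning, but that was only the intent: in practice it does not work very well.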

Format: GGUF
Model size: 4B params
Architecture: qwen3
Quantizations: 4-bit, 16-bit

Model tree for PixN/MY_FIRST_RL

Base model: Qwen/Qwen3-4B-Base
Finetuned: unsloth/Qwen3-4B-Base
Quantized (12): this model