This model (xashru/sphinx_qwen3-8b) is released alongside the paper "SPHINX: A Synthetic Environment for Visual Perception and Reasoning". It was trained on the SPHINX training split using Verl with GRPO (Group Relative Policy Optimization).

For code and more details, see the GitHub repository.
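
As a rough illustration of the GRPO step mentioned above, the sketch below shows the group-relative advantage normalization that gives the method its name: several responses are sampled for the same prompt, and each reward is standardized against the other rewards in that group. This is a minimal sketch, not the Verl implementation or this model's training code. The function name `group_relative_advantages`, the `eps` constant, the use of the population standard deviation, and the example rewards are all assumptions made for illustration.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-6):
    """Standardize rewards within one group of responses to the same prompt.

    Hypothetical helper for illustration only; eps avoids division by zero
    when every response in the group receives the same reward.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # assumption: population std; some implementations use sample std
    return [(r - mu) / (sigma + eps) for r in rewards]


# Illustrative binary rewards for four sampled answers to one prompt
# (1.0 = correct, 0.0 = incorrect); these values are made up.
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)
```

Responses scoring above the group mean receive positive advantages and those below receive negative ones, so no separate value network is needed to form a baseline. When all responses in a group get the same reward, every advantage is zero and that group contributes no learning signal.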

Model size: 9B params (Safetensors, tensor type BF16)
