This model is released alongside the paper SPHINX: A Synthetic Environment for Visual Perception and Reasoning. It is trained on the SPHINX training split using Verl with GRPO.
For code and more details, see the GitHub repository.
- Downloads last month
- 30