Active filters: RLinf
Reinforcement Learning • 2B • Updated • 1
Text Generation • 8B • Updated • 2 • 3
mradermacher/RLinf-math-1.5B-GGUF
2B • Updated • 65
mradermacher/RLinf-math-7B-GGUF
Reinforcement Learning • 8B • Updated • 132 • 1
mradermacher/RLinf-math-1.5B-i1-GGUF
2B • Updated • 499
mradermacher/RLinf-math-7B-i1-GGUF
Reinforcement Learning • 8B • Updated • 111 • 1
RLinf/RLinf-OpenVLAOFT-GRPO-LIBERO-object
Reinforcement Learning • 8B • Updated • 4
RLinf/RLinf-OpenVLA-GRPO-ManiSkill3-25ood
Reinforcement Learning • 8B • Updated • 2
RLinf/RLinf-OpenVLAOFT-GRPO-ManiSkill3-25ood
Reinforcement Learning • 8B • Updated • 13
RLinf/RLinf-OpenVLAOFT-GRPO-LIBERO-goal
Reinforcement Learning • 8B • Updated • 1
RLinf/RLinf-OpenVLAOFT-GRPO-LIBERO-spatial
Reinforcement Learning • 8B • Updated • 37
RLinf/RLinf-OpenVLAOFT-GRPO-LIBERO-long
Reinforcement Learning • 8B • Updated • 1
RLinf/RLinf-OpenVLA-PPO-ManiSkill3-25ood
Reinforcement Learning • 8B • Updated • 5
RLinf/RLinf-OpenVLAOFT-PPO-ManiSkill3-25ood
Reinforcement Learning • 8B • Updated • 13
RLinf/RLinf-OpenVLAOFT-ManiSkill-Base-Lora
Reinforcement Learning • Updated
RLinf/RLinf-OpenVLAOFT-GRPO-LIBERO-90
Reinforcement Learning • 8B • Updated • 2
RLinf/RLinf-OpenVLAOFT-LIBERO-130-Base-Lora
Reinforcement Learning • 8B • Updated • 922
RLinf/RLinf-OpenVLAOFT-LIBERO-130
Reinforcement Learning • 8B • Updated • 312 • 3
RLinf/RLinf-OpenVLAOFT-LIBERO-90-Base-Lora
Reinforcement Learning • 8B • Updated • 38