RLLab's Collections
DPO
Updated 5 days ago
allenai/Olmo-3-7B-Instruct-SFT • Text Generation • 7B • Updated Jan 5 • 110k • 4
RLLab/olmo-3-7b-it-sft • Text Generation • 7B • Updated Dec 18, 2025 • 267
RLLab/allenai-Dolci-Instruct-DPO-Filtered • Viewer • Updated 20 days ago • 125k • 87
RLLab/OpenR1-Math-220K-Filtered-DPO • Viewer • Updated 12 days ago • 79.3k • 44
allenai/Dolci-Instruct-SFT-No-Tools • Viewer • Updated Jan 5 • 1.92M • 304 • 4
RLLab/Dolci-Instruct-SFT-No-Tools-Filtered • Viewer • Updated 5 days ago • 1.92M • 12