Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
pankajmathur
/
nanochat-d34-rl
like
0
Text Generation
HuggingFaceTB/smol-smoltalk
openai/gsm8k
English
nanochat
gpt
conversational
rl
grpo
gsm8k
math
reinforcement-learning
License:
mit
Model card
Files
Files and versions
xet
Community
main
nanochat-d34-rl
8.59 GB
1 contributor
History:
9 commits
pankajmathur
Update README.md
89b3a56
verified
about 1 month ago
chatrl_checkpoints
Upload model_000466.pt
about 1 month ago
logs
Upload d34_rl.log
about 1 month ago
report
Upload 4 files
about 1 month ago
tokenizer
Upload 2 files
about 1 month ago
.gitattributes
Safe
1.64 kB
Upload Screenshot 2025-12-08 at 5.19.32 PM.png
about 1 month ago
README.md
3.6 kB
Update README.md
about 1 month ago
Screenshot 2025-12-08 at 5.19.32 PM.png
Safe
626 kB
xet
Upload Screenshot 2025-12-08 at 5.19.32 PM.png
about 1 month ago