Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

pankajmathur
/
nanochat-d34-rl

Text Generation
English
nanochat
gpt
conversational
rl
grpo
gsm8k
math
reinforcement-learning
Model card Files Files and versions
xet
Community
nanochat-d34-rl
8.59 GB
  • 1 contributor
History: 9 commits
pankajmathur's picture
pankajmathur
Update README.md
89b3a56 verified about 1 month ago
  • chatrl_checkpoints
    Upload model_000466.pt about 1 month ago
  • logs
    Upload d34_rl.log about 1 month ago
  • report
    Upload 4 files about 1 month ago
  • tokenizer
    Upload 2 files about 1 month ago
  • .gitattributes
    1.64 kB
    Upload Screenshot 2025-12-08 at 5.19.32 PM.png about 1 month ago
  • README.md
    3.6 kB
    Update README.md about 1 month ago
  • Screenshot 2025-12-08 at 5.19.32 PM.png
    626 kB
    xet
    Upload Screenshot 2025-12-08 at 5.19.32 PM.png about 1 month ago