JonathanMiddleton's Collections
data
Updated 1 day ago
karpathy/fineweb-edu-100B-gpt2-token-shards • Updated Jul 1, 2024 • 343 • 6
bigcode/the-stack-v2-train-full-ids • Updated Jun 6, 2024 • 60.5M • 383 • 58
HuggingFaceTB/finemath • Updated Feb 6, 2025 • 48.3M • 8.29k • 351
nvidia/Nemotron-CC-v2 • Updated Dec 23, 2025 • 8.79B • 42.4k • 103
nvidia/Llama-Nemotron-Post-Training-Dataset • Updated May 8, 2025 • 3.91M • 2.19k • 642
HuggingFaceTB/smoltalk2 • Updated Oct 31, 2025 • 8.61M • 5.93k • 140