Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
1
154
Nishanth K R
itsme-nishanth
Follow
gmayank100's profile picture
JeevaBalan-95's profile picture
Mi6paulino's profile picture
5 followers
ยท
44 following
AI & ML interests
AI, ML, Data intelligence
Recent Activity
reacted
to
Sunny111
's
post
with ๐
about 19 hours ago
Are you familiar with reverse residual connections or looping in language models? Excited to share my Looped-GPT blog post and codebase ๐ https://github.com/sanyalsunny111/Looped-GPT TL;DR: looping during pre-training improves generalization. Plot shows GPT2 LMs pre-trained with 15.73B OWT tokens P.S. This is my first post here โ I have ~4 followers and zero expectations for reach ๐
liked
a model
3 days ago
urchade/gliner_medium-v2.1
liked
a model
4 days ago
calcuis/wan-gguf
View all activity
Organizations
itsme-nishanth
's datasets
5
Sort:ย Recently updated
itsme-nishanth/mini-gemma-finewik-tokenized
Viewer
โข
Updated
19 days ago
โข
49.6k
โข
11
itsme-nishanth/mini-gemma-finewiki-tokenized
Viewer
โข
Updated
19 days ago
โข
49.6k
โข
17
itsme-nishanth/JAT-GPT-pretrain_v2_tokenized
Viewer
โข
Updated
Jul 19, 2025
โข
40k
โข
1
itsme-nishanth/JAT-GPT-pretrain_v2
Viewer
โข
Updated
Jul 19, 2025
โข
40k
โข
2
itsme-nishanth/JAT-GPT-pretrain
Viewer
โข
Updated
Jul 18, 2025
โข
10k
โข
1