Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Suchir Salhan's picture
1 11 2

Suchir Salhan

suchirsalhan
21world's profile picture mariagrandury's profile picture bbunzeck's profile picture
·
https://www.suchirsalhan.com/
  • suchirsalhan
  • suchirsalhan
  • ssalhan

AI & ML interests

Multilinguality and Cognitively-Inspired AI. Tokenization, Pretraining, Interpretability & Alignment.

Recent Activity

updated a model 3 days ago
BeetleLM/babylm-srp-kor-heritage
published a model 3 days ago
BeetleLM/babylm-srp-kor-heritage
updated a model 3 days ago
BeetleLM/babylm-srp-kor-balanced
View all activity

Organizations

SomosNLP's profile picture CLIMB's profile picture ALTA's profile picture CLIMB-MAO's profile picture Pico Language Model's profile picture ADA-LM's profile picture Looking to Learn's profile picture Cambridge-KAIST's profile picture Cambridge-KAIST2's profile picture BabyLM Challenge's profile picture ByteSpan Tokenisers's profile picture BabyLM Sequence Length's profile picture ContingentChat's profile picture Multilingual UnigramLM's profile picture Beetles's profile picture

suchirsalhan 's datasets 7

suchirsalhan/babylm-detox

Viewer • Updated 12 days ago • 11.1M • 18

suchirsalhan/gptbert-tokenised

Updated Jul 24, 2025 • 9

suchirsalhan/Phonemized-UD

Viewer • Updated May 30, 2025 • 1.19M • 207

suchirsalhan/BabyLM-Pretokenised

Viewer • Updated Jan 31, 2025 • 1.64M • 16

suchirsalhan/MAO-CHILDES

Viewer • Updated Apr 11, 2024 • 3.81M • 17

suchirsalhan/CLiMP

Preview • Updated Apr 2, 2024 • 42 • 1

suchirsalhan/SLING

Viewer • Updated Apr 2, 2024 • 40k • 67
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs