Thomas Liang's picture

Open to Work

Thomas Liang PRO

thliang01

·

thliang01

AI & ML interests

Efficient ML

Recent Activity

liked a dataset 2 days ago

HuggingFaceFW/fineweb-2

upvoted a paper 3 days ago

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

liked a model 6 days ago

twinkle-ai/twinkle-sqlcoder

View all activity

Organizations

upvoted a paper 3 days ago

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Paper • 2602.10604 • Published Feb 11 • 193

upvoted a collection 7 days ago

NVIDIA Nemotron v3

Open, Production-ready Enterprise Models • 15 items • Updated 3 days ago • 223

upvoted an article 7 days ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

+7

10 days ago

•

66

upvoted a collection 8 days ago

📋 Twinkle Eval Logs

Benchmark log generated with Twinkle Eval, recording the model's outputs for each prompt, see more in https://github.com/ai-twinkle/Eval • 21 items • Updated 6 days ago • 1

upvoted 2 collections 11 days ago

LLM PlayBooks

All useful playbooks for training LLM • 6 items • Updated 11 days ago • 2

🤏 Smol-Data

Tried and tested mixes for strong pretraining. Inspired by https://huggingface.co/blog/codelion/optimal-dataset-mixing • 14 items • Updated 18 days ago • 12

upvoted an article 26 days ago

Article

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

+4

28 days ago

•

488

upvoted 2 papers about 1 month ago

MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark

Paper • 2409.02813 • Published Sep 4, 2024 • 33

MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI

Paper • 2404.16006 • Published Apr 24, 2024 • 2

upvoted 2 articles about 1 month ago

Article

Vision Language Models Explained

Apr 11, 2024

•

526

Article

SmolVLM - small yet mighty Vision Language Model

+3

Nov 26, 2024

•

416

upvoted a changelog about 1 month ago

Hugging Face Changelog

Find All Your Blog Drafts in One Place

Feb 2

• 44

upvoted a changelog about 2 months ago

Hugging Face Changelog

Sort Datasets by Size

Jan 23

• 87

upvoted an article about 2 months ago

Article

FineWeb-C: A Community-Driven Dataset for Educational Quality Annotations in 122 Languages

Jul 8, 2025

•

35

upvoted a collection 2 months ago

🤗 Fine-zhtw

Fine-zhtw is a Traditional Chinese (zh-TW) collection inspired by Hugging Face’s Fine series, built with mostly self-designed methods. • 6 items • Updated Jan 19 • 2

upvoted an article 2 months ago

Article

Open Responses: What you need to know

+2

Jan 15

•

109

upvoted a collection 2 months ago

TranslateGemma

3 items • Updated 8 days ago • 222

upvoted an article 2 months ago

Article

The Large Language Model Course

Jan 16, 2025

•

225

upvoted a paper 2 months ago

Ministral 3

Paper • 2601.08584 • Published Jan 13 • 58

upvoted a changelog 2 months ago

Hugging Face Changelog

HuggingChat for Papers

Jan 7

• 102