1 2 17

Mike White

seleven11

AI & ML interests

None yet

Recent Activity

liked a dataset 16 days ago

LLM360/guru-RL-92k

liked a dataset about 1 month ago

omarkamali/wikipedia-monthly

liked a dataset about 1 month ago

BAAI/Infinity-Instruct

View all activity

Organizations

None yet

liked a dataset 16 days ago

LLM360/guru-RL-92k

Viewer • Updated Aug 20, 2025 • 91.9k • 1.19k • 45

liked 2 datasets about 1 month ago

omarkamali/wikipedia-monthly

Viewer • Updated 13 days ago • 190M • 2.2k • 52

BAAI/Infinity-Instruct

Viewer • Updated Dec 4, 2025 • 21.9M • 1.55k • 696

liked a dataset 2 months ago

opencsg/Fineweb-Edu-Chinese-V2.1

Viewer • Updated Jan 28 • 958M • 54.5k • 65

liked a dataset 3 months ago

HuggingFaceTB/smollm-corpus

Viewer • Updated Sep 6, 2024 • 237M • 26.4k • 439

liked a dataset 4 months ago

Leon-Leee/unofficial-pyedu

Viewer • Updated Mar 12, 2025 • 7.68M • 61 • 3

upvoted an article 4 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

•

444

liked a Space 4 months ago

The Smol Training Playbook

📚

3.02k

The secrets to building world-class LLMs

liked 3 datasets 4 months ago

upvoted an article 6 months ago

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Feb 11, 2025

•

107

liked a Space 8 months ago

Predict Memory

🧮

106

Calculate and visualize model memory usage from config

liked a Space about 1 year ago

The Ultra-Scale Playbook

🌌

3.71k

The ultimate guide to training LLM on large GPU Clusters

liked a model about 1 year ago

Qwen/Qwen2-7B-Instruct

Text Generation • 8B • Updated Aug 21, 2024 • 276k • • 682

liked 2 models over 1 year ago

Alibaba-NLP/gte-Qwen2-7B-instruct

Qwen/Qwen2-72B-Instruct

Text Generation • 73B • Updated Oct 8, 2024 • 39.5k • • 719

liked 2 models over 2 years ago

meta-llama/Llama-2-13b-hf

Text Generation • Updated Apr 17, 2024 • 21.3k • 621

FlagAlpha/Llama2-Chinese-13b-Chat

Question Answering • Updated Feb 23, 2024 • 801 • 274