The Smol Training Playbook 📚 • Featured • Running on CPU Upgrade • 3.02k • The secrets to building world-class LLMs
Article • Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment • Feb 11, 2025 • 107
The Ultra-Scale Playbook 🌌 • Running • 3.71k • The ultimate guide to training LLM on large GPU Clusters
Alibaba-NLP/gte-Qwen2-7B-instruct Sentence Similarity • 8B • Updated Mar 24, 2025 • 87.1k • 476