Running 3.65k The Ultra-Scale Playbook 🌌 3.65k The ultimate guide to training LLM on large GPU Clusters
view article Article How to generate text: using different decoding methods for language generation with Transformers Mar 1, 2020 • 283
deepseek-ai/DeepSeek-R1-Distill-Llama-8B Text Generation • 8B • Updated Feb 24, 2025 • 469k • • 835
openai/clip-vit-large-patch14 Zero-Shot Image Classification • 0.4B • Updated Sep 15, 2023 • 7.09M • 1.95k