view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 3 days ago • 309
Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs Paper • 2309.05516 • Published Sep 11, 2023 • 11
Cerebras REAP Collection Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 28 items • Updated 4 days ago • 118
Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published Oct 6, 2025 • 508
view changelog Changelog Introducing HF Jobs: Run scalable compute jobs on Hugging Face Jul 30, 2025 • 202
view article Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders Jul 9, 2025 • 782
DataDecide Collection A suite of models, data, and evals over 25 corpora, 14 sizes, and 3 seeds to measure how accurately small experiments predict rankings at large scale. • 358 items • Updated Dec 23, 2025 • 22