Lost in Backpropagation: The LM Head is a Gradient Bottleneck Paper โข 2603.10145 โข Published 3 days ago โข 5
Running on CPU Upgrade 173 The Synthetic Data Playbook: Generating Trillions of the Finest Tokens ๐ 173 Explore synthetic data experiments in a bookshelf view
AI Paper of the Day Collection A collection of papers that I think are interesting, one added each day โข 618 items โข Updated 1 day ago โข 91