Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models Paper • 2406.02061 • Published Jun 4, 2024 • 2
DataComp-LM: In search of the next generation of training sets for language models Paper • 2406.11794 • Published Jun 17, 2024 • 55
OpenThoughts: Data Recipes for Reasoning Models Paper • 2506.04178 • Published Jun 4, 2025 • 51
Scaling Laws for Robust Comparison of Open Foundation Language-Vision Models and Datasets Paper • 2506.04598 • Published Jun 5, 2025 • 7
Language models scale reliably with over-training and on downstream tasks Paper • 2403.08540 • Published Mar 13, 2024 • 15
MusicLDM: Enhancing Novelty in Text-to-Music Generation Using Beat-Synchronous Mixup Strategies Paper • 2308.01546 • Published Aug 3, 2023 • 18