Running Featured 57 QED-Nano: Teaching a Tiny Model to Prove Hard Theorems 📝 57 Who needs 1T parameters? Olympiad proofs with a 4B model
MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration Paper • 2602.01734 • Published Feb 2 • 32
tabularisai/multilingual-sentiment-analysis Text Classification • 0.1B • Updated 29 days ago • 95.2k • • 361
🇩🇪German SFT and DPO datasets Collection Datasets that can be used for LLM training with axolotl, trl or llama_factory. • 32 items • Updated 5 days ago • 13