PAPERS
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference
Paper
• 2412.13663
• Published
• 160
A Survey of Small Language Models
Paper
• 2410.20011
• Published
• 46
No More Adam: Learning Rate Scaling at Initialization is All You Need
Paper
• 2412.11768
• Published
• 43
Chain of Draft: Thinking Faster by Writing Less
Paper
• 2502.18600
• Published
• 50
How far can we go with ImageNet for Text-to-Image generation?
Paper
• 2502.21318
• Published
• 26