DUET-VLM: Dual stage Unified Efficient Token reduction for VLM Training and Inference Paper • 2602.18846 • Published 19 days ago • 4
Instella ✨ Collection Announcing Instella, a series of 3 billion parameter language models developed by AMD, trained from scratch on 128 Instinct MI300X GPUs. • 11 items • Updated 10 days ago • 10
Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text Paper • 2601.22975 • Published Jan 30 • 109
Llama-3.1-FoundationAI-SecurityLLM-Reasoning-8B Technical Report Paper • 2601.21051 • Published Jan 28 • 14