nvidia/gpt-oss-120b-Eagle3-long-context Text Generation • 0.2B • Updated 19 days ago • 7.68k • 58
QuantStack/Qwen-Image-Layered-GGUF Image-Text-to-Image • 20B • Updated Dec 23, 2025 • 1.91k • 56
view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 Dec 18, 2025 • 119
Nemotron-Cascade Collection Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 18 items • Updated 10 days ago • 52