view article Article KV Caching Explained: Optimizing Transformer Inference Efficiency Jan 30, 2025 • 233
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 299
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 Text Generation • 32B • Updated about 17 hours ago • 1.03M • 638
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 7 items • Updated 17 days ago • 143