view article Article From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels Aug 18, 2025 • 95
Mamba-3: Improved Sequence Modeling using State Space Principles Paper • 2603.15569 • Published 4 days ago • 4
CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation Paper • 2603.08652 • Published 11 days ago • 39
LLM2Vec-Gen: Generative Embeddings from Large Language Models Paper • 2603.10913 • Published 9 days ago • 40
IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse Paper • 2603.12201 • Published 8 days ago • 51
CHIMERA: Compact Synthetic Data for Generalizable LLM Reasoning Paper • 2603.00889 • Published 19 days ago • 55
Flash-KMeans: Fast and Memory-Efficient Exact K-Means Paper • 2603.09229 • Published 10 days ago • 79
OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data Paper • 2603.15594 • Published 4 days ago • 137
OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens Paper • 2603.02138 • Published 18 days ago • 147
VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training Paper • 2602.10693 • Published Feb 11 • 220