Horizon-LM: A RAM-Centric Architecture for LLM Training Paper • 2602.04816 • Published 1 day ago • 16
Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation Paper • 2601.22813 • Published 7 days ago • 53
Linear representations in language models can change dramatically over a conversation Paper • 2601.20834 • Published 8 days ago • 21
Scaling Embeddings Outperforms Scaling Experts in Language Models Paper • 2601.21204 • Published 8 days ago • 96
CGPT: Cluster-Guided Partial Tables with LLM-Generated Supervision for Table Retrieval Paper • 2601.15849 • Published 15 days ago • 14
Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability Paper • 2601.18778 • Published 10 days ago • 40
Introducing Waypoint-1: Real-time interactive video diffusion from Overworld Article • 17 days ago • 36
Waypoint 1 Small 🎮: Explore and navigate through AI-generated worlds in real-time • 62
Towards Automated Kernel Generation in the Era of LLMs Paper • 2601.15727 • Published 15 days ago • 16
Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces Paper • 2601.11868 • Published 20 days ago • 32