Context Forcing: Consistent Autoregressive Video Generation with Long Context Paper • 2602.06028 • Published 5 days ago • 34
Efficient Autoregressive Video Diffusion with Dummy Head Paper • 2601.20499 • Published 14 days ago • 7
Unified Personalized Reward Model for Vision Generation Paper • 2602.02380 • Published 8 days ago • 19
3D-Aware Implicit Motion Control for View-Adaptive Human Video Generation Paper • 2602.03796 • Published 7 days ago • 54
NativeTok: Native Visual Tokenization for Improved Image Generation Paper • 2601.22837 • Published 12 days ago • 9
DreamActor-M2: Universal Character Image Animation via Spatiotemporal In-Context Learning Paper • 2601.21716 • Published 13 days ago • 13
OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer Paper • 2601.14250 • Published 21 days ago • 47
Medical SAM3: A Foundation Model for Universal Prompt-Driven Medical Image Segmentation Paper • 2601.10880 • Published 26 days ago • 15
Transition Matching Distillation for Fast Video Generation Paper • 2601.09881 • Published 27 days ago • 32
3AM: Segment Anything with Geometric Consistency in Videos Paper • 2601.08831 • Published 28 days ago • 34
Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning Paper • 2601.06943 • Published about 1 month ago • 211
OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agent Paper • 2601.07779 • Published 29 days ago • 28
VideoAR: Autoregressive Video Generation via Next-Frame & Scale Prediction Paper • 2601.05966 • Published Jan 9 • 23
NitroGen: An Open Foundation Model for Generalist Gaming Agents Paper • 2601.02427 • Published Jan 4 • 44
KV-Embedding: Training-free Text Embedding via Internal KV Re-routing in Decoder-only LLMs Paper • 2601.01046 • Published Jan 3 • 14
Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation Paper • 2601.00664 • Published Jan 2 • 56
FlowBlending: Stage-Aware Multi-Model Sampling for Fast and High-Fidelity Video Generation Paper • 2512.24724 • Published Dec 31, 2025 • 7
InsertAnywhere: Bridging 4D Scene Geometry and Diffusion Models for Realistic Video Object Insertion Paper • 2512.17504 • Published Dec 19, 2025 • 97