Compositional Generative Modeling: A Single Model is Not All You Need • Paper • 2402.01103 • Published Feb 2, 2024
CoF-T2I: Video Models as Pure Visual Reasoners for Text-to-Image Generation • Paper • 2601.10061 • Published 4 days ago
Think-Then-Generate: Reasoning-Aware Text-to-Image Diffusion with LLM Encoders • Paper • 2601.10332 • Published 4 days ago
What Happens Next? Next Scene Prediction with a Unified Video Model • Paper • 2512.13015 • Published Dec 15, 2025
CINEMA: Coherent Multi-Subject Video Generation via MLLM-Based Guidance • Paper • 2503.10391 • Published Mar 13, 2025