LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation Paper • 2512.23576 • Published 2 days ago • 58
Yume-1.5: A Text-Controlled Interactive World Generation Model Paper • 2512.22096 • Published 5 days ago • 51
Spatia: Video Generation with Updatable Spatial Memory Paper • 2512.15716 • Published 14 days ago • 28
DreaMontage: Arbitrary Frame-Guided One-Shot Video Generation Paper • 2512.21252 • Published 7 days ago • 32
HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming Paper • 2512.21338 • Published 7 days ago • 20
The World is Your Canvas: Painting Promptable Events with Reference Images, Trajectories, and Text Paper • 2512.16924 • Published 13 days ago • 25
WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling Paper • 2512.14614 • Published 15 days ago • 66
LongVie 2: Multimodal Controllable Ultra-Long Video World Model Paper • 2512.13604 • Published 16 days ago • 72
World Models That Know When They Don't Know: Controllable Video Generation with Calibrated Uncertainty Paper • 2512.05927 • Published 26 days ago • 11
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length Paper • 2512.04677 • Published 27 days ago • 167
Deep Forcing: Training-Free Long Video Generation with Deep Sink and Participative Compression Paper • 2512.05081 • Published 27 days ago • 30
Aligned Novel View Image and Geometry Synthesis via Cross-modal Attention Instillation Paper • 2506.11924 • Published Jun 13 • 34