CubeComposer: Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video
Lingen Li, Guangzhi Wang, Xiaoyu Li, Zhaoyang Zhang, Qi Dou, Jinwei Gu, Tianfan Xue, Ying Shan
CVPR 2026
TL;DR: Generate one cubemap face per time window with an effective and efficient context mechanism. Then, perspective video becomes 4K 360° without the memory blow‑up or the low‑res‑then‑upscale.
For more details, please visit our project page and GitHub repo.
Model variants
We provide two variants of CubeComposer in this repo:
- cubecomposer-3k: supports 2K/3K generation, cubemap size = 512/768, temporal window length = 9 frames.
- cubecomposer-4k: supports 4K generation, cubemap size = 960, temporal window length = 5 frames.
License
This repository is released under the terms of the LICENSE file.
By cloning, downloading, using, or distributing this repository or any of its models or weights, you agree to comply with the terms and conditions specified in the LICENSE.
Model tree for TencentARC/CubeComposer
Base model
Wan-AI/Wan2.2-TI2V-5B