OmniStream: Mastering Perception, Reconstruction and Action in Continuous Streams Paper • 2603.12265 • Published 5 days ago • 12
InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding Paper • 2403.15377 • Published Mar 22, 2024 • 29
Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding Paper • 2403.09626 • Published Mar 14, 2024 • 15