CAST: Modeling Visual State Transitions for Consistent Video Retrieval
Paper • 2603.08648 • Published 11 days ago