Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm Paper • 2511.04570 • Published Nov 6, 2025 • 240
SRPO Collection • Official collection for SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models, including SFT and RL models. • 5 items • Updated 10 days ago
SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models Paper • 2511.15605 • Published Nov 19, 2025 • 24