2 13 7

Yanwei Li

YanweiLi

AI & ML interests

None yet

Recent Activity

upvoted a collection 19 days ago

VST

upvoted a paper 25 days ago

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

upvoted a paper 28 days ago

How Far are VLMs from Visual Spatial Intelligence? A Benchmark-Driven Perspective

View all activity

Organizations

None yet

upvoted a collection 19 days ago

VST

Collection

A comprehensive framework designed to cultivate VLMs with human-like visuospatial abilities. • 5 items • Updated 27 days ago • 6

upvoted a paper 25 days ago

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Paper • 2511.08892 • Published 27 days ago • 194

upvoted a paper 28 days ago

How Far are VLMs from Visual Spatial Intelligence? A Benchmark-Driven Perspective

Paper • 2509.18905 • Published Sep 23 • 29

authored a paper 29 days ago

Visual Spatial Tuning

Paper • 2511.05491 • Published Nov 7 • 49

upvoted a paper 29 days ago

Visual Spatial Tuning

Paper • 2511.05491 • Published Nov 7 • 49

authored 11 papers about 1 month ago

Mixed-R1: Unified Reward Perspective For Reasoning Capability in Multimodal Large Language Models

Paper • 2505.24164 • Published May 30

DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World

Paper • 2506.24102 • Published Jun 30

How Far are VLMs from Visual Spatial Intelligence? A Benchmark-Driven Perspective

Paper • 2509.18905 • Published Sep 23 • 29

Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

Paper • 2510.18876 • Published Oct 21 • 36

upvoted 3 papers about 1 month ago

FARMER: Flow AutoRegressive Transformer over Pixels

Paper • 2510.23588 • Published Oct 27 • 57

Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations

Paper • 2510.23607 • Published Oct 27 • 174

VITA-E: Natural Embodied Interaction with Concurrent Seeing, Hearing, Speaking, and Acting

Paper • 2510.21817 • Published Oct 21 • 41

liked a Space 4 months ago

MGM Omni

🎙

Scaling Omni LLMs to Personalized Long-Horizon Speech

Yanwei Li

AI & ML interests

Recent Activity

Organizations

YanweiLi's activity

MGM Omni