3 31 45

Yuanxin Liu

lyx97

https://llyx97.github.io/

llyx97

AI & ML interests

None yet

Recent Activity

upvoted a paper 27 days ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

upvoted a paper about 1 month ago

ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning

upvoted a paper about 1 month ago

TimeChat-Online: 80% Visual Tokens are Naturally Redundant in Streaming Videos

View all activity

Organizations

upvoted a paper 27 days ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6 • 208

upvoted 4 papers about 1 month ago

ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning

Paper • 2510.27492 • Published Oct 30 • 81

TimeChat-Online: 80% Visual Tokens are Naturally Redundant in Streaming Videos

Paper • 2504.17343 • Published Apr 24 • 13

RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruction

Paper • 2505.22613 • Published May 28 • 9

FARMER: Flow AutoRegressive Transformer over Pixels

Paper • 2510.23588 • Published Oct 27 • 57

authored a paper about 1 month ago

Conan: Progressive Learning to Reason Like a Detective over Multi-Scale Visual Evidence

Paper • 2510.20470 • Published Oct 23 • 11

liked a dataset about 1 month ago

marinero4972/Open-o3-Video

Preview • Updated 26 days ago • 357 • 6

upvoted 2 papers about 1 month ago

Conan: Progressive Learning to Reason Like a Detective over Multi-Scale Visual Evidence

Paper • 2510.20470 • Published Oct 23 • 11

Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence

Paper • 2510.20579 • Published Oct 23 • 55

liked 3 models about 2 months ago

liked a dataset about 2 months ago

lyx97/UVE-Bench

Viewer • Updated Oct 10 • 1.88k • 103 • 1

authored 4 papers about 2 months ago

TimeChat-Online: 80% Visual Tokens are Naturally Redundant in Streaming Videos

Paper • 2504.17343 • Published Apr 24 • 13

TEMPLE:Temporal Preference Learning of Video LLMs via Difficulty Scheduling and Pre-SFT Alignment

Paper • 2503.16929 • Published Mar 21

RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruction

Paper • 2505.22613 • Published May 28 • 9

VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning?

Paper • 2505.23359 • Published May 29 • 39

upvoted a paper about 2 months ago

PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding

Paper • 2504.13180 • Published Apr 17 • 19

updated a dataset about 2 months ago

lyx97/UVE-Bench

Viewer • Updated Oct 10 • 1.88k • 103 • 1

liked a dataset 3 months ago

TempoFunk/webvid-10M

Viewer • Updated Aug 19, 2023 • 10.7M • 5.69k • 87

Yuanxin Liu

AI & ML interests

Recent Activity

Organizations

lyx97's activity