2 14 3

Guan

Guan123

guankaisi

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

LTX-2: Efficient Joint Audio-Visual Foundation Model

upvoted a collection 3 months ago

V-JEPA 2

upvoted a paper 3 months ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

View all activity

Organizations

upvoted a paper about 1 month ago

LTX-2: Efficient Joint Audio-Visual Foundation Model

Paper • 2601.03233 • Published Jan 6 • 149

upvoted a collection 3 months ago

V-JEPA 2

Collection

A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated Jun 13, 2025 • 191

upvoted a paper 3 months ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6, 2025 • 216

updated a model 3 months ago

Guan123/baichuan_7b_ecommerce

Updated Nov 3, 2025

upvoted a paper 4 months ago

Ovi: Twin Backbone Cross-Modal Fusion for Audio-Video Generation

Paper • 2510.01284 • Published Sep 30, 2025 • 37

authored a paper 4 months ago

Taming Text-to-Sounding Video Generation via Advanced Modality Condition and Interaction

Paper • 2510.03117 • Published Oct 3, 2025 • 12

upvoted a paper 4 months ago

Taming Text-to-Sounding Video Generation via Advanced Modality Condition and Interaction

Paper • 2510.03117 • Published Oct 3, 2025 • 12

commented a paper 4 months ago

Taming Text-to-Sounding Video Generation via Advanced Modality Condition and Interaction

Paper • 2510.03117 • Published Oct 3, 2025 • 12 •

updated a dataset 5 months ago

Aimind-dataset-share/vgg-subdataset

Viewer • Updated Sep 30, 2025 • 9.37k • 9

updated a model 5 months ago

Aimind-dataset-share/ckpt

Updated Sep 30, 2025

published 2 models 5 months ago

Aimind-dataset-share/vgg-subdataset

Updated Sep 30, 2025

Aimind-dataset-share/ckpt

Updated Sep 30, 2025

published a dataset 5 months ago

Aimind-dataset-share/vgg-subdataset

Viewer • Updated Sep 30, 2025 • 9.37k • 9

updated a dataset 8 months ago

Aimind-dataset-share/very-very-large

Viewer • Updated Jun 24, 2025 • 38.9k • 6

published a dataset 8 months ago

Aimind-dataset-share/very-very-large

Viewer • Updated Jun 24, 2025 • 38.9k • 6

upvoted a paper 9 months ago

Breaking Down Video LLM Benchmarks: Knowledge, Spatial Perception, or True Temporal Understanding?

Paper • 2505.14321 • Published May 20, 2025 • 11

updated 2 datasets 9 months ago

Aimind-dataset-share/youcook

Viewer • Updated May 14, 2025 • 452 • 3

Aimind-dataset-share/data4yuyue

Updated May 13, 2025 • 4

Guan

AI & ML interests

Recent Activity

Organizations

Guan123's activity