5 41 42

Haiwen Diao

Paranioar

https://Paranioar.github.io/

AI & ML interests

Vision-and-Language, Parameter-efficient Transfer Learning, Multi-modal Large Language Model

Recent Activity

upvoted a paper 5 days ago

VLANeXt: Recipes for Building Strong VLA Models

upvoted a paper 5 days ago

A Very Big Video Reasoning Suite

upvoted a paper 11 days ago

UniT: Unified Multimodal Chain-of-Thought Test-time Scaling

View all activity

Organizations

upvoted 2 papers 5 days ago

VLANeXt: Recipes for Building Strong VLA Models

Paper • 2602.18532 • Published 9 days ago • 52

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published 6 days ago • 494

upvoted 2 papers 11 days ago

UniT: Unified Multimodal Chain-of-Thought Test-time Scaling

Paper • 2602.12279 • Published 17 days ago • 19

GLM-5: from Vibe Coding to Agentic Engineering

Paper • 2602.15763 • Published 12 days ago • 99

upvoted a paper 13 days ago

OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

Paper • 2602.08683 • Published 20 days ago • 49

upvoted a paper 19 days ago

Demo-ICL: In-Context Learning for Procedural Video Knowledge Acquisition

Paper • 2602.08439 • Published 20 days ago • 28

upvoted a paper about 1 month ago

DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation

Paper • 2601.22153 • Published about 1 month ago • 71

updated a collection about 1 month ago

NEO1_5

Collection

From Image to One-Vision -- Towards Native Foundation Models at Scale • 2 items • Updated Jan 27

updated a dataset about 1 month ago

Paranioar/datacomp

Viewer • Updated Jan 16 • 1.7M • 10

liked 2 models about 2 months ago

Paranioar/NEO1_0-9B-SFT

Image-Text-to-Text • 10B • Updated Oct 21, 2025 • 548 • 6

Paranioar/NEO1_0-2B-SFT

Image-Text-to-Text • 3B • Updated Oct 21, 2025 • 237 • 10

published a dataset about 2 months ago

Paranioar/datacomp

Viewer • Updated Jan 16 • 1.7M • 10

authored a paper 2 months ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

Paper • 2512.19693 • Published Dec 22, 2025 • 66

commented a paper 2 months ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

Paper • 2512.19693 • Published Dec 22, 2025 • 66 •

upvoted a paper 2 months ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

Paper • 2512.19693 • Published Dec 22, 2025 • 66

upvoted 3 papers 3 months ago

Haiwen Diao

AI & ML interests

Recent Activity

Organizations

Paranioar's activity