Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
YC Xiao's picture
2 7 3

YC Xiao

EasonXiao-888
gn00029914's profile picture dark-pen's profile picture Gargaz's profile picture
·
https://easonxiao-888.github.io/
  • EasonXiao-888

AI & ML interests

AI, Multimodal Large Model

Recent Activity

upvoted a paper about 21 hours ago
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance
upvoted a paper 20 days ago
Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO
upvoted a paper about 2 months ago
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs
View all activity

Organizations

ARC Lab, Tencent PCG's profile picture long-video's profile picture

authored 6 papers 6 months ago

SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation

Paper • 2305.17011 • Published May 26, 2023

GrootVL: Tree Topology is All You Need in State Space Model

Paper • 2406.02395 • Published Jun 4, 2024 • 1

COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing

Paper • 2406.08850 • Published Jun 13, 2024

HaploVL: A Single-Transformer Baseline for Multi-Modal Understanding

Paper • 2503.14694 • Published Mar 12

MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO

Paper • 2505.13031 • Published May 19 • 4

HaploOmni: Unified Single Transformer for Multimodal Video Understanding and Generation

Paper • 2506.02975 • Published Jun 3
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs