Zhongpai Gao

gaozhongpai

AI & ML interests

3D computer vision

Recent Activity

upvoted a paper 2 days ago

Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation

Paper • 2601.00664 • Published 5 days ago • 45

liked a model 19 days ago

facebook/map-anything

Image-to-3D • 1B • Updated 19 days ago • 24.4k • 67

upvoted 5 papers about 1 month ago

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

Paper • 2512.04677 • Published Dec 4, 2025 • 167

upvoted 3 papers about 2 months ago

MHR: Momentum Human Rig

Paper • 2511.15586 • Published Nov 19, 2025 • 13

SAM2S: Segment Anything in Surgical Videos via Semantic Long-term Tracking

Paper • 2511.16618 • Published Nov 20, 2025 • 7

Depth Anything 3: Recovering the Visual Space from Any Views

Paper • 2511.10647 • Published Nov 13, 2025 • 96

upvoted a paper 2 months ago

Video-Thinker: Sparking "Thinking with Videos" via Reinforcement Learning

Paper • 2510.23473 • Published Oct 27, 2025 • 84

upvoted a paper 3 months ago

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published Oct 21, 2025 • 83

upvoted 2 papers 4 months ago

Durian: Dual Reference-guided Portrait Animation with Attribute Transfer

Paper • 2509.04434 • Published Sep 4, 2025 • 10

FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait

Paper • 2412.01064 • Published Dec 2, 2024 • 47

liked a Space 4 months ago

IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Space • Generate speech from text using a reference audio • 216

upvoted 4 papers 4 months ago

TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis

Paper • 2508.13618 • Published Aug 19, 2025 • 18

Multi-View 3D Point Tracking

Paper • 2508.21060 • Published Aug 28, 2025 • 23

Gaze into the Heart: A Multi-View Video Dataset for rPPG and Health Biomarkers Estimation

Paper • 2508.17924 • Published Aug 25, 2025 • 14

MIDAS: Multimodal Interactive Digital-human Synthesis via Real-time Autoregressive Video Generation

Paper • 2508.19320 • Published Aug 26, 2025 • 29

liked a model 5 months ago

hustvl/vavae-imagenet256-f16d32-dinov2

Text-to-Image • Updated Feb 17, 2025 • 6
