Cheng Zou

wuyouant

bbepoch

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning

upvoted a paper about 1 month ago

Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model

upvoted a paper about 1 month ago

Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation

View all activity

Organizations

None yet

upvoted 4 papers about 1 month ago

Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning

Paper • 2510.19338 • Published Oct 22 • 114

Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model

Paper • 2510.18855 • Published Oct 21 • 71

Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation

Paper • 2510.22115 • Published Oct 25 • 83

HieraTok: Multi-Scale Visual Tokenizer Improves Image Reconstruction and Generation

Paper • 2509.23736 • Published Sep 28 • 1

authored a paper about 1 month ago

Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and Generation

Paper • 2510.24821 • Published Oct 28 • 37

liked a model about 1 month ago

inclusionAI/Ming-flash-omni-Preview

Any-to-Any • 104B • Updated Oct 30 • 7.94k • 65

upvoted a paper about 1 month ago

Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and Generation

Paper • 2510.24821 • Published Oct 28 • 37

authored 2 papers about 1 month ago

HieraTok: Multi-Scale Visual Tokenizer Improves Image Reconstruction and Generation

Paper • 2509.23736 • Published Sep 28 • 1

Zippo: Zipping Color and Transparency Distributions into a Single Diffusion Model

Paper • 2403.11077 • Published Mar 17, 2024

authored 11 papers about 2 months ago

Try-On-Adapter: A Simple and Flexible Try-On Paradigm

Paper • 2411.10187 • Published Nov 15, 2024

Ming-Lite-Uni: Advancements in Unified Architecture for Natural Multimodal Interaction

Paper • 2505.02471 • Published May 5 • 15

Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO

Paper • 2505.21457 • Published May 27 • 15

Ming-Omni: A Unified Multimodal Model for Perception and Generation

Paper • 2506.09344 • Published Jun 11 • 28

GUI-Shepherd: Reliable Process Reward and Verification for Long-Sequence GUI Tasks

Paper • 2509.23738 • Published Sep 28 • 1

Ming-UniVision: Joint Image Understanding and Generation with a Unified Continuous Tokenizer

Paper • 2510.06590 • Published Oct 8 • 72

End-to-End Human Object Interaction Detection with HOI Transformer

Paper • 2103.04503 • Published Mar 8, 2021

Improving Human-Object Interaction Detection via Phrase Learning and Label Composition

Paper • 2112.07383 • Published Dec 14, 2021

Solutions for Fine-grained and Long-tailed Snake Species Recognition in SnakeCLEF 2022

Paper • 2207.01216 • Published Jul 4, 2022

DC-Former: Diverse and Compact Transformer for Person Re-Identification

Paper • 2302.14335 • Published Feb 28, 2023

StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models

Paper • 2409.02543 • Published Sep 4, 2024

Cheng Zou

AI & ML interests

Recent Activity

Organizations

wuyouant's activity