Peng Xia's picture

6 30 4

Peng Xia

richardxp888

·

https://richard-peng-xia.github.io

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Skywork-R1V4: Toward Agentic Multimodal Intelligence through Interleaved Thinking with Images and DeepResearch

authored a paper 11 days ago

Agent0-VL: Exploring Self-Evolving Agent for Tool-Integrated Vision-Language Reasoning

upvoted a paper 12 days ago

Agent0-VL: Exploring Self-Evolving Agent for Tool-Integrated Vision-Language Reasoning

View all activity

Organizations

commented a paper 16 days ago

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published 18 days ago • 104 •

commented a paper 4 months ago

WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent

Paper • 2508.05748 • Published Aug 7 • 141 •

commented a paper 9 months ago

MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding

Paper • 2503.13964 • Published Mar 18 • 20 •

commented 3 papers about 1 year ago

MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models

Paper • 2410.13085 • Published Oct 16, 2024 • 24 •

MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models

Paper • 2410.10139 • Published Oct 14, 2024 • 51 •

MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models

Paper • 2410.10139 • Published Oct 14, 2024 • 51 •

commented 2 papers over 1 year ago

RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models

Paper • 2407.05131 • Published Jul 6, 2024 • 27 •

RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models

Paper • 2407.05131 • Published Jul 6, 2024 • 27 •