Xin Eric Wang

xw-eric

https://eric-xw.github.io

AI & ML interests

None yet

Recent Activity

submitted a paper 25 minutes ago

Group-Evolving Agents: Open-Ended Self-Improvement via Experience Sharing

authored a paper 3 days ago

Presenting a Paper is an Art: Self-Improvement Aesthetic Agents for Academic Presentations

Organizations

None yet

upvoted a paper 1 day ago

Group-Evolving Agents: Open-Ended Self-Improvement via Experience Sharing

Paper • 2602.04837 • Published 5 days ago • 3

upvoted 2 papers 5 days ago

Reasoning Within the Mind: Dynamic Multimodal Interleaving in Latent Space

Paper • 2512.12623 • Published Dec 14, 2025 • 4

SafeGround: Know When to Trust GUI Grounding Models via Uncertainty Calibration

Paper • 2602.02419 • Published 7 days ago • 4

upvoted 2 papers 4 months ago

Presenting a Paper is an Art: Self-Improvement Aesthetic Agents for Academic Presentations

Paper • 2510.05571 • Published Oct 7, 2025 • 15

The Unreasonable Effectiveness of Scaling Agents for Computer Use

Paper • 2510.02250 • Published Oct 2, 2025 • 25

upvoted a paper 7 months ago

"PhyWorldBench": A Comprehensive Evaluation of Physical Realism in Text-to-Video Models

Paper • 2507.13428 • Published Jul 17, 2025 • 16

upvoted 3 papers 8 months ago

Hidden in Plain Sight: Probing Implicit Reasoning in Multimodal Language Models

Paper • 2506.00258 • Published May 30, 2025 • 3

Agents of Change: Self-Evolving LLM Agents for Strategic Planning

Paper • 2506.04651 • Published Jun 5, 2025 • 8

More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models

Paper • 2505.21523 • Published May 23, 2025 • 13

upvoted a collection 9 months ago

SafeKey

Models and data for the SafeKey paper. • 7 items • Updated Jun 6, 2025 • 2

upvoted 5 papers 9 months ago

SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning

Paper • 2505.16186 • Published May 22, 2025 • 7

GRIT: Teaching MLLMs to Think with Images

Paper • 2505.15879 • Published May 21, 2025 • 13

Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space

Paper • 2505.15778 • Published May 21, 2025 • 19

Constructing a 3D Town from a Single Image

Paper • 2505.15765 • Published May 21, 2025 • 24

LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language Models

Paper • 2310.03903 • Published Oct 5, 2023 • 1

upvoted 2 papers 10 months ago

THOUGHTTERMINATOR: Benchmarking, Calibrating, and Mitigating Overthinking in Reasoning Models

Paper • 2504.13367 • Published Apr 17, 2025 • 26

Agent S2: A Compositional Generalist-Specialist Framework for Computer Use Agents

Paper • 2504.00906 • Published Apr 1, 2025 • 27

upvoted 2 papers 12 months ago

Multimodal Inconsistency Reasoning (MMIR): A New Benchmark for Multimodal Reasoning Models

Paper • 2502.16033 • Published Feb 22, 2025 • 18

The Hidden Risks of Large Reasoning Models: A Safety Assessment of R1

Paper • 2502.12659 • Published Feb 18, 2025 • 7

upvoted a paper over 1 year ago

Agent S: An Open Agentic Framework that Uses Computers Like a Human

Paper • 2410.08164 • Published Oct 10, 2024 • 26