2 17

Hang Yan

beatccjiang

AI & ML interests

None yet

Recent Activity

updated a Space about 12 hours ago

beatccjiang/OdysseyArena

upvoted a paper about 12 hours ago

OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions

published a Space 4 days ago

beatccjiang/OdysseyArena

View all activity

Organizations

updated a Space about 12 hours ago

OdysseyArena

🏁

Explore multiple simulation environments in one interface

upvoted a paper about 12 hours ago

OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions

Paper • 2602.05843 • Published 4 days ago • 51

published a Space 4 days ago

OdysseyArena

🏁

Explore multiple simulation environments in one interface

upvoted a paper 5 days ago

TIDE: Trajectory-based Diagnostic Evaluation of Test-Time Improvement in LLM Agents

Paper • 2602.02196 • Published 7 days ago • 32

submitted a paper to Daily Papers 5 days ago

TIDE: Trajectory-based Diagnostic Evaluation of Test-Time Improvement in LLM Agents

Paper • 2602.02196 • Published 7 days ago • 32

upvoted a paper 5 days ago

3D-Aware Implicit Motion Control for View-Adaptive Human Video Generation

Paper • 2602.03796 • Published 6 days ago • 54

authored 2 papers 5 days ago

$φ$-Decoding: Adaptive Foresight Sampling for Balanced Inference-Time Exploration and Exploitation

Paper • 2503.13288 • Published Mar 17, 2025 • 51

MUR: Momentum Uncertainty guided Reasoning for Large Language Models

Paper • 2507.14958 • Published Jul 20, 2025 • 47

updated a Space 25 days ago

ODS Project

💻

Run multiple AI tasks in one interface

upvoted 2 papers 25 days ago

A^3-Bench: Benchmarking Memory-Driven Scientific Reasoning via Anchor and Attractor Activation

Paper • 2601.09274 • Published 26 days ago • 84

MAXS: Meta-Adaptive Exploration with LLM Agents

Paper • 2601.09259 • Published 26 days ago • 95

published a Space about 1 month ago

ODS Project

💻

Run multiple AI tasks in one interface

updated a Space about 2 months ago

Ods

🐢

Launch interactive GUI for testing

published a Space about 2 months ago

Ods

🐢

Launch interactive GUI for testing

upvoted a paper 2 months ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 104

upvoted a paper 3 months ago

OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows

Paper • 2510.24411 • Published Oct 28, 2025 • 72

updated a dataset 4 months ago

beatccjiang/aitw_mobile_human_id_ss

Preview • Updated Oct 3, 2025 • 7

published a dataset 4 months ago

beatccjiang/aitw_mobile_human_id_ss

Preview • Updated Oct 3, 2025 • 7

upvoted a paper 7 months ago

MUR: Momentum Uncertainty guided Reasoning for Large Language Models

Paper • 2507.14958 • Published Jul 20, 2025 • 47

upvoted a paper 9 months ago

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

Paper • 2505.19897 • Published May 26, 2025 • 104

Hang Yan

AI & ML interests

Recent Activity

Organizations

beatccjiang's activity

OdysseyArena

OdysseyArena

ODS Project

ODS Project

Ods

Ods