JiachengXu

XiaoBanni

AI & ML interests

None yet

Recent Activity

upvoted a paper 22 days ago

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

upvoted a paper 27 days ago

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

published a Space about 1 month ago

XiaoBanni/ultrascale-playbook

Organizations

None yet

upvoted a paper 22 days ago

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Paper • 2511.11793 • Published 26 days ago • 159

upvoted a paper 27 days ago

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Paper • 2511.08892 • Published 28 days ago • 194

published a Space about 1 month ago

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

updated a Space about 1 month ago

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

New activity in nanotron/ultrascale-playbook about 1 month ago

Clarification Needed: Description of Gradient Accumulation's Peak Memory Impact Seems Incorrect

👍 1

#122 opened about 1 month ago by

XiaoBanni

liked a Space about 1 month ago

The Ultra-Scale Playbook

🌌

3.55k

The ultimate guide to training LLM on large GPU Clusters

upvoted a paper 3 months ago

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published Sep 2 • 83

upvoted 2 papers 5 months ago

MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization

Paper • 2507.14683 • Published Jul 19 • 134

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

Paper • 2507.01352 • Published Jul 2 • 56

updated a dataset 6 months ago

XiaoBanni/TACO_with_solution

Viewer • Updated Jun 1 • 9.1k • 8

published a dataset 6 months ago

XiaoBanni/TACO_with_solution

Viewer • Updated Jun 1 • 9.1k • 8

upvoted a paper 7 months ago

Skywork Open Reasoner 1 Technical Report

Paper • 2505.22312 • Published May 28 • 54

liked a model 12 months ago

Skywork/Skywork-o1-Open-PRM-Qwen-2.5-7B

Text Classification • Updated Aug 29 • 247 • 51

New activity in meta-llama/Llama-3.1-8B-Instruct about 1 year ago

Can't reproduce MATH performance

#66 opened over 1 year ago by

jpiabrantes
