1 22 6

Ruobing Xie

Ruobing-Xie

https://ruobingxie.github.io/

AI & ML interests

Recommender System; Large Language Model; Natural Language Processing; Information Retrieval

Recent Activity

upvoted an article 2 months ago

Why Did MiniMax M2 End Up as a Full Attention Model?

upvoted a paper 4 months ago

Why Language Models Hallucinate

upvoted a paper 5 months ago

Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

View all activity

Organizations

None yet

upvoted an article 2 months ago

Article

Why Did MiniMax M2 End Up as a Full Attention Model?

Oct 30, 2025

•

upvoted a paper 4 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4, 2025 • 195

upvoted a paper 5 months ago

Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

Paper • 2508.01191 • Published Aug 2, 2025 • 238

liked a Space 6 months ago

Hunyuan Turbos

💬

hunyuan-turbos模型体验

liked a model 7 months ago

tencent/Hunyuan-A13B-Instruct

Text Generation • 80B • Updated Aug 21, 2025 • 7.84k • 679

authored a paper 8 months ago

The Climb Carves Wisdom Deeper Than the Summit: On the Noisy Rewards in Learning to Reason

Paper • 2505.22653 • Published May 28, 2025 • 43

upvoted 3 papers 8 months ago

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14, 2025 • 321

The Climb Carves Wisdom Deeper Than the Summit: On the Noisy Rewards in Learning to Reason

Paper • 2505.22653 • Published May 28, 2025 • 43

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published May 28, 2025 • 131

upvoted 2 papers 10 months ago

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13, 2025 • 170

MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

Paper • 2503.07365 • Published Mar 10, 2025 • 61

liked a dataset 11 months ago

AIMClab-RUC/PhD

Viewer • Updated Apr 6, 2025 • 17.6k • 2.48k • 4

upvoted 2 papers 11 months ago

HMoE: Heterogeneous Mixture of Experts for Language Modeling

Paper • 2408.10681 • Published Aug 20, 2024 • 10

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22, 2025 • 126

upvoted a paper 12 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22, 2025 • 434

authored a paper 12 months ago

Autonomy-of-Experts Models

Paper • 2501.13074 • Published Jan 22, 2025 • 44

upvoted 3 papers 12 months ago

Autonomy-of-Experts Models

Paper • 2501.13074 • Published Jan 22, 2025 • 44

Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation

Paper • 2501.12202 • Published Jan 21, 2025 • 49

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published Jan 14, 2025 • 62

authored a paper about 1 year ago

Scaling Laws for Floating Point Quantization Training

Paper • 2501.02423 • Published Jan 5, 2025 • 26

Ruobing Xie

AI & ML interests

Recent Activity

Organizations

Ruobing-Xie's activity

Why Did MiniMax M2 End Up as a Full Attention Model?

Hunyuan Turbos