Sungmin Jo's picture

1 11 5

Sungmin Jo

jsm0424

·

jsm0424

AI & ML interests

RLVR, LLM Reasoning

Recent Activity

upvoted a paper 1 day ago

Towards Autonomous Mathematics Research

upvoted a paper 2 days ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

upvoted a paper 2 days ago

Weak-Driven Learning: How Weak Agents make Strong Agents Stronger

View all activity

Organizations

None yet

upvoted a paper 1 day ago

Towards Autonomous Mathematics Research

Paper • 2602.10177 • Published 3 days ago • 29

upvoted 2 papers 2 days ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published 14 days ago • 96

Weak-Driven Learning: How Weak Agents make Strong Agents Stronger

Paper • 2602.08222 • Published 4 days ago • 245

upvoted 2 collections 6 days ago

K-EXAONE

First journey to foundation models with frontier-level performance. • 4 items • Updated Jan 9 • 34

EXAONE-4.0

EXAONE unified model series of 1.2B and 32B, integrating non-reasoning and reasoning modes. • 20 items • Updated Jul 29, 2025 • 55

upvoted a collection 8 days ago

POPE

8 items • Updated 13 days ago • 2

upvoted a paper 20 days ago

The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models

Paper • 2601.15165 • Published 23 days ago • 72

upvoted 4 papers 29 days ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 225

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 305

K-EXAONE Technical Report

Paper • 2601.01739 • Published Jan 5 • 91

Solar Open Technical Report

Paper • 2601.07022 • Published Jan 11 • 65