Reliable and Responsible Foundation Models: A Comprehensive Survey Paper • 2602.08145 • Published 7 days ago • 8
Thinking Makes LLM Agents Introverted: How Mandatory Thinking Can Backfire in User-Engaged Agents Paper • 2602.07796 • Published 3 days ago • 6
Towards Reducible Uncertainty Modeling for Reliable Large Language Model Agents Paper • 2602.05073 • Published 7 days ago • 11
Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment Article • Published Feb 11, 2025 • 106
Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense Paper • 2510.07242 • Published Oct 8, 2025 • 30
Understanding Language Prior of LVLMs by Contrasting Chain-of-Embedding Paper • 2509.23050 • Published Sep 27, 2025 • 15
Infusing Theory of Mind into Socially Intelligent LLM Agents Paper • 2509.22887 • Published Sep 26, 2025 • 6
LUMINA: Detecting Hallucinations in RAG System with Context-Knowledge Signals Paper • 2509.21875 • Published Sep 26, 2025 • 10
Clean First, Align Later: Benchmarking Preference Data Cleaning for Reliable LLM Alignment Paper • 2509.23564 • Published Sep 28, 2025 • 8
MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems Paper • 2505.18943 • Published May 25, 2025 • 24