15 8 2

Jie Chen

survivi

survivi

AI & ML interests

Large Language Model, Natural Language Processing

Recent Activity

authored a paper 7 days ago

From Trial-and-Error to Improvement: A Systematic Analysis of LLM Exploration Mechanisms in RLVR

authored a paper 7 days ago

Unveiling the Flaws: Exploring Imperfections in Synthetic Data and Mitigation Strategies for Large Language Models

authored a paper 7 days ago

Sticker-TTS: Learn to Utilize Historical Experience with a Sticker-driven Test-Time Scaling Framework

View all activity

Organizations

authored 4 papers 7 days ago

From Trial-and-Error to Improvement: A Systematic Analysis of LLM Exploration Mechanisms in RLVR

Paper • 2508.07534 • Published Aug 11, 2025 • 1

Unveiling the Flaws: Exploring Imperfections in Synthetic Data and Mitigation Strategies for Large Language Models

Paper • 2406.12397 • Published Jun 18, 2024

Sticker-TTS: Learn to Utilize Historical Experience with a Sticker-driven Test-Time Scaling Framework

Paper • 2509.05007 • Published Sep 5, 2025

Decomposing the Entropy-Performance Exchange: The Missing Keys to Unlocking Effective Reinforcement Learning

Paper • 2508.02260 • Published Aug 4, 2025

published 2 models 9 days ago

survivi/Qwen3-0.6B-TIT-DST

Updated 9 days ago

survivi/Qwen3-CIR

8B • Updated Sep 1, 2025 • 6

updated a model 6 months ago

survivi/Qwen3-CIR

8B • Updated Sep 1, 2025 • 6

updated 3 datasets 8 months ago

published a dataset 8 months ago

survivi/grad_cilp0.28_100

Viewer • Updated Jun 25, 2025 • 3.09M • 18.5k

updated a model 8 months ago

survivi/grpo_clip0.2_0.28_80-20-mask0.80

Updated Jun 25, 2025

published a dataset 8 months ago

survivi/baseline_dapo_positive_only

Viewer • Updated Jun 25, 2025 • 6.88k • 931

published a model 8 months ago

survivi/grpo_clip0.2_0.28_80-20-mask0.80

Updated Jun 25, 2025

updated a model 8 months ago

survivi/grpo_clip0.2_0.28_80-20-advantage0.80-0.1

Updated Jun 25, 2025

published a dataset 8 months ago

survivi/baseline_dapo_final2

Viewer • Updated Jun 26, 2025 • 4.57k • 85

published a model 8 months ago

survivi/grpo_clip0.2_0.28_80-20-advantage0.80-0.1

Updated Jun 25, 2025

updated a dataset 8 months ago

survivi/grad_clip0.28_merged

Viewer • Updated Jun 24, 2025 • 240k • 194

published a dataset 8 months ago

survivi/grad_clip0.28_merged

Viewer • Updated Jun 24, 2025 • 240k • 194

upvoted a collection 8 months ago

MiniCPM4

Collection

MiniCPM4: Ultra-Efficient LLMs on End Devices • 30 items • Updated 3 days ago • 83

Jie Chen

AI & ML interests

Recent Activity

Organizations

survivi's activity