3 19 16

Shaohang Wei

SylvainWei

https://sylvain-wei.github.io

AI & ML interests

NLP, LLM

Recent Activity

upvoted a paper 11 days ago

ACE: Attribution-Controlled Knowledge Editing for Multi-hop Factual Recall

upvoted a paper 14 days ago

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

upvoted a paper 14 days ago

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

View all activity

Organizations

upvoted a paper 11 days ago

ACE: Attribution-Controlled Knowledge Editing for Multi-hop Factual Recall

Paper • 2510.07896 • Published Oct 9, 2025 • 8

upvoted 2 papers 14 days ago

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

Paper • 2602.17684 • Published Feb 4 • 22

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

Paper • 2602.08354 • Published 29 days ago • 260

upvoted a paper 23 days ago

Secure Code Generation via Online Reinforcement Learning with Vulnerability Reward Model

Paper • 2602.07422 • Published about 1 month ago • 22

authored a paper 25 days ago

Probability-Entropy Calibration: An Elastic Indicator for Adaptive Fine-tuning

Paper • 2602.01745 • Published Feb 2 • 7

upvoted a paper 26 days ago

Probability-Entropy Calibration: An Elastic Indicator for Adaptive Fine-tuning

Paper • 2602.01745 • Published Feb 2 • 7

upvoted a paper 27 days ago

Improving Data and Reward Design for Scientific Reasoning in Large Language Models

Paper • 2602.08321 • Published 29 days ago • 42

upvoted 2 papers 28 days ago

MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration

Paper • 2602.01734 • Published Feb 2 • 32

PaperBanana: Automating Academic Illustration for AI Scientists

Paper • 2601.23265 • Published Jan 30 • 214

upvoted a paper about 1 month ago

Agentic Reasoning for Large Language Models

Paper • 2601.12538 • Published Jan 18 • 201

liked a model about 1 month ago

moonshotai/Kimi-K2.5

Image-Text-to-Text • 1.1T • Updated 10 days ago • 2.47M • • 2.25k

upvoted a paper about 2 months ago

MMDeepResearch-Bench: A Benchmark for Multimodal Deep Research Agents

Paper • 2601.12346 • Published Jan 18 • 49

liked a model about 2 months ago

mwhanna/qwen3-1.7b-transcoders-lowl0

Updated Aug 18, 2025 • 837 • 1

upvoted a paper about 2 months ago

MegaFlow: Large-Scale Distributed Orchestration System for the Agentic Era

Paper • 2601.07526 • Published Jan 12 • 24

liked a dataset 4 months ago

DigitalLearningGmbH/MATH-lighteval

Viewer • Updated Jan 15, 2025 • 25k • 21.4k • 64

upvoted a paper 4 months ago

JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence

Paper • 2510.23538 • Published Oct 27, 2025 • 97

upvoted a paper 5 months ago

Scaling Language-Centric Omnimodal Representation Learning

Paper • 2510.11693 • Published Oct 13, 2025 • 104

authored a paper 5 months ago

Mitigating Overthinking through Reasoning Shaping

Paper • 2510.09535 • Published Oct 10, 2025 • 5

commented a paper 5 months ago

Mitigating Overthinking through Reasoning Shaping

Paper • 2510.09535 • Published Oct 10, 2025 • 5 •

upvoted a paper 5 months ago

Mitigating Overthinking through Reasoning Shaping

Paper • 2510.09535 • Published Oct 10, 2025 • 5

Shaohang Wei

AI & ML interests

Recent Activity

Organizations

SylvainWei's activity