Lei Wang

demolei

https://demoleiwang.github.io/HomePage/

AI & ML interests

LLMs

Recent Activity

upvoted a paper 3 days ago

Free(): Learning to Forget in Malloc-Only Reasoning Models

Paper • 2602.08030 • Published 7 days ago • 5

upvoted a paper 4 days ago

AgentSkiller: Scaling Generalist Agent Intelligence through Semantically Integrated Cross-Domain Data Synthesis

Paper • 2602.09372 • Published 6 days ago • 5

upvoted a paper 10 days ago

Self-Rewarding Sequential Monte Carlo for Masked Diffusion Language Models

Paper • 2602.01849 • Published 13 days ago • 5

upvoted a paper 28 days ago

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Paper • 2601.07372 • Published Jan 12 • 41

authored a paper about 1 month ago

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

Paper • 2601.09688 • Published Jan 14 • 126

upvoted 3 papers about 1 month ago

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

Paper • 2601.09688 • Published Jan 14 • 126

One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling

Paper • 2601.03111 • Published Jan 6 • 10

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 225

upvoted 4 papers about 2 months ago

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2512.20848 • Published Dec 23, 2025 • 38

upvoted 8 papers 2 months ago

Self-Evolving Multi-Agent Simulations for Realistic Clinical Interactions

Paper • 2503.22678 • Published Mar 28, 2025 • 2

Very Large-Scale Multi-Agent Simulation in AgentScope

Paper • 2407.17789 • Published Jul 25, 2024 • 35

DeepCode: Open Agentic Coding

Paper • 2512.07921 • Published Dec 8, 2025 • 33

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Paper • 2512.07461 • Published Dec 8, 2025 • 78

General Agentic Memory Via Deep Research

Paper • 2511.18423 • Published Nov 23, 2025 • 167

PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides

Paper • 2501.03936 • Published Jan 7, 2025 • 23

ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning

Paper • 2512.05111 • Published Dec 4, 2025 • 49

PaperDebugger: A Plugin-Based Multi-Agent System for In-Editor Academic Writing, Review, and Editing

Paper • 2512.02589 • Published Dec 2, 2025 • 71
