2 15

Zhenwen Liang

invokerliang

http://zhenwen-nlp.github.io/

LZhenwen

AI & ML interests

Mathematical Reasoning.

Recent Activity

updated a dataset 17 days ago

invokerliang/math_aime_grpo

published a dataset 17 days ago

invokerliang/math_aime_grpo

upvoted a paper about 1 month ago

SPICE: Self-Play In Corpus Environments Improves Reasoning

View all activity

Organizations

updated a dataset 17 days ago

invokerliang/math_aime_grpo

Viewer • Updated 17 days ago • 9.04k • 109

published a dataset 17 days ago

invokerliang/math_aime_grpo

Viewer • Updated 17 days ago • 9.04k • 109

upvoted a paper about 1 month ago

SPICE: Self-Play In Corpus Environments Improves Reasoning

Paper • 2510.24684 • Published Oct 28 • 15

upvoted 3 papers about 2 months ago

The Art of Scaling Reinforcement Learning Compute for LLMs

Paper • 2510.13786 • Published Oct 15 • 30

The Role of Computing Resources in Publishing Foundation Model Research

Paper • 2510.13621 • Published Oct 15 • 16

Building a Foundational Guardrail for General Agentic Systems via Synthetic Data

Paper • 2510.09781 • Published Oct 10 • 26

upvoted a collection about 2 months ago

EVOL-RL

Collection

The models trained with EVOL-RL • 7 items • Updated Oct 3 • 1

authored a paper 2 months ago

CLUE: Non-parametric Verification from Experience via Hidden-State Clustering

Paper • 2510.01591 • Published Oct 2 • 26

upvoted a paper 2 months ago

CLUE: Non-parametric Verification from Experience via Hidden-State Clustering

Paper • 2510.01591 • Published Oct 2 • 26

upvoted a paper 3 months ago

Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation

Paper • 2509.15194 • Published Sep 18 • 33

authored a paper 3 months ago

Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation

Paper • 2509.15194 • Published Sep 18 • 33

upvoted a paper 3 months ago

Self-Rewarding Vision-Language Model via Reasoning Decomposition

Paper • 2508.19652 • Published Aug 27 • 84

authored a paper 3 months ago

Self-Rewarding Vision-Language Model via Reasoning Decomposition

Paper • 2508.19652 • Published Aug 27 • 84

upvoted a paper 5 months ago

Towards Solving More Challenging IMO Problems via Decoupled Reasoning and Proving

Paper • 2507.06804 • Published Jul 7 • 16

commented a paper 5 months ago

Towards Solving More Challenging IMO Problems via Decoupled Reasoning and Proving

Paper • 2507.06804 • Published Jul 7 • 16 •

authored a paper 6 months ago

DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning

Paper • 2505.23754 • Published May 29 • 15

upvoted 2 papers 6 months ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30 • 142

DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning

Paper • 2505.23754 • Published May 29 • 15

authored a paper 7 months ago

MPS-Prover: Advancing Stepwise Theorem Proving by Multi-Perspective Search and Data Curation

Paper • 2505.10962 • Published May 16 • 8

upvoted a paper 7 months ago

MPS-Prover: Advancing Stepwise Theorem Proving by Multi-Perspective Search and Data Curation

Paper • 2505.10962 • Published May 16 • 8

Zhenwen Liang

AI & ML interests

Recent Activity

Organizations

invokerliang's activity