PRIME

community

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

JC-Chen authored a paper 19 days ago

Symbol: Generating Flexible Black-Box Optimizers through Symbolic Equation Learning

JC-Chen authored a paper 19 days ago

LLaMoCo: Instruction Tuning of Large Language Models for Optimization Code Generation

JC-Chen authored a paper 19 days ago

InternBootcamp Technical Report: Boosting LLM Reasoning with Verifiable Task Scaling


JC-Chen

authored 5 papers 19 days ago

Symbol: Generating Flexible Black-Box Optimizers through Symbolic Equation Learning

Paper • 2402.02355 • Published Feb 4, 2024

LLaMoCo: Instruction Tuning of Large Language Models for Optimization Code Generation

Paper • 2403.01131 • Published Mar 2, 2024

InternBootcamp Technical Report: Boosting LLM Reasoning with Verifiable Task Scaling

Paper • 2508.08636 • Published Aug 12 • 2

HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?

Paper • 2509.07894 • Published Sep 9 • 31

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published 20 days ago • 132

stingning

updated a Space about 1 month ago

README

🏃

JC-Chen

published a model about 1 month ago

PRIME-RL/P1-30B-A3B

Text Generation • 31B • Updated Oct 24 • 250 • 8

JC-Chen

updated 2 models about 1 month ago

PRIME-RL/P1-30B-A3B

Text Generation • 31B • Updated Oct 24 • 250 • 8

PRIME-RL/P1-235B-A22B

Text Generation • 235B • Updated Oct 24 • 28 • 15

JC-Chen

published a model about 2 months ago

PRIME-RL/P1-235B-A22B

Text Generation • 235B • Updated Oct 24 • 28 • 15

ganqu

authored a paper 2 months ago

V-GameGym: Visual Game Generation for Code Large Language Models

Paper • 2509.20136 • Published Sep 24 • 9

Yirany

authored a paper 2 months ago

MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe

Paper • 2509.18154 • Published Sep 16 • 51

ganqu

authored a paper 2 months ago

MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe

Paper • 2509.18154 • Published Sep 16 • 51

ganqu

authored a paper 3 months ago

FlowRL: Matching Reward Distributions for LLM Reasoning

Paper • 2509.15207 • Published Sep 18 • 114

stingning

authored 6 papers 3 months ago

Arbitrary Few Parameters are Good Enough for Adapting Large-scale Pre-trained Language Models

Paper • 2306.02320 • Published Jun 4, 2023 • 1

UltraLink: An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset

Paper • 2402.04588 • Published Feb 7, 2024 • 2

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3 • 61

UltraIF: Advancing Instruction Following from the Wild

Paper • 2502.04153 • Published Feb 6 • 24

AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset

Paper • 2504.03612 • Published Apr 4 • 2

Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment

Paper • 2402.19085 • Published Feb 29, 2024

Team members 7
