Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Jie Cheng's picture
17 14

Jie Cheng

jinachris
dark-pen's profile picture
·
https://github.com/CJReinforce
  • CJReinforce

AI & ML interests

Reinforcement learning, LLM

Recent Activity

liked a model 1 day ago
cerebras/Step-3.5-Flash-REAP-121B-A11B
liked a Space 8 days ago
stepfun-ai/Step-3.5-Flash
upvoted a paper 27 days ago
Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters
View all activity

Organizations

None yet

authored 2 papers 10 months ago

Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning

Paper • 2504.15275 • Published Apr 21, 2025 • 2

Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining

Paper • 2410.00564 • Published Oct 1, 2024 • 1
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs