Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
19
1
Jiarui Yao
FlippyDora
Follow
research4pan's profile picture
1 follower
·
20 following
AI & ML interests
None yet
Recent Activity
authored
a paper
16 days ago
PRL: Process Reward Learning Improves LLMs' Reasoning Ability and Broadens the Reasoning Boundary
upvoted
a
paper
17 days ago
PRL: Process Reward Learning Improves LLMs' Reasoning Ability and Broadens the Reasoning Boundary
submitted
a paper
17 days ago
PRL: Process Reward Learning Improves LLMs' Reasoning Ability and Broadens the Reasoning Boundary
View all activity
Organizations
FlippyDora
's models
62
Sort: Recently updated
FlippyDora/gemma-2b-it_lora_r16_lr5e-4_dpo
Updated
Oct 22, 2024
•
1
FlippyDora/gemma-2b-it_lr1e-5_ultrafeedback
3B
•
Updated
Oct 16, 2024
•
1
Previous
1
2
3
Next