Jiarui Yao's picture

1 19 1

Jiarui Yao

FlippyDora

·

AI & ML interests

None yet

Recent Activity

authored a paper 16 days ago

PRL: Process Reward Learning Improves LLMs' Reasoning Ability and Broadens the Reasoning Boundary

upvoted a paper 17 days ago

PRL: Process Reward Learning Improves LLMs' Reasoning Ability and Broadens the Reasoning Boundary

submitted a paper 17 days ago

PRL: Process Reward Learning Improves LLMs' Reasoning Ability and Broadens the Reasoning Boundary

View all activity

Organizations

FlippyDora 's models 62

FlippyDora/gemma-2b-it_lora_r16_lr5e-4_dpo

Updated Oct 22, 2024 • 1

FlippyDora/gemma-2b-it_lr1e-5_ultrafeedback

3B • Updated Oct 16, 2024 • 1