In a Training Loop 🔄

Sijia Cui

cuisijia

https://github.com/SijiaCui

AI & ML interests

None yet

Recent Activity

authored a paper about 11 hours ago

CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR

upvoted a collection 7 days ago

liked a dataset 13 days ago

rafaelpadilla/coco2017

CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR

Paper • 2603.10101 • Published 13 days ago • 5