Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Project of MoE reward model

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

shengyi-qian  authored a paper 28 days ago
DigiData: Training and Evaluating General-Purpose Mobile Control Agents
zhuokai  authored a paper 29 days ago
Scaling Agent Learning via Experience Synthesis
zhuokai  authored a paper about 2 months ago
From Uncertainty to Trust: Enhancing Reliability in Vision-Language Models with Uncertainty-Guided Dropout Decoding
View all activity

Zhuokai Zhao's profile picture Shengyi Qian's profile picture Yuhang Zhou's profile picture Xiaoyu Liu's profile picture Jing Zhu's profile picture wave's profile picture

MoeReward 's models 6

MoeReward/rl_checkpoints

Updated Jun 27

MoeReward/lora_checkpoint

Updated Mar 30

MoeReward/reward_lora_qwen_1_5_base

Updated Mar 21 • 3

MoeReward/reward_qwen_1_5

14B • Updated Mar 17 • 5

MoeReward/reward_lora_qwen_1_5

Updated Mar 17 • 2

MoeReward/sft_full_param_qwen_1_5

14B • Updated Mar 16 • 4
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs