AIMO
AI-MO/NuminaMath-7B-TIR • Text Generation • 7B • Updated Aug 14, 2024 • 137 • 350
Reward Bench Leaderboard • Running • 418 • Display and analyze reward model evaluation results
KTO: Model Alignment as Prospect Theoretic Optimization • Paper • 2402.01306 • Published Feb 2, 2024 • 21
ARC
01-ai/Yi-Coder-9B-Chat • Text Generation • 9B • Updated Sep 12, 2024 • 7.87k • 212
AI-MO/NuminaMath-7B-TIR • Text Generation • 7B • Updated Aug 14, 2024 • 137 • 350