zhangchenxu/BV-Qwen2.5-Math-7B-deepmath_pw_lv-TinyV-1.5B-addon-2_step468 8B • Updated Jul 29, 2025 • 10
Article Visual Aesthetic Benchmark: Can Frontier Models Judge Beauty? 21 days ago • 12
PaddlePaddle/PaddleOCR-VL Image-Text-to-Text • 1.0B • Updated about 24 hours ago • 7.98k • 1.57k
ToolPRMBench: Evaluating and Advancing Process Reward Models for Tool-using Agents Paper • 2601.12294 • Published Jan 18 • 19