BaseReward: A Strong Baseline for Multimodal Reward Model Paper • 2509.16127 • Published Sep 19, 2025 • 21
MedXIAOHE: A Comprehensive Recipe for Building Medical MLLMs Paper • 2602.12705 • Published 12 days ago • 61
The Smol Training Playbook: The secrets to building world-class LLMs • 3.01k