PRM and fine-tuned LLM used in our PURE github repo: https://github.com/CJReinforce/PURE
-
jinachris/Qwen2.5-7B-PURE-PRM
Text Generation • 8B • Updated • 3 • 1 -
jinachris/Qwen2.5-7B-PURE-VR
Text Generation • 8B • Updated • 1 • 1 -
jinachris/Qwen2.5-7B-PURE-PRMVR
Text Generation • 8B • Updated • 2 • 1 -
jinachris/PURE-PRM-7B
Token Classification • 7B • Updated • 7 • 4