agurung/flawed-fictions-gemma-3-4b-lengthpenalty • Reinforcement Learning • 4B • Updated 7 days ago • 63
agurung/flawed-fictions-qwen3-4b-lengthpenalty-litereason • Reinforcement Learning • 4B • Updated 7 days ago • 28
agurung/flawed-fictions-qwen25-7b-lengthpenalty-litereason • Reinforcement Learning • 8B • Updated 10 days ago • 75
agurung/flawed-fictions-qwen25-7b-lengthpenalty • Reinforcement Learning • 8B • Updated 12 days ago • 196