DidulaThavishaPro/fine_tuned_ballerina_coderank Sentence Similarity • 0.1B • Updated Dec 11, 2025 • 1
DidulaThavishaPro/exp_18_3_0grpo_checkpoint_220_16bit_vllm Text Generation • 8B • Updated Nov 12, 2025
DidulaThavishaPro/exp_16_1_0grpo_checkpoint_660_16bit_vllm Text Generation • 8B • Updated Nov 3, 2025 • 1
DidulaThavishaPro/exp_16_1_0grpo_checkpoint_560_16bit_vllm Text Generation • 8B • Updated Nov 3, 2025
DidulaThavishaPro/exp_16_1_0grpo_checkpoint_760_16bit_vllm Text Generation • 8B • Updated Nov 2, 2025 • 2
DidulaThavishaPro/exp_16_0_grpo_checkpoint_220_16bit_vllm Text Generation • 8B • Updated Nov 1, 2025 • 1
DidulaThavishaPro/exp_13_2_grpo_smooth_error_16bit_vllm Text Generation • 8B • Updated Oct 29, 2025 • 1