SongTonyLi/codellama-grpo-clip-0.1-peft-lora-r32-c2x86
Updated
SongTonyLi/codellama-sft-grpo-sft-c2x86
Updated
SongTonyLi/codellama-grpo-peft-lora-r32-c2x86
Updated
SongTonyLi/codellama-4bit-peft-lora-r32-sft-c2x86
Updated
SongTonyLi/gsm8krl-1b-lora1
Updated
SongTonyLi/gsm8ksft-1b-lora
Updated
SongTonyLi/Llama-3.2-3B-Instruct-CPT-D1_chosen-then-SFT-D2_chosen-pref-mix
Text Generation
•
3B
•
Updated
•
1
SongTonyLi/Llama-3.2-3B-Instruct-CPT-D1_chosen-pref-mix
Text Generation
•
3B
•
Updated
•
1
SongTonyLi/Llama-3.2-3B-Instruct-CPT-D_chosen-pref-mix
Text Generation
•
3B
•
Updated
•
1
SongTonyLi/Llama-3.2-3B-Instruct-SFT-D1_chosen-then-D2_chosen-pref-mix
Text Generation
•
3B
•
Updated
•
1
SongTonyLi/Llama-3.2-3B-Instruct-SFT-D1_chosen-pref-mix
Text Generation
•
3B
•
Updated
•
1
SongTonyLi/Llama-3.2-1B-Instruct-CPT-D1_chosen-then-SFT-D2_chosen-pref-mix2
Text Generation
•
1B
•
Updated
•
1
SongTonyLi/Llama-3.2-1B-Instruct-CPT-D1_chosen-pref-mix2
Text Generation
•
1B
•
Updated
•
1
SongTonyLi/Llama-3.2-1B-Instruct-CPT-D_chosen-pref-mix2
Text Generation
•
1B
•
Updated
•
1
SongTonyLi/Llama-3.2-1B-Instruct-SFT-D1_chosen-then-D2_chosen-pref-mix2
Text Generation
•
1B
•
Updated
•
3
SongTonyLi/Llama-3.2-1B-Instruct-SFT-D1_chosen-pref-mix2
Text Generation
•
1B
•
Updated
•
1
SongTonyLi/Llama-3.2-3B-Instruct-SFT-D_chosen-pref-mix
Text Generation
•
3B
•
Updated
SongTonyLi/Llama-3.2-3B-Instruct-SFT-D_chosen-pref-mix2
Text Generation
•
3B
•
Updated
SongTonyLi/Llama-3.2-1B-Instruct-SFT-D_chosen-pref-mix3
Text Generation
•
1B
•
Updated
•
3
SongTonyLi/Llama-3.2-1B-Instruct-SFT-D_chosen-pref-mix2
Text Generation
•
1B
•
Updated
•
2
SongTonyLi/Llama-3.2-3B-Instruct-SFT-D_chosen-pref-mix9
Text Generation
•
3B
•
Updated
SongTonyLi/Llama-3.2-3B-Instruct-SFT-D_chosen-pref-mix8
Text Generation
•
3B
•
Updated
•
2
SongTonyLi/Llama-3.2-3B-Instruct-SFT-D_chosen-pref-mix7
Text Generation
•
3B
•
Updated
•
2
SongTonyLi/Llama-3.2-3B-Instruct-SFT-D_chosen-pref-mix6
Text Generation
•
3B
•
Updated
SongTonyLi/Llama-3.2-3B-Instruct-SFT-D_chosen-pref-mix5
Text Generation
•
3B
•
Updated
•
2
SongTonyLi/Llama-3.2-3B-Instruct-SFT-D_chosen-pref-mix4
Text Generation
•
3B
•
Updated
•
2
SongTonyLi/Llama-3.2-3B-Instruct-SFT-D_chosen-pref-mix3
Text Generation
•
3B
•
Updated
SongTonyLi/Llama-3.2-1B-Instruct-SFT-D_chosen-pref-mix9
Text Generation
•
1B
•
Updated
•
2
SongTonyLi/Llama-3.2-1B-Instruct-SFT-D_chosen-pref-mix8
Text Generation
•
1B
•
Updated
•
2
SongTonyLi/Llama-3.2-1B-Instruct-SFT-D_chosen-pref-mix7
Text Generation
•
1B
•
Updated
•
3