·
AI & ML interests
None yet
Organizations
models
156
SongTonyLi/codellama-grpo-clip-0.1-peft-lora-r32-c2x86
Updated
SongTonyLi/codellama-sft-grpo-sft-c2x86
Updated
SongTonyLi/codellama-grpo-peft-lora-r32-c2x86
Updated
SongTonyLi/codellama-4bit-peft-lora-r32-sft-c2x86
Updated
SongTonyLi/gsm8krl-1b-lora1
Updated
SongTonyLi/gsm8ksft-1b-lora
Updated
SongTonyLi/Llama-3.2-3B-Instruct-CPT-D1_chosen-then-SFT-D2_chosen-pref-mix
Text Generation
•
3B
•
Updated
•
1
SongTonyLi/Llama-3.2-3B-Instruct-CPT-D1_chosen-pref-mix
Text Generation
•
3B
•
Updated
•
1
SongTonyLi/Llama-3.2-3B-Instruct-CPT-D_chosen-pref-mix
Text Generation
•
3B
•
Updated
•
1
SongTonyLi/Llama-3.2-3B-Instruct-SFT-D1_chosen-then-D2_chosen-pref-mix
Text Generation
•
3B
•
Updated
•
1
datasets
25
SongTonyLi/c2x86-leetcode-eval
Viewer
•
Updated
•
20
•
7
Viewer
•
Updated
•
899
•
7
•
1
SongTonyLi/llama_1B_dolly_15k_data
Viewer
•
Updated
•
15k
•
5
SongTonyLi/llama_1B_stem_data
Viewer
•
Updated
•
151k
•
16
Viewer
•
Updated
•
178k
•
2
SongTonyLi/tachibana_coding_10k
Viewer
•
Updated
•
104k
•
5
SongTonyLi/Magpie-Pro-Ultra-40K
Viewer
•
Updated
•
41.4k
•
2
SongTonyLi/Magpie-Pro-hard-8K
Viewer
•
Updated
•
8k
•
8
SongTonyLi/llama-1b-preference-merge-mix
Viewer
•
Updated
•
142k
•
7
•
1
SongTonyLi/dpo-mix_skywork_infinity
Viewer
•
Updated
•
131k
•
4
•
1