AI & ML interests
None yet
Organizations
Audreygyj/Qwen2-1.5B-SFT-dpo
Text Generation
• 2B • Updated
• 2
Audreygyj/qwen-1.5b-sft-HH-offline-dpo
Updated
Audreygyj/Qwen2-1.5B-SFT-GS-OAIF-merge
Text Generation
• 2B • Updated
• 1
Audreygyj/Qwen2-1.5B-Instruct-GS-OAIF-merge
Text Generation
• 2B • Updated
Audreygyj/Qwen2-1.5B-Instruct-OAIF-merge
Text Generation
• 2B • Updated
• 1
Audreygyj/Qwen2.5-0.5B-SFT-OAIF-merge
Text Generation
• 0.5B • Updated
• 1
Audreygyj/Qwen2-1.5B-SFT-OAIF-merge
Text Generation
• 2B • Updated
• 3
Audreygyj/Qwen2.5-0.5B-SFT-GS-OAIF-merge
Text Generation
• 0.5B • Updated
• 2
Audreygyj/Qwen2.5-0.5B-SFT-merge
Text Generation
• 0.5B • Updated
• 3
• Audreygyj/Qwen2.5-0.5B-SFT
Updated
Audreygyj/Qwen2.5-1.5B-SFT
Updated
Audreygyj/pythia-1b-online-dpo-ground-truth-lead-merge
Text Generation
• 1B • Updated
Audreygyj/pythia-160m-online-dpo-ground-truth-lead-merge
Text Generation
• 0.2B • Updated
Audreygyj/pythia-160m-online-dpo-HH-2
Updated
Audreygyj/pythia-160m-online-dpo-SG-merge
Text Generation
• 0.2B • Updated
• 1
Audreygyj/pythia-160m-online-dpo-HH
Updated
Audreygyj/pythia-160m-online-dpo-SG
Updated
Audreygyj/pythia-160m-sft-HH-2-merge
Text Generation
• 0.2B • Updated
Audreygyj/pythia-160m-online-dpo-SG_test-merge
Text Generation
• 0.2B • Updated
Audreygyj/pythia-160m-sft-SG-merge
Text Generation
• 0.2B • Updated
• 5
Audreygyj/pythia-160m-online-dpo-SG_test
Updated
Audreygyj/pythia-160m-sft-HH-2
Updated
Audreygyj/pythia-160m-sft-SG
Updated
Audreygyj/pythia-160m-sft-HH
Updated
Audreygyj/peft-model-3072
Updated
Audreygyj/peft-model-3000
Updated
Audreygyj/peft-model-2500
Audreygyj/peft-model-2000
Updated
Audreygyj/peft-model-1500