·
AI & ML interests
None yet
Organizations
gx-ai-architect/ultrafeedback-dice-iter1-sft-drsow-first-half-vanilla-router
Viewer
•
Updated
•
60.9k
•
2
gx-ai-architect/ultrafeedback-qwen-32b-instruct-vanilla-router-alpha-normalize-0.04-bo32-correct-long
Viewer
•
Updated
•
52k
•
1
gx-ai-architect/ultrafeedback-qwen-32b-instruct-vanilla-router-length-normalize-bo32-correct
Viewer
•
Updated
•
52k
•
3
gx-ai-architect/ultrafeedback-qwen-32b-instruct-vanilla-router-length-normalize-bo32
Viewer
•
Updated
•
60.9k
•
4
gx-ai-architect/ultrafeedback-qwen-32b-instruct-vanilla-router-alpha-normalize-0.04-bo32
Viewer
•
Updated
•
60.9k
•
3
gx-ai-architect/ultrafeedback-eurus-7b-classifier-annotation-bo32
Viewer
•
Updated
•
60.8k
•
3
gx-ai-architect/ultrafeedback-qwen32b-instruct-vs-base-vanilla-router-filter-minus50-bo32
Viewer
•
Updated
•
57.9k
gx-ai-architect/ultrafeedback-new-trl
Viewer
•
Updated
•
63.1k
•
3
gx-ai-architect/ultrafeedback-llama-rdpo-vs-sft-dpo-vanilla-router-filter-minus50-bo32
Viewer
•
Updated
•
58.4k
•
1
gx-ai-architect/ultrafeedback-mistral-rdpo-vs-base-dpo-vanilla-router-filter-minus50-bo32
Viewer
•
Updated
•
58.4k
•
2
gx-ai-architect/ultrafeedback-rdpo-vs-zepher-dpo-vanilla-router-filter-minus50-bo32-updated1
Viewer
•
Updated
•
51.1k
gx-ai-architect/ultrafeedback-rdpo-vs-zepher-dpo-vanilla-router-filter-minus50-bo32-updated
Viewer
•
Updated
•
51.1k
•
3
gx-ai-architect/ultrafeedback-rdpo-vs-zepher-dpo-vanilla-router-filter-minus50-bo32
Viewer
•
Updated
•
51.1k
•
3
gx-ai-architect/financebench-numerical-fixed
Viewer
•
Updated
•
150
•
1
gx-ai-architect/numinamath-178k-phi4-bon-verified-dpo-trl-40k-old-r1-format
Viewer
•
Updated
•
39k
•
4
gx-ai-architect/numinamath-178k-phi4-bon-verified-dpo-trl-40k
Viewer
•
Updated
•
39k
•
7
gx-ai-architect/numinamath-178k-phi4-bon-verified-dpo-trl
Viewer
•
Updated
•
39k
•
4
gx-ai-architect/official_half_rh_half_r1_prompt_60k
Viewer
•
Updated
•
62k
•
6
gx-ai-architect/official_dpo_rh_bo8_random_rej_balanced
Viewer
•
Updated
•
48.8k
gx-ai-architect/official_dpo_r1_prompt_bo8_random_rej_balanced_fixed
Viewer
•
Updated
•
59.4k
•
2
gx-ai-architect/official_dpo_r1_prompt_bo8_random_rej_balanced
Viewer
•
Updated
•
59.4k
•
8
gx-ai-architect/official_dpo_r1_prompt_bo8_random_rej
Viewer
•
Updated
•
50.9k
•
1
gx-ai-architect/trl_dpo_vanilla_bo8_random_rej
Viewer
•
Updated
•
59.1k
gx-ai-architect/dpo_vanilla_bo8_random_rej
Viewer
•
Updated
•
59.1k
gx-ai-architect/helpsteer_hh_combined_pref_filtered
Viewer
•
Updated
•
211k
•
1
gx-ai-architect/helpsteer_hh_combined_pref
Viewer
•
Updated
•
211k
•
3
•
1
gx-ai-architect/helpsteer_combined_pref
Viewer
•
Updated
•
50.7k
•
3
gx-ai-architect/helpsteer_combined
Viewer
•
Updated
•
58.5k
•
2
gx-ai-architect/HelpSteer2_DPO
Viewer
•
Updated
•
7.59k
•
2
•
1
gx-ai-architect/helpsteer_preference
Viewer
•
Updated
•
35.6k
•
5