In a Training Loop 🔄
lewtun
·
AI & ML interests
LLMs, LLMs, LLMs
Organizations
lewtun/s1K-1.1-dataforge-testing-20251219-213939
Viewer
•
Updated
•
1k
•
13
lewtun/s1K-1.1-dataforge-testing-20251219-081400
Viewer
•
Updated
•
819
•
35
lewtun/s1K-1.1-dataforge-testing-20251218-204703
Viewer
•
Updated
•
920
•
95
lewtun/dataforge-testing-20251218-152114
Viewer
•
Updated
•
1k
•
39
lewtun/s1K-1.1-dataforge-testing-20251216-142704
Viewer
•
Updated
•
10
•
15
lewtun/s1K-1.1-dataforge-testing-20251216-123019
Viewer
•
Updated
•
1k
•
42
lewtun/Polaris-Dataset-53K
Viewer
•
Updated
•
53.3k
•
41
lewtun/details_meta-llama__Llama-2-7b-chat-hf_private
Viewer
•
Updated
•
7.21k
•
166
lewtun/OpenThoughts3-missing-think-sample
Viewer
•
Updated
•
100
•
3
lewtun/details_Qwen__Qwen2.5-Coder-3B-Instruct
Viewer
•
Updated
•
33
•
34
lewtun/details_deepseek-ai__DeepSeek-R1-Distill-Qwen-1.5B
Viewer
•
Updated
•
1k
•
10
lewtun/details_open-thoughts__OpenThinker-7B
Viewer
•
Updated
•
597
•
28
lewtun/details_deepseek-ai__DeepSeek-R1-Distill-Qwen-7B
Viewer
•
Updated
•
597
•
33
lewtun/details_meta-llama__Llama-3.2-3B-Instruct
Viewer
•
Updated
•
1.74k
•
29
lewtun/details_deepseek-ai__DeepSeek-R1-Distill-Llama-8B
Viewer
•
Updated
•
598
•
32
lewtun/details_meta-llama__Llama-3.1-8B-Instruct
Viewer
•
Updated
•
597
•
4
lewtun/details_Qwen__Qwen2.5-1.5B-Instruct
Viewer
•
Updated
•
2.25k
•
67
lewtun/details_Qwen__Qwen2.5-0.5B-Instruct
Viewer
•
Updated
•
898
•
9
lewtun/details_meta-llama__Llama-3.2-1B-Instruct
Viewer
•
Updated
•
898
•
8
lewtun/details_Qwen__Qwen2.5-Math-1.5B-Instruct
Viewer
•
Updated
•
11k
•
2
Viewer
•
Updated
•
1
•
3
lewtun/Llama-3.2-1B-Instruct-best_of_n-prm-completions
Viewer
•
Updated
•
10
•
2
Preview
•
Updated
•
96
lewtun/test-fast-parser-l1b-v3
Viewer
•
Updated
•
509
lewtun/test-fast-parser-l1b-v2
Viewer
•
Updated
•
509
•
1
lewtun/test-fast-parser-l1b
Viewer
•
Updated
•
509
•
3
Viewer
•
Updated
•
25
•
3
lewtun/bon-prm-serverless-batched
Viewer
•
Updated
•
240
•
3
Viewer
•
Updated
•
62.1k
•
2
lewtun/ultrafeedback_binarized
Viewer
•
Updated
•
62.1k
•
1