-
LlameUser/LeetCodeDataset-IEC61131-3-ST
Viewer • Updated • 4.61k • 2 -
LlameUser/qwen-3-4b-thinking-r1-st
Text Generation • 196k • Updated • 2 • 1 -
LlameUser/qwen-3-4b-thinking-r1-st-easy
Text Generation • 196k • Updated • 1 -
LlameUser/qwen-3-4b-thinking-r1-st-medium
Text Generation • 196k • Updated • 6
Antoine Angert
LlameUser
AI & ML interests
Large Language Models
Instruction Tuning
GRPO
Efficient Fine-Tuning (LoRA, PEFT)
Multimodal Models
Interpretability & Evaluation
AI for Scientific Research
Organizations
None yet
IEC61131-3-ST Training
-
LlameUser/LeetCodeDataset-IEC61131-3-ST
Viewer • Updated • 4.61k • 2 -
LlameUser/qwen-3-4b-thinking-r1-st
Text Generation • 196k • Updated • 2 • 1 -
LlameUser/qwen-3-4b-thinking-r1-st-easy
Text Generation • 196k • Updated • 1 -
LlameUser/qwen-3-4b-thinking-r1-st-medium
Text Generation • 196k • Updated • 6
GRPO-Countdown-Problem
models
14
LlameUser/qwen-3-4b-instruct-r1-st
Text Generation
•
196k
•
Updated
•
1
LlameUser/qwen-3-4b-thinking-r1-st-hard
Text Generation
•
196k
•
Updated
LlameUser/qwen-3-4b-thinking-r1-st-medium
Text Generation
•
196k
•
Updated
•
6
LlameUser/qwen-3-4b-thinking-r1-st-easy
Text Generation
•
196k
•
Updated
•
1
LlameUser/qwen-3-4b-thinking-r1-st
Text Generation
•
196k
•
Updated
•
2
•
1
LlameUser/qwen-3-4b-thinking-r1-countdown
Text Generation
•
196k
•
Updated
LlameUser/qwen-3-1.7b-r1-countdown
Text Generation
•
2B
•
Updated
LlameUser/Qwen2.5-3B-Open-R1-GRPO
Text Generation
•
3B
•
Updated
LlameUser/Qwen2.5-1.5B-Open-R1-GRPO
Updated
LlameUser/qwen-3-4b-instruct-r1-countdown
Text Generation
•
196k
•
Updated
•
1