Antoine Angert
LlameUser
AI & ML interests
Large Language Models
Instruction Tuning
GRPO
Efficient Fine-Tuning (LoRA, PEFT)
Multimodal Models
Interpretability & Evaluation
AI for Scientific Research
Organizations
None yet
models 14
LlameUser/qwen-3-4b-instruct-r1-st
Text Generation • 196k • Updated
LlameUser/qwen-3-4b-thinking-r1-st-hard
Text Generation • 196k • Updated
LlameUser/qwen-3-4b-thinking-r1-st-medium
Text Generation • 196k • Updated
LlameUser/qwen-3-4b-thinking-r1-st-easy
Text Generation • 196k • Updated
LlameUser/qwen-3-4b-thinking-r1-st
Text Generation • 196k • Updated
• 1 • 1
LlameUser/qwen-3-4b-thinking-r1-countdown
Text Generation • 196k • Updated
LlameUser/qwen-3-1.7b-r1-countdown
Text Generation • 2B • Updated
LlameUser/Qwen2.5-3B-Open-R1-GRPO
Text Generation • 3B • Updated
• 1
LlameUser/Qwen2.5-1.5B-Open-R1-GRPO
Updated
LlameUser/qwen-3-4b-instruct-r1-countdown
Text Generation • 196k • Updated
• 2