Dominique Mariko's picture

15 33

Dominique Mariko PRO

tiptales

·

tiptales

AI & ML interests

None yet

Recent Activity

liked a Space 9 days ago

burtenshaw/karpathy-llm-council

updated a collection 2 months ago

upvoted a paper 2 months ago

Flow-GRPO: Training Flow Matching Models via Online RL

View all activity

Organizations

liked a Space 9 days ago

Karpathy Llm Council

Ask a question to a council of AI models for a detailed answer

updated a collection 2 months ago

agens

5 items • Updated Sep 30

upvoted 2 papers 2 months ago

Flow-GRPO: Training Flow Matching Models via Online RL

Paper • 2505.05470 • Published May 8 • 86

Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO

Paper • 2505.22453 • Published May 28 • 46

updated a collection 2 months ago

agens

5 items • Updated Sep 30

upvoted 4 papers 2 months ago

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published Sep 2 • 124

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2 • 225

A.S.E: A Repository-Level Benchmark for Evaluating Security in AI-Generated Code

Paper • 2508.18106 • Published Aug 25 • 345

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

Paper • 2509.08721 • Published Sep 10 • 660

upvoted a paper 3 months ago

UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning

Paper • 2509.11543 • Published Sep 15 • 47

updated a collection 3 months ago

data open access

Open and clean datasets • 7 items • Updated Sep 7

liked a dataset 4 months ago

promptfoo/political-questions

Preview • Updated Jul 25 • 18 • 3

liked a model 4 months ago

pytorch/SmolLM3-3B-INT8-INT4

Text Generation • Updated Sep 11 • 52 • 37

upvoted a collection 5 months ago

Releases July 4

25 items • Updated Jul 7 • 7

updated 2 collections 5 months ago

data open access

Open and clean datasets • 7 items • Updated Sep 7

slm

4 items • Updated Jul 4