dfuhoiysOHSVFh82934gfjklb

huba-buba

AI & ML interests

None yet

Organizations

None yet

Recent Activity

liked a model 39 minutes ago

Qwen/Qwen3.5-35B-A3B

Image-Text-to-Text • 36B • Updated about 21 hours ago • 21k • 342

liked a model about 1 hour ago

LocoreMind/LocoOperator-4B

Text Generation • 4B • Updated about 23 hours ago • 232 • 173

liked a dataset 1 day ago

SWE-Gym/SWE-Gym

Viewer • Updated May 10, 2025 • 2.44k • 20.2k • 23

upvoted a paper 2 days ago

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published 14 days ago • 181

upvoted an article 3 days ago

Transformers v5: Simple model definitions powering the AI ecosystem

Article • Dec 1, 2025 • 302

liked a dataset 3 days ago

neulab/agent-data-collection

Preview • Updated 5 days ago • 2.25k • 107

upvoted a paper 4 days ago

Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts

Paper • 2602.13367 • Published 12 days ago • 30

liked a model 4 days ago

Nanbeige/Nanbeige4.1-3B

Text Generation • 4B • Updated 3 days ago • 228k • 791

upvoted a paper 5 days ago

Small Language Models are the Future of Agentic AI

Paper • 2506.02153 • Published Jun 2, 2025 • 24

upvoted a paper 8 days ago

Experiential Reinforcement Learning

Paper • 2602.13949 • Published 11 days ago • 67

upvoted 2 papers 9 days ago

GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning

Paper • 2602.12099 • Published 13 days ago • 56

Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs

Paper • 2602.10388 • Published 15 days ago • 228

liked a model 9 days ago

Qwen/Qwen3.5-397B-A17B

Image-Text-to-Text • 403B • Updated 2 days ago • 483k • 1.05k

upvoted an article 12 days ago

Forge: Scalable Agent RL Framework and Algorithm

Article • 12 days ago • 126

liked a model 12 days ago

MiniMaxAI/MiniMax-M2.5

Text Generation • Updated 9 days ago • 240k • 926

liked a model 14 days ago

zai-org/GLM-5

Text Generation • 754B • Updated 12 days ago • 182k • 1.54k

upvoted 3 papers 15 days ago

upvoted a paper 16 days ago

F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare

Paper • 2602.06717 • Published 19 days ago • 71