Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
39
189
46
KABI
dongguanting
Follow
akhaliq's profile picture
rafaaelneves's profile picture
vanillaOVO's profile picture
58 followers
·
94 following
https://dongguanting.github.io/
kakakbibibi
dongguanting
AI & ML interests
Reasoning and Alignment for Large Language Models
Recent Activity
upvoted
a
paper
6 days ago
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence
upvoted
a
paper
7 days ago
Latent Collaboration in Multi-Agent Systems
upvoted
a
paper
13 days ago
DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research
View all activity
Organizations
dongguanting
's models
14
Sort: Recently updated
dongguanting/aepo_light
8B
•
Updated
Nov 3
•
3
dongguanting/Qwen2.5-7B-AEPO
Text Generation
•
8B
•
Updated
Oct 27
•
22
•
1
dongguanting/Qwen3-8B-AEPO-DeepSearch
Text Generation
•
8B
•
Updated
Oct 27
•
10
•
1
dongguanting/Qwen3-14B-AEPO-DeepSearch
Robotics
•
15B
•
Updated
Oct 21
•
8
•
1
dongguanting/Qwen2.5-7B-ARPO
Text Generation
•
8B
•
Updated
Aug 19
•
925
•
2
dongguanting/Llama3.1-8B-ARPO
Text Generation
•
8B
•
Updated
Aug 12
•
16
•
1
dongguanting/Qwen2.5-3B-ARPO
Text Generation
•
3B
•
Updated
Aug 12
•
13
•
3
dongguanting/Qwen3-14B-ARPO-DeepSearch
Text Generation
•
15B
•
Updated
Aug 12
•
18
•
5
dongguanting/Qwen3-8B-ARPO-DeepSearch
8B
•
Updated
Jul 29
•
9
•
2
dongguanting/Tool-Star-Qwen-7B
Text Generation
•
8B
•
Updated
Jun 30
•
44
•
2
dongguanting/RAG-Critic-3B
Text Generation
•
3B
•
Updated
Jun 28
•
83
•
3
dongguanting/Tool-Star-Qwen-0.5B
Text Generation
•
0.6B
•
Updated
Jun 6
•
4
•
1
dongguanting/Tool-Star-Qwen-1.5B
Text Generation
•
2B
•
Updated
Jun 6
•
7
•
2
dongguanting/Tool-Star-Qwen-3B
Text Generation
•
3B
•
Updated
May 25
•
8
•
5