Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
1
61
14
taicheng guo
taicheng
Follow
lx865712528's profile picture
gjy13510451506's profile picture
Gargaz's profile picture
12 followers
·
59 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 2 months ago
Group-in-Group Policy Optimization for LLM Agent Training
upvoted
a
paper
3 months ago
Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning
liked
a model
4 months ago
meta-llama/Llama-3.2-3B
View all activity
Organizations
Papers
5
arxiv:
2510.12831
arxiv:
2402.18679
arxiv:
2402.05138
arxiv:
2402.01680
Expand 5 papers
models
46
Sort: Recently updated
taicheng/zephyr-7b-align-scan-0.0-0.0-linear-1
Text Generation
•
7B
•
Updated
Sep 28, 2024
•
3
taicheng/zephyr-7b-align-scan-0.0-0.0-polynomial-1
Text Generation
•
7B
•
Updated
Sep 28, 2024
taicheng/zephyr-7b-align-scan-0.0-0.0-cosine-2
Text Generation
•
7B
•
Updated
Sep 28, 2024
•
4
taicheng/zephyr-7b-align-scan-0.0-0.0-polynomial-2
Text Generation
•
7B
•
Updated
Sep 28, 2024
•
1
taicheng/zephyr-7b-align-scan-0.0-0.0-polynomial-3
Text Generation
•
7B
•
Updated
Sep 28, 2024
•
1
taicheng/zephyr-7b-align-scan-0.0-0.0-linear-3
Text Generation
•
7B
•
Updated
Sep 28, 2024
•
4
taicheng/zephyr-7b-align-scan
Text Generation
•
7B
•
Updated
Sep 28, 2024
•
4
taicheng/zephyr-7b-align-scan-1e-07-0.27-polynomial-1.0
Updated
Sep 28, 2024
taicheng/zephyr-7b-align-scan-7e-07-0.45-cosine-3.0
Text Generation
•
7B
•
Updated
Sep 28, 2024
•
1
taicheng/zephyr-7b-align-scan-6e-07-0.53-polynomial-2.0
Text Generation
•
7B
•
Updated
Sep 28, 2024
View 46 models
datasets
0
None public yet