arxiv:2510.00237
Hejian Sang
pb09204048
ยท
AI & ML interests
None yet
Recent Activity
authored
a paper
about 7 hours ago
Debunk the Myth of SFT Generalization
upvoted
a
paper
4 months ago
Debunk the Myth of SFT Generalization
upvoted
a
paper
4 months ago
Planner-R1: Reward Shaping Enables Efficient Agentic RL with Smaller
LLMs
Organizations
None yet