Wanwei He
Grocery
AI & ML interests
LLM
Recent Activity
liked a model about 16 hours ago
Qwen/Qwen3.5-35B-A3B
commented on a paper 6 months ago
Implicit Actor Critic Coupling via a Supervised Learning Framework for RLVR
upvoted a paper 6 months ago
Implicit Actor Critic Coupling via a Supervised Learning Framework for RLVR