Yunzhi Yao
cowTodd
AI & ML interests
None yet
Recent Activity
authored
a paper
3 days ago
From Data to Behavior: Predicting Unintended Model Behaviors Before Training
upvoted
a
paper
4 days ago
From Data to Behavior: Predicting Unintended Model Behaviors Before Training
authored
a paper
8 days ago
Why Steering Works: Toward a Unified View of Language Model Parameter Dynamics