arxiv:2405.14758
Junlin Wu
jlwu002
AI & ML interests
None yet
Recent Activity
updated
a dataset
about 1 month ago
jlwu002/sr1_dataset
published
a dataset
about 1 month ago
jlwu002/sr1_dataset
authored
a paper
9 months ago
On the Exploitability of Reinforcement Learning with Human Feedback for
Large Language Models