Research into RLAIF (Reinforcement Learning from AI Feedback), aimed at Constitutional AI and sycophancy resistance.
TitleOS
AI & ML interests
I break the Xbox One/Series. Featured on OSGWiki. Former Xbox MVP. Previously InfoSec at Apple, then SRE at DreamBox Learning, now looking for a new opportunity. Artificial Intelligence LLM enthusiast, wannabe expert. They/Them. 🏳️‍🌈
Recent Activity
liked a dataset about 14 hours ago: NousResearch/hermes-function-calling-v1
liked a model 1 day ago: google/gemma-3-12b-it
liked a model 2 days ago: zai-org/GLM-5