-
Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning
Paper • 2506.03136 • Published • 25 -
Vibe Checker: Aligning Code Evaluation with Human Preference
Paper • 2510.07315 • Published • 32 -
VeriEquivBench: An Equivalence Score for Ground-Truth-Free Evaluation of Formally Verifiable Code
Paper • 2510.06296 • Published -
Strengthening Programming Comprehension in Large Language Models through Code Generation
Paper • 2508.12620 • Published
Ernan Hughes
ernanhughes
AI & ML interests
Making movies with AI
Recent Activity
updated
a collection
8 days ago
code
updated
a collection
8 days ago
code
updated
a collection
8 days ago
code
Organizations
None yet