YAML Metadata Warning: empty or missing yaml metadata in repo card

Check out the documentation for more information.

Language Model Learns to Mislead Humans via RLHF

This repository contains the RLHF'ed code generation model in our paper: https://arxiv.org/pdf/2409.12822.

It's initialized based on deepseek-coder-7B.

Downloads last month
2
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Paper for jiaxin-wen/MisleadLM-code