Update README.md
Browse files
README.md
CHANGED
|
@@ -11,6 +11,9 @@ pipeline_tag: text-generation
|
|
| 11 |
|
| 12 |
Mistral-7B-v0.1 model fine-tuned on the Ultrafeedback dataset using techinques shown in the paper [Self-Rewarding Language Models](https://arxiv.org/abs/2401.10020).
|
| 13 |
|
|
|
|
|
|
|
|
|
|
| 14 |
## Instruction format
|
| 15 |
|
| 16 |
In order to leverage instruction fine-tuning, your prompt should be surrounded by `[INST]` and `[/INST]` tokens. The very first instruction should begin with a begin of sentence id. The next instructions should not. The assistant generation will be ended by the end-of-sentence token id.
|
|
|
|
| 11 |
|
| 12 |
Mistral-7B-v0.1 model fine-tuned on the Ultrafeedback dataset using techinques shown in the paper [Self-Rewarding Language Models](https://arxiv.org/abs/2401.10020).
|
| 13 |
|
| 14 |
+
|
| 15 |
+

|
| 16 |
+
|
| 17 |
## Instruction format
|
| 18 |
|
| 19 |
In order to leverage instruction fine-tuning, your prompt should be surrounded by `[INST]` and `[/INST]` tokens. The very first instruction should begin with a begin of sentence id. The next instructions should not. The assistant generation will be ended by the end-of-sentence token id.
|