sanduntg
/

output

Generated from Trainer

Model card Files Files and versions

Metrics Training metrics Community

137 MB

1 contributor

History: 3 commits

sanduntg's picture

sanduntg/llama_2_dpo_with_reward_1000

27e25fd verified almost 2 years ago

runs
sanduntg/llama_2_dpo_with_reward_1000 almost 2 years ago
.gitattributes
1.52 kB

initial commit almost 2 years ago
README.md
1.12 kB

sanduntg/llama_2_dpo_with_reward_1000 almost 2 years ago
adapter_config.json
649 Bytes

sanduntg/llama_2_dpo_with_reward_1000 almost 2 years ago
adapter_model.safetensors
134 MB
xet

sanduntg/llama_2_dpo_with_reward_1000 almost 2 years ago
generation_config.json
183 Bytes

sanduntg/llama_2_dpo_with_reward_2 almost 2 years ago
special_tokens_map.json
438 Bytes

sanduntg/llama_2_dpo_with_reward_2 almost 2 years ago
tokenizer.json
1.84 MB

sanduntg/llama_2_dpo_with_reward_2 almost 2 years ago
tokenizer.model
500 kB
xet

sanduntg/llama_2_dpo_with_reward_2 almost 2 years ago
tokenizer_config.json
945 Bytes

sanduntg/llama_2_dpo_with_reward_2 almost 2 years ago
training_args.bin
4.48 kB
xet

sanduntg/llama_2_dpo_with_reward_1000 almost 2 years ago