Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
BKM1804
/
Hermes-2-Pro-Mistral-7B-10e14612-7986-40bd-ac61-53f567641e65-dpo-tuned
like
0
Transformers
Safetensors
Generated from Trainer
trl
sft
dpo
arxiv:
2305.18290
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
Hermes-2-Pro-Mistral-7B-10e14612-7986-40bd-ac61-53f567641e65-dpo-tuned
/
training_args.bin
Commit History
End of training
b16446c
verified
BKM1804
commited on
May 19, 2025