Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
adriata
/
dpo_calc_mistral
like
0
Text Generation
Transformers
Safetensors
mistral
unsloth
trl
dpo
conversational
text-generation-inference
Inference Endpoints
arxiv:
1910.09700
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
dpo_calc_mistral
Commit History
Trained with Unsloth
9b76c16
verified
adriata
commited on
Mar 13
Upload tokenizer
75f14f0
verified
adriata
commited on
Mar 13
Trained with Unsloth
b5cc557
verified
adriata
commited on
Mar 13
Upload tokenizer
2d3225b
verified
adriata
commited on
Mar 13
initial commit
0fd0b60
verified
adriata
commited on
Mar 13