RLHFlow
/

pair-preference-model-LLaMA3-8B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Haoxiang-Wang commited on May 24

Commit

e27e178

•

1 Parent(s): c128eb7

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -1,5 +1,5 @@
 ---
-{}
 ---
 This preference model is trained from [LLaMA3-8B-it](meta-llama/Meta-Llama-3-8B-Instruct) with the training script at [Reward Modeling](https://github.com/RLHFlow/RLHF-Reward-Modeling/tree/pm_dev/pair-pm).

 ---
+license: llama3
 ---
 This preference model is trained from [LLaMA3-8B-it](meta-llama/Meta-Llama-3-8B-Instruct) with the training script at [Reward Modeling](https://github.com/RLHFlow/RLHF-Reward-Modeling/tree/pm_dev/pair-pm).