weqweasdas
/

RM-Mistral-7B

Text Classification

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Fix dataset link

#1

by ZennyKenny - opened Mar 31

base: refs/heads/main

←

from: refs/pr/1

Discussion Files changed

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -18,7 +18,7 @@ If you have any question with this reward model and also any question about rewa
 <!-- Provide a longer summary of what this model is. -->
-The model is trained on a mixture of the following datasets. We also provide the mixture in [weqweasdas/preference_dataset_mixture2_and_safe_pku](weqweasdas/preference_dataset_mixture2_and_safe_pku).
 - [HH-RLHF](https://huggingface.co/datasets/Anthropic/hh-rlhf)
 - [SHP](https://huggingface.co/datasets/stanfordnlp/SHP)
 - [UltraFeedback](https://huggingface.co/datasets/openbmb/UltraFeedback)

 <!-- Provide a longer summary of what this model is. -->
+The model is trained on a mixture of the following datasets. We also provide the mixture in [weqweasdas/preference_dataset_mixture2_and_safe_pku](https://huggingface.co/datasets/weqweasdas/preference_dataset_mixture2_and_safe_pku).
 - [HH-RLHF](https://huggingface.co/datasets/Anthropic/hh-rlhf)
 - [SHP](https://huggingface.co/datasets/stanfordnlp/SHP)
 - [UltraFeedback](https://huggingface.co/datasets/openbmb/UltraFeedback)