Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -18,7 +18,7 @@ If you have any question with this reward model and also any question about rewa
 
  <!-- Provide a longer summary of what this model is. -->
 
- The model is trained on a mixture of the following datasets. We also provide the mixture in [weqweasdas/preference_dataset_mixture2_and_safe_pku](weqweasdas/preference_dataset_mixture2_and_safe_pku).
+ The model is trained on a mixture of the following datasets. We also provide the mixture in [weqweasdas/preference_dataset_mixture2_and_safe_pku](https://huggingface.co/datasets/weqweasdas/preference_dataset_mixture2_and_safe_pku).
  - [HH-RLHF](https://huggingface.co/datasets/Anthropic/hh-rlhf)
  - [SHP](https://huggingface.co/datasets/stanfordnlp/SHP)
  - [UltraFeedback](https://huggingface.co/datasets/openbmb/UltraFeedback)