Fix dataset link
#1
by
ZennyKenny
- opened
README.md
CHANGED
@@ -18,7 +18,7 @@ If you have any question with this reward model and also any question about rewa
|
|
18 |
|
19 |
<!-- Provide a longer summary of what this model is. -->
|
20 |
|
21 |
-
The model is trained on a mixture of the following datasets. We also provide the mixture in [weqweasdas/preference_dataset_mixture2_and_safe_pku](weqweasdas/preference_dataset_mixture2_and_safe_pku).
|
22 |
- [HH-RLHF](https://huggingface.co/datasets/Anthropic/hh-rlhf)
|
23 |
- [SHP](https://huggingface.co/datasets/stanfordnlp/SHP)
|
24 |
- [UltraFeedback](https://huggingface.co/datasets/openbmb/UltraFeedback)
|
|
|
18 |
|
19 |
<!-- Provide a longer summary of what this model is. -->
|
20 |
|
21 |
+
The model is trained on a mixture of the following datasets. We also provide the mixture in [weqweasdas/preference_dataset_mixture2_and_safe_pku](https://huggingface.co/datasets/weqweasdas/preference_dataset_mixture2_and_safe_pku).
|
22 |
- [HH-RLHF](https://huggingface.co/datasets/Anthropic/hh-rlhf)
|
23 |
- [SHP](https://huggingface.co/datasets/stanfordnlp/SHP)
|
24 |
- [UltraFeedback](https://huggingface.co/datasets/openbmb/UltraFeedback)
|