matsuo-lab
commited on
Commit
•
d6fc432
1
Parent(s):
8ff1eaf
Update README.md
Browse files
README.md
CHANGED
@@ -17,7 +17,7 @@ This repository provides a Japanese-centric multilingual GPT-NeoX model of 10 bi
|
|
17 |
|
18 |
* **Pre-training**
|
19 |
|
20 |
-
The model was trained on around **600B** tokens from a mixture of the following corpora
|
21 |
|
22 |
- [Japanese C4](https://huggingface.co/datasets/mc4)
|
23 |
- [The Pile](https://huggingface.co/datasets/EleutherAI/pile)
|
|
|
17 |
|
18 |
* **Pre-training**
|
19 |
|
20 |
+
The model was trained on around **600B** tokens from a mixture of the following corpora.
|
21 |
|
22 |
- [Japanese C4](https://huggingface.co/datasets/mc4)
|
23 |
- [The Pile](https://huggingface.co/datasets/EleutherAI/pile)
|