aisingapore
/

sea-lion-3b

Text Generation

Model card Files Files and versions Community

dotw commited on Oct 24, 2023

Commit

06158fe

•

1 Parent(s): 88e07b3

Update README.md

Files changed (1) hide show

README.md +19 -11

README.md CHANGED Viewed

@@ -2,27 +2,35 @@
 license: apache-2.0
 ---
-# Model Card for Model ID
 <!-- Provide a quick summary of what the model is/does. -->
-This modelcard aims to be a base template for new models. It has been generated using [this raw template](https://github.com/huggingface/huggingface_hub/blob/main/src/huggingface_hub/templates/modelcard_template.md?plain=1).
 ## Model Details
 ### Model Description
 <!-- Provide a longer summary of what this model is. -->
-- **Developed by:** [More Information Needed]
-- **Funded by [optional]:** [More Information Needed]
 - **Shared by [optional]:** [More Information Needed]
 - **Model type:** [More Information Needed]
-- **Language(s) (NLP):** [More Information Needed]
-- **License:** [More Information Needed]
-- **Finetuned from model [optional]:** [More Information Needed]
 ### Model Sources [optional]

 license: apache-2.0
 ---
+# Model Card for SEA LION
 <!-- Provide a quick summary of what the model is/does. -->
+SEA LION is a collection of LLMs which has been pretrained and instruct-tuned for the Southeast Asia region.
+The models range from 3 billion to 7 billion parameters.
+This is the repository for the 3B pretrained model.
 ## Model Details
 ### Model Description
 <!-- Provide a longer summary of what this model is. -->
+The SEA LION model is a significant leap forward in the field of natural language processing and understanding,
+specifically trained to understand South-East Asia (SEA) regional context.
+SEA LION stands for SouthEast Asian Languages In One Network.
+The SEA LION model comes in two variants, one with 3 billion parameters and another with 7 billion parameters.
+Both variants are built on the robust MPT architecture and utilize a vocabulary size of 256K.
+The model employs our proprietary SEABPETokenizer for tokenization.
+Our SEABPETokenizer is specially tailored for SEA languages, ensuring optimal model performance.
+The training data for SEA LION is encompasses 1 trillion tokens.
+- **Developed by:** Products Pillar, AI Singapore
+- **Funded by [optional]:** Singapore NRF
 - **Shared by [optional]:** [More Information Needed]
 - **Model type:** [More Information Needed]
+- **Language(s) (NLP):** English, Chinese, Indonesian, Malay, Thai, Vietnamese, Filipino/Tagalog, Tamil, Burnese, Khmer, Lao
+- **License:** Apache 2.0
+- **Finetuned from model [optional]:** N/A
 ### Model Sources [optional]