Update README.md

README.md CHANGED

@@ -4,10 +4,10 @@ license: mit
 # SEA-LION-7B-Instruct
 
 SEA-LION is a collection of Large Language Models (LLMs) which has been pretrained and instruct-tuned for the Southeast Asia (SEA) region.
-The
+The sizes of the models range from 3 billion to 7 billion parameters.
 
 SEA-LION-7B-Instruct is a multilingual model which has been fine-tuned with **thousands of English and Indonesian instruction-completion pairs** alongside a smaller pool of instruction-completion pairs from other ASEAN languages.
-These instructions have been carefully curated and rewritten to ensure the model
+These instructions have been carefully curated and rewritten to ensure the model was trained on truly open, commercially permissive and high quality datasets.
 
 SEA-LION stands for _Southeast Asian Languages In One Network_.
 
@@ -19,7 +19,7 @@ SEA-LION stands for _Southeast Asian Languages In One Network_.
 
 ## Model Details
 ### Base model
-We
+We performed instruction tuning in English and Indonesian on our [pre-trained SEA-LION-7B](https://huggingface.co/aisingapore/sea-lion-7b), a decoder model using the MPT architecture, to create SEA-LION-7B-Instruct.
 
 ### Benchmark Performance
 We evaluated SEA-LION-7B-Instruct on the BHASA benchmark ([arXiv](https://arxiv.org/abs/2309.06085v2) and [GitHub](https://github.com/aisingapore/bhasa)) across a variety of tasks.
@@ -131,8 +131,7 @@ For more info, please contact us using this [SEA-LION Inquiry Form](https://form
 
 ## Disclaimer
 
-This the repository for the commercial instruction-tuned model.
+This is the repository for the commercial instruction-tuned model.
 The model has _not_ been aligned for safety.
 Developers and users should perform their own safety fine-tuning and related security measures.
-In no event shall the authors be held liable for any
-arising from the use of the released weights and codes.
+In no event shall the authors be held liable for any claims, damages, or other liabilities arising from the use of the released weights and codes.