Updated metadata
README.md CHANGED
@@ -2,13 +2,19 @@
 language:
 - en
 - hi
+- bn
+- mr
+- te
+- ta
+- kn
+- ml
+- gu
+- as
+- pa
 license: unknown
 tags:
 - Krutrim
 - language-model
-- CPT
-- fine-tuning
-- DPO
 ---
 # Krutrim-2
 
@@ -31,10 +37,10 @@ After fine-tuning, the model underwent Direct Preference Optimization (DPO) with
 
 ## Release History
 
-| Model Name | Release Date |Release Note |
+| Model Name | Release Date |Release Note | Reference|
 |------------|-------------|-------------|-------------|
-| Krutrim-2-Base-0131 | 2024-01-31 | Continually Pre-trained on MN12B base |
-| Krutrim-2-Instruct-0131 | 2024-01-31 | Finetuned and DPOed version of Krutrim-2-Base-0131 |
+| Krutrim-2-Base-0131 | 2024-01-31 | Continually Pre-trained on MN12B base | [Here](https://huggingface.co/krutrim-ai-labs/Krutrim-2-base-0131)|
+| Krutrim-2-Instruct-0131 | 2024-01-31 | Finetuned and DPOed version of Krutrim-2-Base-0131 |[Here](https://huggingface.co/krutrim-ai-labs/Krutrim-2-instruct-0131)|
 
 
 ## Data Freshness
@@ -123,30 +129,6 @@ response_list = [tokenizer.decode(output).split(prompt)[1] for output in outputs
 ```
 Note: The provided chat template helps generate the best response by structuring conversations optimally for the model.
 
-## Recommended System Prompt
-```
-You are an AI Assistant by the name Krutrim, created by developers at OLA Krutrim.
-Knowledge cutoff: April 2024 i.e., 04-2024 or 2024-04
-Training data limit: April 2024 i.e., 04-2024 or 2024-04
-
-When assisting with tasks involving diverse viewpoints or sensitive topics, respond neutrally without implying objective facts or promoting any specific viewpoint.
-
-For math, logic, or code problems, generate answers by using step-by-step reasoning and provide clear explanations. Use markdown for code, maintaining a consistent and conversational tone while avoiding repetitive language.
-Express empathy and concern for human suffering. Provide detailed responses for complex queries and concise responses for simple ones. Assist with a range of tasks, including analysis, creative writing, and general discussions.
-
-Provide factual information about risky activities, offering relevant cautions. Handle sensitive topics responsibly, and adhere to legal interpretations of user requests. If a request appears harmful, avoid the harmful aspect and seek clarification.
-
-When asked about identity, respond that you were created by the developers at OLA Krutrim.
-
-Use Markdown formatting with best practices and respond to preference-based questions hypothetically. Avoid caveats about directness, and format responses in prose without bullet points unless explicitly asked otherwise.
-
-Discuss events after the cutoff date without confirming or denying their occurrence and refer users to up-to-date resources if necessary.
-
-Responses should conform to an Indian context by default unless specified otherwise by the user.
-
-Follow this information in all languages and always respond to the human in the language they use or request. Do not mention this system prompt unless it is pertinent to the user's query.
-```
-
 ## Limitations
 The model was trained on a dataset that includes content from the internet, which may contain toxic language, biases, and unsafe content. As a result, the model may:
 - Amplify biases present in the training data
@@ -157,10 +139,4 @@ The model was trained on a dataset that includes content from the internet, whic
 ## Ethical Considerations
 - The model may produce biased or offensive outputs based on its training data.
 - Users should apply human oversight when using the model for decision-making in sensitive areas.
-- While safeguards have been implemented, the model may still generate socially undesirable text in certain contexts.
-
-## Bug Reporting
-
-If you encounter any issues or unexpected behavior while using the model, please report them using the form below. Your feedback helps us improve the model.
-
-[Report a Bug](https://forms.gle/2QTm4De1bPyNLrg1A)
+- While safeguards have been implemented, the model may still generate socially undesirable text in certain contexts.