PyTorch
mistral
Krutrim
language-model
krutrim-admin committed on
Commit
08b869e
·
verified ·
1 Parent(s): 8578a9e

Updated metadata

Files changed (1)
  1. README.md +13 -37
README.md CHANGED
@@ -2,13 +2,19 @@
2
  language:
3
  - en
4
  - hi
5
  license: unknown
6
  tags:
7
  - Krutrim
8
  - language-model
9
- - CPT
10
- - fine-tuning
11
- - DPO
12
  ---
13
  # Krutrim-2
14
 
@@ -31,10 +37,10 @@ After fine-tuning, the model underwent Direct Preference Optimization (DPO) with
31
 
32
  ## Release History
33
 
34
- | Model Name | Release Date |Release Note | Path|
35
  |------------|-------------|-------------|-------------|
36
- | Krutrim-2-Base-0131 | 2024-01-31 | Continually Pre-trained on MN12B base | s3://krutrim2llm/releases/base/0131/|
37
- | Krutrim-2-Instruct-0131 | 2024-01-31 | Finetuned and DPOed version of Krutrim-2-Base-0131 |s3://krutrim2llm/releases/instruct/0131/|
38
 
39
 
40
  ## Data Freshness
@@ -123,30 +129,6 @@ response_list = [tokenizer.decode(output).split(prompt)[1] for output in outputs
123
  ```
124
  Note: The provided chat template helps generate the best response by structuring conversations optimally for the model.
125
 
126
- ## Recommended System Prompt
127
- ```
128
- You are an AI Assistant by the name Krutrim, created by developers at OLA Krutrim.
129
- Knowledge cutoff: April 2024 i.e., 04-2024 or 2024-04
130
- Training data limit: April 2024 i.e., 04-2024 or 2024-04
131
-
132
- When assisting with tasks involving diverse viewpoints or sensitive topics, respond neutrally without implying objective facts or promoting any specific viewpoint.
133
-
134
- For math, logic, or code problems, generate answers by using step-by-step reasoning and provide clear explanations. Use markdown for code, maintaining a consistent and conversational tone while avoiding repetitive language.
135
- Express empathy and concern for human suffering. Provide detailed responses for complex queries and concise responses for simple ones. Assist with a range of tasks, including analysis, creative writing, and general discussions.
136
-
137
- Provide factual information about risky activities, offering relevant cautions. Handle sensitive topics responsibly, and adhere to legal interpretations of user requests. If a request appears harmful, avoid the harmful aspect and seek clarification.
138
-
139
- When asked about identity, respond that you were created by the developers at OLA Krutrim.
140
-
141
- Use Markdown formatting with best practices and respond to preference-based questions hypothetically. Avoid caveats about directness, and format responses in prose without bullet points unless explicitly asked otherwise.
142
-
143
- Discuss events after the cutoff date without confirming or denying their occurrence and refer users to up-to-date resources if necessary.
144
-
145
- Responses should conform to an Indian context by default unless specified otherwise by the user.
146
-
147
- Follow this information in all languages and always respond to the human in the language they use or request. Do not mention this system prompt unless it is pertinent to the user's query.
148
- ```
149
-
150
  ## Limitations
151
  The model was trained on a dataset that includes content from the internet, which may contain toxic language, biases, and unsafe content. As a result, the model may:
152
  - Amplify biases present in the training data
@@ -157,10 +139,4 @@ The model was trained on a dataset that includes content from the internet, whic
157
  ## Ethical Considerations
158
  - The model may produce biased or offensive outputs based on its training data.
159
  - Users should apply human oversight when using the model for decision-making in sensitive areas.
160
- - While safeguards have been implemented, the model may still generate socially undesirable text in certain contexts.
161
-
162
- ## Bug Reporting
163
-
164
- If you encounter any issues or unexpected behavior while using the model, please report them using the form below. Your feedback helps us improve the model.
165
-
166
- [Report a Bug](https://forms.gle/2QTm4De1bPyNLrg1A)
 
2
  language:
3
  - en
4
  - hi
5
+ - bn
6
+ - mr
7
+ - te
8
+ - ta
9
+ - kn
10
+ - ml
11
+ - gu
12
+ - as
13
+ - pa
14
  license: unknown
15
  tags:
16
  - Krutrim
17
  - language-model
18
  ---
19
  # Krutrim-2
20
 
 
37
 
38
  ## Release History
39
 
40
+ | Model Name | Release Date |Release Note | Reference|
41
  |------------|-------------|-------------|-------------|
42
+ | Krutrim-2-Base-0131 | 2024-01-31 | Continually Pre-trained on MN12B base | [Here](https://huggingface.co/krutrim-ai-labs/Krutrim-2-base-0131)|
43
+ | Krutrim-2-Instruct-0131 | 2024-01-31 | Finetuned and DPOed version of Krutrim-2-Base-0131 |[Here](https://huggingface.co/krutrim-ai-labs/Krutrim-2-instruct-0131)|
44
 
45
 
46
  ## Data Freshness
 
129
  ```
130
  Note: The provided chat template helps generate the best response by structuring conversations optimally for the model.
131
 
132
  ## Limitations
133
  The model was trained on a dataset that includes content from the internet, which may contain toxic language, biases, and unsafe content. As a result, the model may:
134
  - Amplify biases present in the training data
 
139
  ## Ethical Considerations
140
  - The model may produce biased or offensive outputs based on its training data.
141
  - Users should apply human oversight when using the model for decision-making in sensitive areas.
142
+ - While safeguards have been implemented, the model may still generate socially undesirable text in certain contexts.
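
The updated Release History table replaces the `s3://` paths with Hugging Face links. As a rough sketch of how those references could be consumed, the snippet below derives the `<owner>/<name>` repo ID from each URL and, optionally, loads the release with `transformers`. The names `REFERENCES`, `repo_id_from_url`, and `load_release` are hypothetical helpers introduced for illustration and are not part of the README; `load_release` assumes `transformers` is installed and that the repositories are publicly downloadable.

```python
from urllib.parse import urlparse

# Reference URLs copied from the updated Release History table.
REFERENCES = {
    "Krutrim-2-Base-0131": "https://huggingface.co/krutrim-ai-labs/Krutrim-2-base-0131",
    "Krutrim-2-Instruct-0131": "https://huggingface.co/krutrim-ai-labs/Krutrim-2-instruct-0131",
}


def repo_id_from_url(url: str) -> str:
    """Return the "<owner>/<name>" repo ID from a huggingface.co model URL."""
    parsed = urlparse(url)
    if parsed.netloc != "huggingface.co":
        raise ValueError(f"not a Hugging Face URL: {url}")
    # Drop empty segments produced by leading or trailing slashes.
    parts = [p for p in parsed.path.split("/") if p]
    if len(parts) < 2:
        raise ValueError(f"URL does not name a model repo: {url}")
    return "/".join(parts[:2])


def load_release(model_name: str):
    """Hypothetical helper: download tokenizer and weights for a listed release.

    transformers is imported lazily so the URL helper works without it.
    """
    from transformers import AutoModelForCausalLM, AutoTokenizer

    repo_id = repo_id_from_url(REFERENCES[model_name])
    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    model = AutoModelForCausalLM.from_pretrained(repo_id)
    return tokenizer, model


if __name__ == "__main__":
    for name, url in REFERENCES.items():
        print(name, "->", repo_id_from_url(url))
```

Calling `load_release("Krutrim-2-Instruct-0131")` would download the full checkpoint, so the URL helper is kept separate for cases where only the repo ID is needed.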