Update README.md
README.md
CHANGED
@@ -15,7 +15,7 @@ datasets:
 - ericrisco/gsm8k-translated-spanish
 - openai/gsm8k
 ---
-# Salamandra Model Card
+# Salamandra 2B Reasoning R1 Model Card
 
 Salamandra is a highly multilingual model pre-trained from scratch that comes in different sizes. This model card corresponds to the **2B instructed version**, fine-tuned using **GRPO (Group Reward Policy Optimization)** and **Unsloth**.
 