ericrisco
/

salamandra-2b-r1

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

ericrisco commited on 4 days ago

Commit

4743df1

·

verified ·

1 Parent(s): 93c9278

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ datasets:
 - ericrisco/gsm8k-translated-spanish
 - openai/gsm8k
 ---
-# Salamandra Model Card
 Salamandra is a highly multilingual model pre-trained from scratch that comes in different sizes. This model card corresponds to the **2B instructed version**, fine-tuned using **GRPO (Group Reward Policy Optimization)** and **Unsloth**.

 - ericrisco/gsm8k-translated-spanish
 - openai/gsm8k
 ---
+# Salamandra 2B Reasoning R1 Model Card
 Salamandra is a highly multilingual model pre-trained from scratch that comes in different sizes. This model card corresponds to the **2B instructed version**, fine-tuned using **GRPO (Group Reward Policy Optimization)** and **Unsloth**.