Update README.md
Browse files
README.md
CHANGED
@@ -1,15 +1,17 @@
|
|
1 |
---
|
2 |
base_model: Qwen/Qwen2.5-1.5B-Instruct
|
3 |
library_name: transformers
|
4 |
-
model_name:
|
5 |
tags:
|
6 |
- generated_from_trainer
|
7 |
- trl
|
8 |
- grpo
|
9 |
licence: license
|
|
|
|
|
10 |
---
|
11 |
|
12 |
-
# Model Card for
|
13 |
|
14 |
This model is a fine-tuned version of [Qwen/Qwen2.5-1.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-1.5B-Instruct).
|
15 |
It has been trained using [TRL](https://github.com/huggingface/trl).
|
@@ -19,8 +21,8 @@ It has been trained using [TRL](https://github.com/huggingface/trl).
|
|
19 |
```python
|
20 |
from transformers import pipeline
|
21 |
|
22 |
-
question = "
|
23 |
-
generator = pipeline("text-generation", model="justinj92/
|
24 |
output = generator([{"role": "user", "content": question}], max_new_tokens=128, return_full_text=False)[0]
|
25 |
print(output["generated_text"])
|
26 |
```
|
|
|
1 |
---
|
2 |
base_model: Qwen/Qwen2.5-1.5B-Instruct
|
3 |
library_name: transformers
|
4 |
+
model_name: Qwen2.5-1.5B-Thinking
|
5 |
tags:
|
6 |
- generated_from_trainer
|
7 |
- trl
|
8 |
- grpo
|
9 |
licence: license
|
10 |
+
datasets:
|
11 |
+
- microsoft/orca-math-word-problems-200k
|
12 |
---
|
13 |
|
14 |
+
# Model Card for Qwen2.5-1.5B-Thinking
|
15 |
|
16 |
This model is a fine-tuned version of [Qwen/Qwen2.5-1.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-1.5B-Instruct).
|
17 |
It has been trained using [TRL](https://github.com/huggingface/trl).
|
|
|
21 |
```python
|
22 |
from transformers import pipeline
|
23 |
|
24 |
+
question = "Mia can decorate 2 dozen Easter eggs per hour. Her little brother Billy can only decorate 10 eggs per hour. They need to decorate 170 eggs for the Easter egg hunt. If they work together, how long will it take them to decorate all the eggs?"
|
25 |
+
generator = pipeline("text-generation", model="justinj92/Qwen2.5-1.5B-Thinking", device="cuda")
|
26 |
output = generator([{"role": "user", "content": question}], max_new_tokens=128, return_full_text=False)[0]
|
27 |
print(output["generated_text"])
|
28 |
```
|