justinj92 committed
Commit 88d842f · verified · 1 Parent(s): ad3b2c3

Update README.md

Files changed (1)
  1. README.md +6 -4
README.md CHANGED
@@ -1,15 +1,17 @@
 ---
 base_model: Qwen/Qwen2.5-1.5B-Instruct
 library_name: transformers
-model_name: Qwen-1.5B-GRPO
+model_name: Qwen2.5-1.5B-Thinking
 tags:
 - generated_from_trainer
 - trl
 - grpo
 licence: license
+datasets:
+- microsoft/orca-math-word-problems-200k
 ---
 
-# Model Card for Qwen-1.5B-GRPO
+# Model Card for Qwen2.5-1.5B-Thinking
 
 This model is a fine-tuned version of [Qwen/Qwen2.5-1.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-1.5B-Instruct).
 It has been trained using [TRL](https://github.com/huggingface/trl).
@@ -19,8 +21,8 @@ It has been trained using [TRL](https://github.com/huggingface/trl).
 ```python
 from transformers import pipeline
 
-question = "If you had a time machine, but could only go to the past or the future once and never return, which would you choose and why?"
-generator = pipeline("text-generation", model="justinj92/Qwen-1.5B-GRPO", device="cuda")
+question = "Mia can decorate 2 dozen Easter eggs per hour. Her little brother Billy can only decorate 10 eggs per hour. They need to decorate 170 eggs for the Easter egg hunt. If they work together, how long will it take them to decorate all the eggs?"
+generator = pipeline("text-generation", model="justinj92/Qwen2.5-1.5B-Thinking", device="cuda")
 output = generator([{"role": "user", "content": question}], max_new_tokens=128, return_full_text=False)[0]
 print(output["generated_text"])
 ```
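
For reference, the new sample prompt has a closed-form answer, so it can be checked with plain arithmetic independent of the model. A minimal sketch of the expected reasoning (ordinary Python, not model output):

```python
# Work-rate check for the sample prompt (not model output).
mia_rate = 2 * 12                            # "2 dozen" eggs/hour = 24 eggs/hour
billy_rate = 10                              # eggs/hour
total_eggs = 170
hours = total_eggs / (mia_rate + billy_rate)  # 170 / 34
print(hours)                                  # 5.0 -> working together takes 5 hours
```

Note that the quick-start snippet in the diff passes `device="cuda"`, which assumes a GPU is available; on a CPU-only machine, `device="cpu"` (or `device_map="auto"` with accelerate installed) would be the usual substitute.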