Uploaded model

Developed by: fhai50032
License: apache-2.0
Finetuned from model : fhai50032/Qwen2.5-GRPO-7B

This qwen2 model was trained 2x faster with Unsloth and Huggingface's TRL library.

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model is not currently available via any of the supported third-party Inference Providers, and HF Inference API was unable to determine this model’s pipeline type.

Model tree for fhai50032/Qwen-Thinking-7b-LORA

Base model

unsloth/Qwen2.5-7B-Instruct-unsloth-bnb-4bit

Finetuned

fhai50032/Qwen2.5-GRPO-7B

Finetuned

(1)

this model