Uploaded model
- Developed by: fhai50032
- License: apache-2.0
- Finetuned from model : fhai50032/Qwen2.5-GRPO-7B
This qwen2 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Inference Providers
NEW
This model is not currently available via any of the supported third-party Inference Providers, and
HF Inference API was unable to determine this model’s pipeline type.
Model tree for fhai50032/Qwen-Thinking-7b-LORA
Base model
unsloth/Qwen2.5-7B-Instruct-unsloth-bnb-4bit
Finetuned
fhai50032/Qwen2.5-GRPO-7B