Uploaded model

  • Developed by: fhai50032
  • License: apache-2.0
  • Finetuned from model : unsloth/Qwen2.5-7B-Instruct-unsloth-bnb-4bit

This qwen2 model was trained 2x faster with Unsloth and Huggingface's TRL library.

Downloads last month
102
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.

Model tree for fhai50032/Qwen2.5-GRPO-7B

Base model

Qwen/Qwen2.5-7B
Finetuned
(2)
this model
Finetunes
1 model