ddh0's picture
Update README.md
af42041 verified
---
license: other
license_name: qwen
license_link: https://huggingface.co/Qwen/Qwen2.5-72B-Instruct/blob/main/LICENSE
language:
- en
pipeline_tag: text-generation
base_model:
- Qwen/Qwen2.5-72B
- Qwen/Qwen2.5-72B-Instruct
base_model_relation: merge
tags:
- chat
library_name: transformers
---
# Qwen2.5-72B-0.6x-Instruct
This is a linear merge of [Qwen/Qwen2.5-72B-Instruct](https://huggingface.co/Qwen/Qwen2.5-72B-Instruct) at weight `0.6` and [Qwen/Qwen2.5-72B](https://huggingface.co/Qwen/Qwen2.5-72B) at weight `0.4`.
The resulting model is 60% Instruct and 40% base model, hence the name **`0.6x-Instruct`**.
The goal of the merge was to make the Instruct model more flexible and less rigid. After some initial testing, I think the resulting model meets this goal, and I find it useful and interesting enough to warrant publishing.