This is the LLaMAfied version of Qwen-14B-Chat model by Alibaba Cloud.

This model is converted with https://github.com/hiyouga/LLaMA-Factory/blob/main/tests/llamafy_qwen.py

The tokenizer is borrowed from https://huggingface.co/CausalLM/72B-preview-llamafied-qwen-llamafy

You may use this model for fine-tuning in downstream tasks, we recommend using our efficient fine-tuning toolkit. https://github.com/hiyouga/LLaMA-Factory

Usage:

from transformers import AutoModelForCausalLM, AutoTokenizer, TextStreamer

tokenizer = AutoTokenizer.from_pretrained("hiyouga/Qwen-14B-Chat-LLaMAfied")
model = AutoModelForCausalLM.from_pretrained("hiyouga/Qwen-14B-Chat-LLaMAfied", torch_dtype="auto", device_map="auto")
streamer = TextStreamer(tokenizer, skip_prompt=True, skip_special_tokens=True)

messages = [
    {"role": "user", "content": "Who are you?"}
]
inputs = tokenizer.apply_chat_template(messages, tokenize=True, add_generation_prompt=True, return_tensors="pt")
inputs = inputs.to("cuda")
generate_ids = model.generate(inputs, streamer=streamer)

You could also alternatively launch a CLI demo by using the script in LLaMA-Factory

python src/cli_demo.py --template qwen --model_name_or_path hiyouga/Qwen-14B-Chat-LLaMAfied

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 61.60
AI2 Reasoning Challenge (25-Shot) 57.51
HellaSwag (10-Shot) 82.11
MMLU (5-Shot) 65.57
TruthfulQA (0-shot) 51.99
Winogrande (5-shot) 72.93
GSM8k (5-shot) 39.50
Downloads last month
1,740
Safetensors
Model size
14.2B params
Tensor type
BF16
Β·
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model authors have turned it off explicitly.

Model tree for hiyouga/Qwen-14B-Chat-LLaMAfied

Quantizations
1 model

Spaces using hiyouga/Qwen-14B-Chat-LLaMAfied 3

Evaluation results