Triangle104
/

DS-R1-Distill-Q2.5-14B-Harmony_V0.1

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

merge

This is a merge of pre-trained language models created using mergekit.

Merge Method

This model was merged using the SLERP merge method.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

models:
  - model: huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated-v2
  - model: deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
merge_method: slerp
base_model: deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
dtype: bfloat16
parameters:
  t: [0, 0.5, 0.7, 1, 0.7, 0.5, 0]

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	35.74
IFEval (0-Shot)	45.15
BBH (3-Shot)	38.72
MATH Lvl 5 (4-Shot)	39.50
GPQA (0-shot)	19.13
MuSR (0-shot)	31.92
MMLU-PRO (5-shot)	40.01

Downloads last month: 170

Safetensors

Model size

14.8B params

Tensor type

BF16

·

Inference Providers NEW

Text Generation

This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for Triangle104/DS-R1-Distill-Q2.5-14B-Harmony_V0.1

deepseek-ai/DeepSeek-R1-Distill-Qwen-14B

huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated-v2

Merge model

this model

Quantizations

Collections including Triangle104/DS-R1-Distill-Q2.5-14B-Harmony_V0.1

Qwen

Alibaba Cloud-based models • 1081 items • Updated about 2 hours ago • 4

Merges

Personal Merges • 94 items • Updated about 2 hours ago • 1

Evaluation results

strict accuracy on IFEval (0-Shot)
Open LLM Leaderboard

45.150
normalized accuracy on BBH (3-Shot)
Open LLM Leaderboard

38.720
exact match on MATH Lvl 5 (4-Shot)
Open LLM Leaderboard

39.500
acc_norm on GPQA (0-shot)
Open LLM Leaderboard

19.130
acc_norm on MuSR (0-shot)
Open LLM Leaderboard

31.920
accuracy on MMLU-PRO (5-shot)
test set Open LLM Leaderboard

40.010

View on Papers With Code