You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

decisionslab/DeepSeek-R1-Distill-Llama-8B-4bit-HK

The Model decisionslab/DeepSeek-R1-Distill-Llama-8B-4bit-HK was converted to MLX format from deepseek-ai/DeepSeek-R1-Distill-Llama-8B using mlx-lm version 0.21.1.

Use with mlx

pip install mlx-lm
from mlx_lm import load, generate

model, tokenizer = load("decisionslab/DeepSeek-R1-Distill-Llama-8B-4bit-HK")

prompt = "hello"

if tokenizer.chat_template is not None:
    messages = [{"role": "user", "content": prompt}]
    prompt = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True
    )

response = generate(model, tokenizer, prompt=prompt, verbose=True)
Downloads last month
0
Safetensors
Model size
1.25B params
Tensor type
FP16
·
U32
·
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for decisionslab/DeepSeek-R1-Distill-Llama-8B-4bit-HK

Quantized
(98)
this model

Collection including decisionslab/DeepSeek-R1-Distill-Llama-8B-4bit-HK