8-bit quantization · #20 opened about 4 hours ago by ramkumarkoppu
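For readers unfamiliar with the thread topic, here is a minimal sketch of symmetric 8-bit quantization in Python. GGUF's Q8_0 format additionally uses per-block scales; the single per-tensor scale here is a simplification for illustration:

```python
import numpy as np

def quantize_q8(w: np.ndarray):
    """Symmetric 8-bit quantization: map floats to int8 with one shared scale."""
    scale = np.abs(w).max() / 127.0  # largest magnitude maps to +/-127
    q = np.round(w / scale).astype(np.int8)
    return q, scale

def dequantize_q8(q: np.ndarray, scale: float) -> np.ndarray:
    """Recover approximate float weights from int8 values and the scale."""
    return q.astype(np.float32) * scale

w = np.random.randn(4096).astype(np.float32)
q, scale = quantize_q8(w)
w_hat = dequantize_q8(q, scale)
print("max abs error:", np.abs(w - w_hat).max())
```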
New research paper: R1-type reasoning models can be drastically improved in quality · 1 reply · #19 opened 3 days ago by krustik
MD5 / SHA-256 hashes, please · 1 reply · #18 opened 5 days ago by ivanvolosyuk
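Until official checksums are posted, a download can be fingerprinted locally; Hugging Face also exposes a SHA-256 digest in each LFS file's metadata. A minimal sketch using the standard library (the shard filename is hypothetical):

```python
import hashlib

def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Stream the file in 1 MiB chunks so multi-GB GGUF shards fit in memory."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()

# Compare the result against the published digest (filename is hypothetical).
print(sha256_of("DeepSeek-R1-UD-IQ1_S-00001-of-00003.gguf"))
```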
Is there a model removing non-shared MoE experts? · 4 replies · #17 opened 5 days ago by ghostplant
A step-by-step deployment guide with Ollama · 3 replies · #16 opened 7 days ago by snowkylin
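Once a model is pulled, a local Ollama server (default port 11434) can be queried over its REST API. A minimal sketch; the model tag below is hypothetical, so substitute whatever `ollama list` reports:

```python
import json
import urllib.request

# Assumes a local Ollama server with the model already pulled.
payload = {
    "model": "deepseek-r1:671b",  # hypothetical tag
    "prompt": "Why is the sky blue?",
    "stream": False,
}
req = urllib.request.Request(
    "http://localhost:11434/api/generate",
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    print(json.loads(resp.read())["response"])
```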
No think tokens visible · 4 replies · #15 opened 7 days ago by sudkamath
Over 2 tok/sec aggregate, backed by NVMe SSD, on a 96 GB RAM + 24 GB VRAM AM5 rig with llama.cpp · 9 replies · #13 opened 8 days ago by ubergarm
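A minimal sketch of the kind of setup this thread describes, using the llama-cpp-python bindings: offload however many layers fit in VRAM and let the OS page the rest from NVMe via mmap. Paths and layer counts are assumptions, not values from the thread:

```python
from llama_cpp import Llama  # pip install llama-cpp-python

# Hypothetical path and layer count: a 24 GB GPU holds only part of the model,
# with the remaining layers mmap'd from NVMe and paged in on demand.
llm = Llama(
    model_path="DeepSeek-R1-UD-IQ1_S-00001-of-00003.gguf",
    n_gpu_layers=20,   # offload what fits in VRAM; remaining layers run on CPU
    n_ctx=2048,        # modest context to limit the KV cache footprint
    use_mmap=True,     # default; lets the OS page weights from disk as needed
)
out = llm("What is 2 + 2?", max_tokens=32)
print(out["choices"][0]["text"])
```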
Running the model with vLLM does not actually work · 8 replies · #12 opened 8 days ago by aikitoria
DeepSeek-R1-GGUF not available on LM Studio · 2 replies · #11 opened 8 days ago by 32SkyDive
Where did the BF16 come from? · 8 replies · #10 opened 8 days ago by gshpychka
Inference speed · 2 replies · #9 opened 9 days ago by Iker
Running this model using vLLM Docker · 2 replies · #8 opened 9 days ago by moficodes
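Once a vLLM container (e.g. the official vllm/vllm-openai image) is serving on port 8000, it can be queried through its OpenAI-compatible API; note that thread #12 above reports that this particular model does not actually run under vLLM. A minimal sketch, where the model name is whatever was passed to `--model` at launch:

```python
import json
import urllib.request

# Assumes a vLLM container already serving the OpenAI-compatible API on port 8000.
payload = {
    "model": "deepseek-ai/DeepSeek-R1",  # must match the name given at launch
    "prompt": "Hello",
    "max_tokens": 16,
}
req = urllib.request.Request(
    "http://localhost:8000/v1/completions",
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    print(json.loads(resp.read())["choices"][0]["text"])
```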
UD-IQ1_M models for distilled R1 versions? · 3 replies · #6 opened 9 days ago by SamPurkis
Llama.cpp server chat template · 2 replies · #4 opened 12 days ago by softwareweaver
Are the Q4 and Q5 models R1 or R1-Zero? · 18 replies · #2 opened 16 days ago by gng2info
What is the VRAM requirement to run this? · 5 replies · #1 opened 17 days ago by RageshAntony
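A rough back-of-the-envelope answer: weight memory is roughly parameter count times bits per weight divided by 8, before accounting for the KV cache and activations. A sketch for the ~671B-parameter model; the bits-per-weight values are illustrative approximations, not measured figures:

```python
def approx_weight_memory_gb(params_b: float, bits_per_weight: float) -> float:
    """Rough lower bound: parameters x bits/8, ignoring KV cache and activations."""
    return params_b * 1e9 * bits_per_weight / 8 / 1e9

# DeepSeek-R1 has ~671B total parameters; bpw values below are illustrative.
for name, bpw in [("Q8_0", 8.5), ("Q4_K_M", 4.8), ("IQ1_S (dynamic)", 1.58)]:
    print(f"{name}: ~{approx_weight_memory_gb(671, bpw):.0f} GB for weights alone")
```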