Khetterman
/

Kosmos-8B-v1-GGUF

Text Generation

Not-For-All-Audiences

Inference Endpoints

Model card Files Files and versions Community

Kosmos-8B-v1 GGUF Quantizations 🗲

The serenity of infinity is not the end.

This model was converted to GGUF format using llama.cpp.

For more information of the model, see the original model card: Khetterman/Kosmos-8B-v1.

Available Quantizations (◕‿◕)

Type	Quantized GGUF Model	Size
Q4_0	Khetterman/Kosmos-8B-v1-Q4_0.gguf	4.34 GiB
Q6_K	Khetterman/Kosmos-8B-v1-Q6_K.gguf	6.14 GiB
Q8_0	Khetterman/Kosmos-8B-v1-Q8_0.gguf	7.95 GiB

My thanks to the authors of the original models, your work is incredible. Have a good time 🖤

Downloads last month: 42

GGUF

Model size

8.03B params

Architecture

llama

4-bit

6-bit

8-bit

Inference Providers NEW

Text Generation

This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for Khetterman/Kosmos-8B-v1-GGUF

Base model

Khetterman/Kosmos-8B-v1

Quantized

(7)

this model