Kosmos-8B-v1 GGUF Quantizations 🗲

The serenity of infinity is not the end.


This model was converted to GGUF format using llama.cpp.

For more information about the model, see the original model card: Khetterman/Kosmos-8B-v1.

Available Quantizations (◕‿◕)

Type    Quantized GGUF Model                  Size
Q4_0    Khetterman/Kosmos-8B-v1-Q4_0.gguf     4.34 GiB
Q6_K    Khetterman/Kosmos-8B-v1-Q6_K.gguf     6.14 GiB
Q8_0    Khetterman/Kosmos-8B-v1-Q8_0.gguf     7.95 GiB
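
To help compare the files listed above, here is a rough sketch that derives the effective bits per weight from each file size and the 8.03B parameter count reported for this model. The helper names (`bits_per_weight`, `largest_fit`) and the 1 GiB headroom for context and runtime buffers are illustrative assumptions, not part of llama.cpp or the original model card.

```python
GIB = 1024 ** 3   # bytes per GiB
PARAMS = 8.03e9   # parameter count reported for this model

# File sizes in GiB, copied from the quantization list above.
QUANT_SIZES_GIB = {"Q4_0": 4.34, "Q6_K": 6.14, "Q8_0": 7.95}


def bits_per_weight(size_gib, params=PARAMS):
    """Average bits stored per parameter for a file of the given size."""
    return size_gib * GIB * 8 / params


def largest_fit(budget_gib, headroom_gib=1.0):
    """Pick the largest quantization whose file fits in the memory budget.

    headroom_gib is an assumed allowance for context and runtime buffers;
    adjust it for your own setup. Returns None if nothing fits.
    """
    fitting = {
        name: size
        for name, size in QUANT_SIZES_GIB.items()
        if size + headroom_gib <= budget_gib
    }
    return max(fitting, key=fitting.get) if fitting else None


if __name__ == "__main__":
    for name, size in QUANT_SIZES_GIB.items():
        print(f"{name}: {bits_per_weight(size):.2f} bits per weight")
    print("Suggested for an 8 GiB budget:", largest_fit(8.0))
```

The file size is only a lower bound on memory use, so treat the suggestion as a starting point rather than a guarantee.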

My thanks to the authors of the original models. Your work is incredible. Have a good time 🖤

Format: GGUF
Model size: 8.03B params
Architecture: llama
