Triangle104
/

Q2.5-32B-Slush-Q4_K_S-GGUF

Not-For-All-Audiences

Inference Endpoints

Model card Files Files and versions Community

Triangle104 commited on 28 days ago

Commit

89257ff

·

verified ·

1 Parent(s): ff9562b

Update README.md

Files changed (1) hide show

README.md +4 -0

README.md CHANGED Viewed

@@ -20,6 +20,9 @@ license: apache-2.0
 This model was converted to GGUF format from [`crestf411/Q2.5-32B-Slush`](https://huggingface.co/crestf411/Q2.5-32B-Slush) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
 Refer to the [original model card](https://huggingface.co/crestf411/Q2.5-32B-Slush) for more details on the model.
 Slush is a two-stage model trained with high LoRA dropout, where stage 1 is a pretraining continuation on the base model, aimed at boosting the model's creativity and writing capabilities. This is then merged into the instruction tune model, and stage 2 is a fine tuning step on top of this to further enhance its roleplaying capabilities and/or to repair any damage caused in the stage 1 merge.
 This is still early stage. As always, feedback is welcome, and begone if you demand perfection.
@@ -84,6 +87,7 @@ parameters:
 tokenizer_source: Qwen/Qwen2.5-32B-Instruct
 dtype: bfloat16
 ## Use with llama.cpp
 Install llama.cpp through brew (works on Mac and Linux)

 This model was converted to GGUF format from [`crestf411/Q2.5-32B-Slush`](https://huggingface.co/crestf411/Q2.5-32B-Slush) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
 Refer to the [original model card](https://huggingface.co/crestf411/Q2.5-32B-Slush) for more details on the model.
+---
+Model details:
+-
 Slush is a two-stage model trained with high LoRA dropout, where stage 1 is a pretraining continuation on the base model, aimed at boosting the model's creativity and writing capabilities. This is then merged into the instruction tune model, and stage 2 is a fine tuning step on top of this to further enhance its roleplaying capabilities and/or to repair any damage caused in the stage 1 merge.
 This is still early stage. As always, feedback is welcome, and begone if you demand perfection.
 tokenizer_source: Qwen/Qwen2.5-32B-Instruct
 dtype: bfloat16
+---
 ## Use with llama.cpp
 Install llama.cpp through brew (works on Mac and Linux)