DavidAU
/

DeepSeek-R1-Distill-Llama-3.1-16.5B-Brainstorm-gguf

Model card Files Files and versions Community

DavidAU commited on about 10 hours ago

Commit

208b3d6

·

verified ·

1 Parent(s): b3694b4

Update README.md

Files changed (1) hide show

README.md +4 -2

README.md CHANGED Viewed

@@ -50,16 +50,18 @@ The "thinking/reasoning" tech (for the model at this repo) is from the original
 [ https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-8B ]
-In this case, Brainstorm 40x module was grafted directly onto "DeepSeek-R1-Distill-Llama-8B".
 For a completely different model (horror/creative), with only Deepseek's "thinking/reasoning" tech grafted into it see:
 [ https://huggingface.co/DavidAU/DeepSeek-Grand-Horror-SMB-R1-Distill-Llama-3.1-16B-GGUF ]
-V2 (larger, more uncensored, better "thought/reasoning" Deepseek function):
 [ https://huggingface.co/DavidAU/DeepSeek-V2-Grand-Horror-SMB-R1-Distill-Llama-3.1-Uncensored-16.5B-GGUF ]
 <b>CRITICAL SETTINGS:</B>
 1. Set Temp between 0 and .8, higher than this "think" functions will not activate. The most "stable" temp seems to be .6, with a variance of +-0.05. Lower for more "logic" reasoning, raise it for more "creative" reasoning (max .8 or so). Also set context to at least 4096, to account for "thoughts" generation.

 [ https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-8B ]
+In this case, Brainstorm 40x module was grafted directly onto "DeepSeek-R1-Distill-Llama-8B" bringing it up to 72 layers, 16.5B parameters.
 For a completely different model (horror/creative), with only Deepseek's "thinking/reasoning" tech grafted into it see:
 [ https://huggingface.co/DavidAU/DeepSeek-Grand-Horror-SMB-R1-Distill-Llama-3.1-16B-GGUF ]
+V2 (larger, more uncensored, better "thought/reasoning" Deepseek functions):
 [ https://huggingface.co/DavidAU/DeepSeek-V2-Grand-Horror-SMB-R1-Distill-Llama-3.1-Uncensored-16.5B-GGUF ]
+The Grand Horrors retain all of their "horror/creative power" and are augmented with Deepseek's "reasoning/thinking" systems.
 <b>CRITICAL SETTINGS:</B>
 1. Set Temp between 0 and .8, higher than this "think" functions will not activate. The most "stable" temp seems to be .6, with a variance of +-0.05. Lower for more "logic" reasoning, raise it for more "creative" reasoning (max .8 or so). Also set context to at least 4096, to account for "thoughts" generation.