DavidAU committed on
Commit
208b3d6
·
verified ·
1 Parent(s): b3694b4

Update README.md

Files changed (1)
  1. README.md +4 -2
README.md CHANGED
@@ -50,16 +50,18 @@ The "thinking/reasoning" tech (for the model at this repo) is from the original
 
  [ https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-8B ]
 
- In this case, Brainstorm 40x module was grafted directly onto "DeepSeek-R1-Distill-Llama-8B".
+ In this case, Brainstorm 40x module was grafted directly onto "DeepSeek-R1-Distill-Llama-8B" bringing it up to 72 layers, 16.5B parameters.
 
  For a completely different model (horror/creative), with only Deepseek's "thinking/reasoning" tech grafted into it see:
 
  [ https://huggingface.co/DavidAU/DeepSeek-Grand-Horror-SMB-R1-Distill-Llama-3.1-16B-GGUF ]
 
- V2 (larger, more uncensored, better "thought/reasoning" Deepseek function):
+ V2 (larger, more uncensored, better "thought/reasoning" Deepseek functions):
 
  [ https://huggingface.co/DavidAU/DeepSeek-V2-Grand-Horror-SMB-R1-Distill-Llama-3.1-Uncensored-16.5B-GGUF ]
 
+ The Grand Horrors retain all of their "horror/creative power" and are augmented with Deepseek's "reasoning/thinking" systems.
+
  <b>CRITICAL SETTINGS:</B>
 
  1. Set Temp between 0 and .8, higher than this "think" functions will not activate. The most "stable" temp seems to be .6, with a variance of +-0.05. Lower for more "logic" reasoning, raise it for more "creative" reasoning (max .8 or so). Also set context to at least 4096, to account for "thoughts" generation.
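
The temperature and context guidance in the settings note above can be turned into a small pre-flight check. The sketch below is only an illustration: `SamplerSettings` and `check_settings` are hypothetical names, not part of any library or of this repo, and the numeric limits are copied from the note (temperature 0 to .8, .6 as the reported stable value, context of at least 4096).

```python
from dataclasses import dataclass

# Limits copied from the "CRITICAL SETTINGS" note; the names are illustrative.
TEMP_MIN = 0.0
TEMP_MAX = 0.8      # above this, "think" functions reportedly do not activate
TEMP_STABLE = 0.6   # reported as the most stable temperature
MIN_CONTEXT = 4096  # room for "thoughts" generation


@dataclass(frozen=True)
class SamplerSettings:
    """Hypothetical container for the two settings the note covers."""
    temperature: float = TEMP_STABLE
    context_length: int = MIN_CONTEXT


def check_settings(settings: SamplerSettings) -> list[str]:
    """Return warnings; an empty list means the settings follow the note."""
    warnings = []
    if not (TEMP_MIN <= settings.temperature <= TEMP_MAX):
        warnings.append(
            f"temperature {settings.temperature} is outside "
            f"[{TEMP_MIN}, {TEMP_MAX}]; 'think' functions may not activate"
        )
    if settings.context_length < MIN_CONTEXT:
        warnings.append(
            f"context_length {settings.context_length} is below {MIN_CONTEXT}; "
            "there may not be room for 'thoughts' generation"
        )
    return warnings


if __name__ == "__main__":
    for w in check_settings(SamplerSettings(temperature=1.0, context_length=2048)):
        print("warning:", w)
```

The validated values would then be passed to whichever inference front end is in use; how that is done depends on the front end and is not covered by the note.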