Text Generation
GGUF
English
creative
creative writing
fiction writing
plot generation
sub-plot generation
story generation
scene continue
storytelling
fiction story
science fiction
romance
all genres
story
writing
vivid prosing
vivid writing
fiction
roleplaying
bfloat16
swearing
role play
sillytavern
backyard
horror
llama 3.1
context 128k
mergekit
Inference Endpoints
conversational
Update README.md
Browse files
README.md
CHANGED
@@ -50,16 +50,18 @@ The "thinking/reasoning" tech (for the model at this repo) is from the original
|
|
50 |
|
51 |
[ https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-8B ]
|
52 |
|
53 |
-
In this case, Brainstorm 40x module was grafted directly onto "DeepSeek-R1-Distill-Llama-8B".
|
54 |
|
55 |
For a completely different model (horror/creative), with only Deepseek's "thinking/reasoning" tech grafted into it see:
|
56 |
|
57 |
[ https://huggingface.co/DavidAU/DeepSeek-Grand-Horror-SMB-R1-Distill-Llama-3.1-16B-GGUF ]
|
58 |
|
59 |
-
V2 (larger, more uncensored, better "thought/reasoning" Deepseek
|
60 |
|
61 |
[ https://huggingface.co/DavidAU/DeepSeek-V2-Grand-Horror-SMB-R1-Distill-Llama-3.1-Uncensored-16.5B-GGUF ]
|
62 |
|
|
|
|
|
63 |
<b>CRITICAL SETTINGS:</B>
|
64 |
|
65 |
1. Set Temp between 0 and .8, higher than this "think" functions will not activate. The most "stable" temp seems to be .6, with a variance of +-0.05. Lower for more "logic" reasoning, raise it for more "creative" reasoning (max .8 or so). Also set context to at least 4096, to account for "thoughts" generation.
|
|
|
50 |
|
51 |
[ https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-8B ]
|
52 |
|
53 |
+
In this case, Brainstorm 40x module was grafted directly onto "DeepSeek-R1-Distill-Llama-8B" bringing it up to 72 layers, 16.5B parameters.
|
54 |
|
55 |
For a completely different model (horror/creative), with only Deepseek's "thinking/reasoning" tech grafted into it see:
|
56 |
|
57 |
[ https://huggingface.co/DavidAU/DeepSeek-Grand-Horror-SMB-R1-Distill-Llama-3.1-16B-GGUF ]
|
58 |
|
59 |
+
V2 (larger, more uncensored, better "thought/reasoning" Deepseek functions):
|
60 |
|
61 |
[ https://huggingface.co/DavidAU/DeepSeek-V2-Grand-Horror-SMB-R1-Distill-Llama-3.1-Uncensored-16.5B-GGUF ]
|
62 |
|
63 |
+
The Grand Horrors retain all of their "horror/creative power" and are augmented with Deepseek's "reasoning/thinking" systems.
|
64 |
+
|
65 |
<b>CRITICAL SETTINGS:</B>
|
66 |
|
67 |
1. Set Temp between 0 and .8, higher than this "think" functions will not activate. The most "stable" temp seems to be .6, with a variance of +-0.05. Lower for more "logic" reasoning, raise it for more "creative" reasoning (max .8 or so). Also set context to at least 4096, to account for "thoughts" generation.
|