Anthonyg5005
commited on
Commit
·
ada1508
1
Parent(s):
23d4a77
Update README.md
Browse files
README.md
CHANGED
@@ -25,22 +25,27 @@ This repository includes the quantized models for the [meow](https://huggingface
|
|
25 |
|
26 |
## Current models
|
27 |
|
28 |
-
| exl2 BPW | Model Branch | Model Size |
|
29 |
-
|
30 |
-
| 2-Bit | main | 3.28 GB |
|
31 |
-
| 4-Bit | 4bit | 5.61 GB |
|
32 |
-
| 5-Bit | 5bit | 6.92 GB |
|
33 |
-
| 6-Bit | 6bit | 8.23 GB |
|
34 |
-
| 8-Bit | 8bit | 10.84 GB |
|
|
|
|
|
|
|
|
|
35 |
|
36 |
## Where to use
|
37 |
|
38 |
There are a couple places you can use an exl2 model, here are a few:
|
39 |
|
40 |
- [oobabooga's Text Gen Webui](https://github.com/oobabooga/text-generation-webui)
|
|
|
41 |
- [tabbyAPI](https://github.com/theroyallab/tabbyAPI)
|
42 |
- [ExUI](https://github.com/turboderp/exui)
|
43 |
- [KoboldAI](https://github.com/henk717/KoboldAI) (Clone repo, don't use snapshot)
|
44 |
|
45 |
# WARNING
|
46 |
-
Model cannot be used commercially due to the Alpaca dataset license.
|
|
|
25 |
|
26 |
## Current models
|
27 |
|
28 |
+
| exl2 BPW | Model Branch | Model Size | Minimum VRAM (4096 Context) |
|
29 |
+
|-|-|-|-|
|
30 |
+
| 2-Bit | main | 3.28 GB | 6GB GPU |
|
31 |
+
| 4-Bit | 4bit | 5.61 GB | 8GB GPU |
|
32 |
+
| 5-Bit | 5bit | 6.92 GB | 10GB GPU, 8GB with swap |
|
33 |
+
| 6-Bit | 6bit | 8.23 GB | 10GB GPU |
|
34 |
+
| 8-Bit | 8bit | 10.84 GB | 12GB GPU |
|
35 |
+
|
36 |
+
### Note
|
37 |
+
|
38 |
+
Using a 12GB Nvidia GeForce RTX 3060 I got on average around 20 tokens per second on the 8-bit quant in full 4096 context.
|
39 |
|
40 |
## Where to use
|
41 |
|
42 |
There are a couple places you can use an exl2 model, here are a few:
|
43 |
|
44 |
- [oobabooga's Text Gen Webui](https://github.com/oobabooga/text-generation-webui)
|
45 |
+
- When using the downloader, make sure to format like this: Anthonyg5005/rishiraj-meow-10.7B-exl2**\:QuantBranch**
|
46 |
- [tabbyAPI](https://github.com/theroyallab/tabbyAPI)
|
47 |
- [ExUI](https://github.com/turboderp/exui)
|
48 |
- [KoboldAI](https://github.com/henk717/KoboldAI) (Clone repo, don't use snapshot)
|
49 |
|
50 |
# WARNING
|
51 |
+
Model cannot be used commercially due to the Alpaca dataset license. Only use this model for research purposes or personal use.
|