Commit ada1508 by Anthonyg5005 · 1 Parent(s): 23d4a77

Update README.md

  1. README.md +13 -8
README.md CHANGED
@@ -25,22 +25,27 @@ This repository includes the quantized models for the [meow](https://huggingface
 
  ## Current models
 
- | exl2 BPW | Model Branch | Model Size |
- |-|-|-|
- | 2-Bit | main | 3.28 GB |
- | 4-Bit | 4bit | 5.61 GB |
- | 5-Bit | 5bit | 6.92 GB |
- | 6-Bit | 6bit | 8.23 GB |
- | 8-Bit | 8bit | 10.84 GB |
+ | exl2 BPW | Model Branch | Model Size | Minimum VRAM (4096 Context) |
+ |-|-|-|-|
+ | 2-Bit | main | 3.28 GB | 6GB GPU |
+ | 4-Bit | 4bit | 5.61 GB | 8GB GPU |
+ | 5-Bit | 5bit | 6.92 GB | 10GB GPU, 8GB with swap |
+ | 6-Bit | 6bit | 8.23 GB | 10GB GPU |
+ | 8-Bit | 8bit | 10.84 GB | 12GB GPU |
+
+ ### Note
+
+ Using a 12GB Nvidia GeForce RTX 3060 I got on average around 20 tokens per second on the 8-bit quant in full 4096 context.
 
  ## Where to use
 
  There are a couple places you can use an exl2 model, here are a few:
 
  - [oobabooga's Text Gen Webui](https://github.com/oobabooga/text-generation-webui)
+ - When using the downloader, make sure to format like this: Anthonyg5005/rishiraj-meow-10.7B-exl2**\:QuantBranch**
  - [tabbyAPI](https://github.com/theroyallab/tabbyAPI)
  - [ExUI](https://github.com/turboderp/exui)
  - [KoboldAI](https://github.com/henk717/KoboldAI) (Clone repo, don't use snapshot)
 
  # WARNING
- Model cannot be used commercially due to the Alpaca dataset license.
+ Model cannot be used commercially due to the Alpaca dataset license. Only use this model for research purposes or personal use.
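
The new VRAM column and the `repo:QuantBranch` downloader format can be tied together in a short script. What follows is a minimal sketch and not part of the commit. The helper names (`pick_branch`, `downloader_spec`, `download`) are hypothetical. The VRAM figures are copied from the table, and the 5-bit row is treated as needing 10 GB (the "8GB with swap" option is ignored). The `download` helper assumes the `huggingface_hub` package is installed and uses its `snapshot_download` function with the branch name as `revision`.

```python
from typing import Optional

# Repository named in the README's downloader note.
REPO_ID = "Anthonyg5005/rishiraj-meow-10.7B-exl2"

# (branch, minimum GPU VRAM in GB at 4096 context), copied from the table
# and ordered from smallest to largest quant. Assumption: the 5-bit row
# uses its 10 GB figure; the "8GB with swap" option is not modelled.
QUANTS = [
    ("main", 6),   # 2-bit
    ("4bit", 8),
    ("5bit", 10),
    ("6bit", 10),
    ("8bit", 12),
]


def pick_branch(vram_gb: float) -> Optional[str]:
    """Return the largest quant branch whose listed minimum VRAM fits, or None."""
    best = None
    for branch, min_vram in QUANTS:
        if vram_gb >= min_vram:
            best = branch  # later entries are larger quants, so keep the last fit
    return best


def downloader_spec(branch: str) -> str:
    """Build the `repo:branch` string described for the webui downloader."""
    return f"{REPO_ID}:{branch}"


def download(branch: str) -> str:
    """Fetch a single branch and return the local path (needs network access)."""
    # Imported lazily so the helpers above work without huggingface_hub installed.
    from huggingface_hub import snapshot_download
    return snapshot_download(repo_id=REPO_ID, revision=branch)
```

For example, `downloader_spec(pick_branch(8))` builds the string for the 4-bit branch, which can be pasted into the Text Gen Webui downloader. `pick_branch` returns `None` for cards below the smallest listed minimum of 6 GB.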