Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
RichardErkhov
/
wang7776_-_vicuna-7b-v1.3-attention-sparsity-20-gguf
like
0
GGUF
Inference Endpoints
arxiv:
2306.11695
arxiv:
2302.13971
arxiv:
2306.05685
Model card
Files
Files and versions
Community
Deploy
Use this model
94a7251
wang7776_-_vicuna-7b-v1.3-attention-sparsity-20-gguf
1 contributor
History:
11 commits
RichardErkhov
uploaded model
94a7251
verified
7 months ago
.gitattributes
Safe
2.36 kB
uploaded model
7 months ago
vicuna-7b-v1.3-attention-sparsity-20.IQ3_M.gguf
Safe
3.11 GB
LFS
uploaded model
7 months ago
vicuna-7b-v1.3-attention-sparsity-20.IQ3_S.gguf
Safe
2.95 GB
LFS
uploaded model
7 months ago
vicuna-7b-v1.3-attention-sparsity-20.IQ3_XS.gguf
Safe
2.8 GB
LFS
uploaded model
7 months ago
vicuna-7b-v1.3-attention-sparsity-20.IQ4_XS.gguf
Safe
3.65 GB
LFS
uploaded model
7 months ago
vicuna-7b-v1.3-attention-sparsity-20.Q2_K.gguf
Safe
2.53 GB
LFS
uploaded model
7 months ago
vicuna-7b-v1.3-attention-sparsity-20.Q3_K.gguf
Safe
3.3 GB
LFS
uploaded model
7 months ago
vicuna-7b-v1.3-attention-sparsity-20.Q3_K_L.gguf
Safe
3.6 GB
LFS
uploaded model
7 months ago
vicuna-7b-v1.3-attention-sparsity-20.Q3_K_M.gguf
Safe
3.3 GB
LFS
uploaded model
7 months ago
vicuna-7b-v1.3-attention-sparsity-20.Q3_K_S.gguf
Safe
2.95 GB
LFS
uploaded model
7 months ago
vicuna-7b-v1.3-attention-sparsity-20.Q4_0.gguf
Safe
3.83 GB
LFS
uploaded model
7 months ago