RichardErkhov / wang7776_-_vicuna-7b-v1.3-attention-sparsity-20-gguf
Tags: GGUF, Inference Endpoints, arxiv:2306.11695, arxiv:2302.13971, arxiv:2306.05685
1 contributor, History: 5 commits
RichardErkhov, uploaded model, 967c14f (verified), 7 months ago
File                                              Size     Storage  Last commit     Updated
.gitattributes                                    1.86 kB           uploaded model  7 months ago
vicuna-7b-v1.3-attention-sparsity-20.IQ3_S.gguf   2.95 GB  LFS      uploaded model  7 months ago
vicuna-7b-v1.3-attention-sparsity-20.IQ3_XS.gguf  2.8 GB   LFS      uploaded model  7 months ago
vicuna-7b-v1.3-attention-sparsity-20.Q2_K.gguf    2.53 GB  LFS      uploaded model  7 months ago
vicuna-7b-v1.3-attention-sparsity-20.Q3_K_S.gguf  2.95 GB  LFS      uploaded model  7 months ago