rirv938/GPTQ-LLaMa-30B-4bit-triton-g128
Tags: Text Generation, Transformers, llama, Inference Endpoints
Branch: main
2 contributors
History: 2 commits
Latest commit: "add files" (51ad9f1) by robert, over 1 year ago
File                                  Scan  Size           Last commit     Updated
.gitattributes                        Safe  1.48 kB        initial commit  over 1 year ago
LLaMa-30B-GPTQ-4bit-g128.safetensors  Safe  17.5 GB (LFS)  add files       over 1 year ago
config.json                           Safe  503 Bytes      add files       over 1 year ago
generation_config.json                Safe  137 Bytes      add files       over 1 year ago
special_tokens_map.json               Safe  411 Bytes      add files       over 1 year ago
tokenizer.json                        Safe  1.84 MB        add files       over 1 year ago
tokenizer_config.json                 Safe  700 Bytes      add files       over 1 year ago