Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
neuralmagic
/
TinyLlama-1.1B-Chat-v1.0-marlin
like
1
Follow
Neural Magic
308
Text Generation
Transformers
Safetensors
llama
nm-vllm
marlin
int4
conversational
text-generation-inference
Inference Endpoints
4-bit precision
gptq
arxiv:
2210.17323
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
TinyLlama-1.1B-Chat-v1.0-marlin
Commit History
Update quantization/apply_gptq_save_marlin.py
29715d8
verified
robertgshaw2
commited on
Mar 6, 2024
Update README.md
29e8c23
verified
robertgshaw2
commited on
Mar 6, 2024
Create requirements.txt
c7713ac
verified
robertgshaw2
commited on
Mar 6, 2024
Create quantization/apply_gptq_save_marlin.py
9d40424
verified
robertgshaw2
commited on
Mar 6, 2024
Update README.md
bd74ab9
verified
robertgshaw2
commited on
Mar 6, 2024
Create README.md
8680e42
verified
robertgshaw2
commited on
Mar 6, 2024
Upload folder using huggingface_hub
5059cf5
verified
mgoin
commited on
Mar 5, 2024
initial commit
15de72b
verified
mgoin
commited on
Mar 5, 2024