YAML Metadata
Warning:
empty or missing yaml metadata in repo card
(https://huggingface.co/docs/hub/model-cards#model-card-metadata)
Took 42 hours to quantize on 4xA40s, at a batch size of 128. I could've went higher, but hindsight. At that batch size, it was using about 25-30 GiB per GPU, utilization remained at 100%.
- Downloads last month
- 85
Inference Providers
NEW
This model is not currently available via any of the supported third-party Inference Providers, and
the model is not deployed on the HF Inference API.