This EXL2 quant matches the same bpw as mmnga's q4_K_M GGUF Like TheBloke, used shisa-en-ja-dpo-v1 dataset for calibration.
Main model: https://huggingface.co/augmxnt/shisa-7b-v1
For other quants (EXL2, AWQ, GGUF, etc) see: https://huggingface.co/augmxnt/shisa-7b-v1/discussions/2
- Downloads last month
- 12
Inference Providers
NEW
This model is not currently available via any of the supported third-party Inference Providers, and
the model is not deployed on the HF Inference API.