This model was converted to FP8 format from mistralai/Mistral-Small-Instruct-2409 using vLLM's llmcompressor library. Refer to the original model card for more details on the model.
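
The exact conversion script is not included in this card. The sketch below follows the standard llmcompressor FP8-dynamic recipe (FP8 E4M3 weights with per-channel scales, activations quantized on the fly with per-token scales); the output directory name is illustrative.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from llmcompressor.modifiers.quantization import QuantizationModifier
from llmcompressor.transformers import oneshot

MODEL_ID = "mistralai/Mistral-Small-Instruct-2409"

# Load the original checkpoint in its native precision.
model = AutoModelForCausalLM.from_pretrained(MODEL_ID, device_map="auto", torch_dtype="auto")
tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)

# FP8_DYNAMIC: FP8 (E4M3) weights with static per-channel scales, activations
# quantized at runtime with per-token scales. lm_head stays unquantized.
recipe = QuantizationModifier(targets="Linear", scheme="FP8_DYNAMIC", ignore=["lm_head"])

# One-shot post-training quantization; no calibration data is required
# because activation scales are computed dynamically.
oneshot(model=model, recipe=recipe)

# Save the compressed checkpoint (directory name is illustrative).
SAVE_DIR = "Mistral-Small-Instruct-2409-FP8-Dynamic"
model.save_pretrained(SAVE_DIR, save_compressed=True)
tokenizer.save_pretrained(SAVE_DIR)
```

Because activation scales are computed at inference time, the dynamic scheme skips calibration entirely, which makes it a convenient drop-in conversion for an instruct model like this one.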

The weights are stored in Safetensors format: 22.3B params, tensor types FP16 and F8_E4M3.

Model: tolgaakar/Mistral-Small-Instruct-2409-FP8-Dynamic, quantized from mistralai/Mistral-Small-Instruct-2409.
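
FP8 checkpoints produced by llmcompressor load directly in vLLM, which reads the compressed-tensors quantization config from the repo. A minimal offline-inference sketch (model name taken from this repo; sampling settings and the context cap are illustrative):

```python
from transformers import AutoTokenizer
from vllm import LLM, SamplingParams

MODEL = "tolgaakar/Mistral-Small-Instruct-2409-FP8-Dynamic"

# Build the prompt with the model's own chat template.
tokenizer = AutoTokenizer.from_pretrained(MODEL)
messages = [{"role": "user", "content": "Summarize FP8 dynamic quantization in two sentences."}]
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)

# max_model_len is an illustrative cap for memory headroom, not a requirement of the model.
llm = LLM(model=MODEL, max_model_len=8192)
outputs = llm.generate([prompt], SamplingParams(temperature=0.3, max_tokens=256))
print(outputs[0].outputs[0].text)
```

The same checkpoint can also be served with `vllm serve tolgaakar/Mistral-Small-Instruct-2409-FP8-Dynamic` for an OpenAI-compatible endpoint.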