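This model was converted with RKLLM. For deployment instructions, see the RKLLM large language model deployment tutorial (RKLLM部署语言大模型教程).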
Model | Memory usage | Model size | Quantization type |
---|---|---|---|
Qwen2.5-3B-Instruct-RKLLM1.1.4 | 3.7GB | 3.48GB | w8a8 |