Update README.md
README.md CHANGED
@@ -47,3 +47,10 @@ We provide some qualitative comparison between FastHunyuan 6 step inference v.s.
 |  |  |
 |  |  |
+## Memory requirements
+
+For inference, we now support NF4 and LLM-INT8 quantized inference for FastHunyuan using BitsAndBytes. With NF4 quantization, inference can be performed on a single RTX 4090 GPU, requiring just 20 GB of VRAM.
+
+For LoRA finetuning, the minimum hardware requirements are:
+- 40 GB GPU memory each for 2 GPUs with LoRA
+- 30 GB GPU memory each for 2 GPUs with CPU offload and LoRA.
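
The added section mentions NF4 quantized inference with BitsAndBytes. Below is a minimal loading sketch using the Diffusers BitsAndBytes integration; the checkpoint id `FastVideo/FastHunyuan-diffusers`, the resolution, frame count, and the 6-step setting are illustrative assumptions, not details confirmed by this diff.

```python
import torch
from diffusers import BitsAndBytesConfig, HunyuanVideoPipeline, HunyuanVideoTransformer3DModel
from diffusers.utils import export_to_video

# NF4 4-bit quantization via BitsAndBytes; for LLM-INT8, use load_in_8bit=True instead.
quant_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

# Checkpoint id is an assumption for illustration; point it at your FastHunyuan weights.
model_id = "FastVideo/FastHunyuan-diffusers"

# Quantize the video transformer, which dominates VRAM usage.
transformer = HunyuanVideoTransformer3DModel.from_pretrained(
    model_id,
    subfolder="transformer",
    quantization_config=quant_config,
    torch_dtype=torch.bfloat16,
)

pipe = HunyuanVideoPipeline.from_pretrained(
    model_id,
    transformer=transformer,
    torch_dtype=torch.bfloat16,
)
pipe.enable_model_cpu_offload()  # keep peak VRAM within a single RTX 4090
pipe.vae.enable_tiling()         # reduce VAE decode memory for video frames

video = pipe(
    prompt="A cat walks on the grass, realistic style.",
    height=720,
    width=1280,
    num_frames=45,
    num_inference_steps=6,  # few-step inference, per the FastHunyuan comparison above
).frames[0]
export_to_video(video, "output.mp4", fps=24)
```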