qaihm-bot commited on
Commit
56c3940
·
verified ·
1 Parent(s): c6486fa

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +11 -4
README.md CHANGED
@@ -36,8 +36,8 @@ More details on model performance across various devices, can be found
36
 
37
  | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Precision | Primary Compute Unit | Target Model
38
  | ---|---|---|---|---|---|---|---|
39
- | Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | TFLite | 602.022 ms | 0 - 428 MB | FP16 | GPU | [WhisperEncoder.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperEncoder.tflite)
40
- | Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | TFLite | 45.479 ms | 8 - 11 MB | FP16 | NPU | [WhisperDecoder.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.tflite)
41
 
42
 
43
  ## Installation
@@ -99,10 +99,17 @@ python -m qai_hub_models.models.whisper_small_en.export
99
  Profile Job summary of WhisperEncoder
100
  --------------------------------------------------
101
  Device: Samsung Galaxy S24 (14)
102
- Estimated Inference Time: 461.60 ms
103
- Estimated Peak Memory Range: 13.51-44.51 MB
104
  Compute Units: GPU (609) | Total (609)
105
 
 
 
 
 
 
 
 
106
 
107
  ```
108
  ## How does this work?
 
36
 
37
  | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Precision | Primary Compute Unit | Target Model
38
  | ---|---|---|---|---|---|---|---|
39
+ | Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | TFLite | 598.326 ms | 91 - 511 MB | FP16 | GPU | [WhisperEncoder.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperEncoder.tflite)
40
+ | Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | TFLite | 46.381 ms | 15 - 19 MB | FP16 | NPU | [WhisperDecoder.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.tflite)
41
 
42
 
43
  ## Installation
 
99
  Profile Job summary of WhisperEncoder
100
  --------------------------------------------------
101
  Device: Samsung Galaxy S24 (14)
102
+ Estimated Inference Time: 469.35 ms
103
+ Estimated Peak Memory Range: 26.94-57.31 MB
104
  Compute Units: GPU (609) | Total (609)
105
 
106
+ Profile Job summary of WhisperDecoder
107
+ --------------------------------------------------
108
+ Device: Samsung Galaxy S24 (14)
109
+ Estimated Inference Time: 34.41 ms
110
+ Estimated Peak Memory Range: 19.25-1636.84 MB
111
+ Compute Units: NPU (903),CPU (2) | Total (905)
112
+
113
 
114
  ```
115
  ## How does this work?