oovword
/

whisper-uk2en-speech-translation

automatic-speech-recognition

speech-translation

Inference Endpoints

Model card Files Files and versions Community

oovword commited on Dec 23, 2024

Commit

0f7bb32

·

verified ·

1 Parent(s): 0dac06f

Update README.md

Files changed (1) hide show

README.md +26 -0

README.md CHANGED Viewed

@@ -111,6 +111,32 @@ The model has been fine-tuned on a mix of authentic human and synthetic speech a
 - num_train_epochs: 3 (975 training steps)
 - warmup_steps: 0
 ## License
 The fine-tuned model is licensed under the same Apache-2.0 license agreement as the original `openai/whisper-small` checkpoint.

 - num_train_epochs: 3 (975 training steps)
 - warmup_steps: 0
+The table below demonstrates the values of both training and validation losses as well as the BLEU score calculated on the development set during the fine-tuning. The model converged at step 900, or approximately epoch 3, and clearly started to overfit the dataset afterwards.
+| Step | Training loss | Validation loss | BLEU |
+| :---: | :---: | :---: | :---: |
+| 100 | 2.491100 | 2.007935 | 21.813000 |
+| 200 | 1.600800 | 1.383696 | 23.344800 |
+| 300 | 1.430900 | 1.309672 | 23.846300 |
+| 400 | 1.320600 | 1.268230 | 23.911000 |
+| 500 | 1.289200 | 1.248684 | 24.192300 |
+| 600 | 1.243800 | 1.239911 | 24.385900 |
+| 700 | 1.194200 | 1.207502 | 23.941100 |
+| 800 | 1.170800 | 1.211733 | 24.888100 |
+| 900 | 1.143800 | 1.199629 | 24.946900 |
+| 1000 | 1.153400 | 1.206929 | 24.919100 |
+| 1100 | 1.119200 | 1.201825 | 24.597300 |
+## Evaluation
+Both original and fine-tuned checkpoints have been evaluated on the test split of the dataset. The selected evaluation metrics are BLEU and ChrF++ implemented in `sacrebleu` library.
+| Model | BLEU | ChrF++ |
+| :---: | :---: | :---: |
+| `whisper-small` | 16.36 | 43.81 |
+| `checkpoint-900` | 22.34 | 48.1 |
+The fine-tuning improved the model's performance compared to the baseline score by almost 6 points.
 ## License
 The fine-tuned model is licensed under the same Apache-2.0 license agreement as the original `openai/whisper-small` checkpoint.