Padomin
/

t5-base-TEDxJP-8front-1body-8rear

+---
+license: cc-by-sa-4.0
+tags:
+- generated_from_trainer
+datasets:
+- te_dx_jp
+model-index:
+- name: t5-base-TEDxJP-8front-1body-8rear
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# t5-base-TEDxJP-8front-1body-8rear
+This model is a fine-tuned version of [sonoisa/t5-base-japanese](https://huggingface.co/sonoisa/t5-base-japanese) on the te_dx_jp dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.4359
+- Wer: 0.1693
+- Mer: 0.1635
+- Wil: 0.2485
+- Wip: 0.7515
+- Hits: 55936
+- Substitutions: 6242
+- Deletions: 2409
+- Insertions: 2283
+- Cer: 0.1334
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 0.0001
+- train_batch_size: 32
+- eval_batch_size: 32
+- seed: 42
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- lr_scheduler_warmup_ratio: 0.1
+- num_epochs: 10
+### Training results
+| Training Loss | Epoch | Step  | Validation Loss | Wer    | Mer    | Wil    | Wip    | Hits  | Substitutions | Deletions | Insertions | Cer    |
+|:-------------:|:-----:|:-----:|:---------------:|:------:|:------:|:------:|:------:|:-----:|:-------------:|:---------:|:----------:|:------:|
+| 0.5891        | 1.0   | 1457  | 0.4696          | 0.2065 | 0.1954 | 0.2843 | 0.7157 | 54931 | 6665          | 2991      | 3684       | 0.1713 |
+| 0.4787        | 2.0   | 2914  | 0.4172          | 0.1767 | 0.1707 | 0.2566 | 0.7434 | 55432 | 6311          | 2844      | 2256       | 0.1417 |
+| 0.4425        | 3.0   | 4371  | 0.4078          | 0.1727 | 0.1667 | 0.2528 | 0.7472 | 55754 | 6342          | 2491      | 2320       | 0.1362 |
+| 0.3887        | 4.0   | 5828  | 0.4101          | 0.1718 | 0.1657 | 0.2521 | 0.7479 | 55850 | 6367          | 2370      | 2356       | 0.1346 |
+| 0.3567        | 5.0   | 7285  | 0.4120          | 0.1708 | 0.1648 | 0.2505 | 0.7495 | 55926 | 6315          | 2346      | 2371       | 0.1333 |
+| 0.3157        | 6.0   | 8742  | 0.4174          | 0.1710 | 0.1652 | 0.2510 | 0.7490 | 55805 | 6313          | 2469      | 2260       | 0.1346 |
+| 0.2958        | 7.0   | 10199 | 0.4277          | 0.1700 | 0.1641 | 0.2493 | 0.7507 | 55934 | 6263          | 2390      | 2330       | 0.1339 |
+| 0.3088        | 8.0   | 11656 | 0.4268          | 0.1687 | 0.1631 | 0.2483 | 0.7517 | 55890 | 6252          | 2445      | 2198       | 0.1329 |
+| 0.2725        | 9.0   | 13113 | 0.4352          | 0.1697 | 0.1639 | 0.2490 | 0.7510 | 55911 | 6247          | 2429      | 2287       | 0.1329 |
+| 0.2588        | 10.0  | 14570 | 0.4359          | 0.1693 | 0.1635 | 0.2485 | 0.7515 | 55936 | 6242          | 2409      | 2283       | 0.1334 |
+### Framework versions
+- Transformers 4.21.2
+- Pytorch 1.12.1+cu116
+- Datasets 2.4.0
+- Tokenizers 0.12.1