Tiance Wang commited on
Commit
2c2e885
·
1 Parent(s): 249fb17

upload models and results

Browse files
This view is limited to 50 files because it contains too many changes.   See raw diff
Files changed (50) hide show
  1. data/lang_bpe_500/bpe.model +3 -0
  2. data/lang_phone/uniq_lexicon.txt +0 -0
  3. large_bpe_500/decode_results/fast_beam_search/errs-test-clean-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam.txt +0 -0
  4. large_bpe_500/decode_results/fast_beam_search/errs-test-other-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam.txt +0 -0
  5. large_bpe_500/decode_results/fast_beam_search/log-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam-2023-01-16-09-53-08 +107 -0
  6. large_bpe_500/decode_results/fast_beam_search/recogs-test-clean-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam.txt +0 -0
  7. large_bpe_500/decode_results/fast_beam_search/recogs-test-other-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam.txt +0 -0
  8. large_bpe_500/decode_results/fast_beam_search/wer-summary-test-clean-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam.txt +2 -0
  9. large_bpe_500/decode_results/fast_beam_search/wer-summary-test-other-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam.txt +2 -0
  10. large_bpe_500/decode_results/greedy_search/errs-test-clean-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt +0 -0
  11. large_bpe_500/decode_results/greedy_search/errs-test-other-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt +0 -0
  12. large_bpe_500/decode_results/greedy_search/log-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam-2023-01-16-09-52-16 +103 -0
  13. large_bpe_500/decode_results/greedy_search/recogs-test-clean-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt +0 -0
  14. large_bpe_500/decode_results/greedy_search/recogs-test-other-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt +0 -0
  15. large_bpe_500/decode_results/greedy_search/wer-summary-test-clean-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt +2 -0
  16. large_bpe_500/decode_results/greedy_search/wer-summary-test-other-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt +2 -0
  17. large_bpe_500/decode_results/modified_beam_search/errs-test-clean-beam_size_4-epoch-30-avg-1-modified_beam_search-beam-size-4-uam.txt +0 -0
  18. large_bpe_500/decode_results/modified_beam_search/errs-test-other-beam_size_4-epoch-30-avg-1-modified_beam_search-beam-size-4-uam.txt +0 -0
  19. large_bpe_500/decode_results/modified_beam_search/log-epoch-30-avg-1-modified_beam_search-beam-size-4-uam-2023-01-16-09-54-54 +107 -0
  20. large_bpe_500/decode_results/modified_beam_search/recogs-test-clean-beam_size_4-epoch-30-avg-1-modified_beam_search-beam-size-4-uam.txt +0 -0
  21. large_bpe_500/decode_results/modified_beam_search/recogs-test-other-beam_size_4-epoch-30-avg-1-modified_beam_search-beam-size-4-uam.txt +0 -0
  22. large_bpe_500/decode_results/modified_beam_search/wer-summary-test-clean-beam_size_4-epoch-30-avg-1-modified_beam_search-beam-size-4-uam.txt +2 -0
  23. large_bpe_500/decode_results/modified_beam_search/wer-summary-test-other-beam_size_4-epoch-30-avg-1-modified_beam_search-beam-size-4-uam.txt +2 -0
  24. large_bpe_500/exp/cpu_jit.pt +3 -0
  25. large_bpe_500/exp/log/log-train-2023-01-09-06-23-27 +0 -0
  26. large_bpe_500/exp/pretrained.pt +3 -0
  27. large_bpe_500/exp/tensorboard/events.out.tfevents.1673245407.kao-dgxa-f12-u17.4075883.0 +3 -0
  28. middle_bpe_500/decode_results/fast_beam_search/errs-test-clean-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam.txt +0 -0
  29. middle_bpe_500/decode_results/fast_beam_search/errs-test-other-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam.txt +0 -0
  30. middle_bpe_500/decode_results/fast_beam_search/log-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam-2023-01-09-01-15-28 +107 -0
  31. middle_bpe_500/decode_results/fast_beam_search/recogs-test-clean-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam.txt +0 -0
  32. middle_bpe_500/decode_results/fast_beam_search/recogs-test-other-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam.txt +0 -0
  33. middle_bpe_500/decode_results/fast_beam_search/wer-summary-test-clean-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam.txt +2 -0
  34. middle_bpe_500/decode_results/fast_beam_search/wer-summary-test-other-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam.txt +2 -0
  35. middle_bpe_500/decode_results/greedy_search/errs-test-clean-greedy_search-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam.txt +0 -0
  36. middle_bpe_500/decode_results/greedy_search/errs-test-other-greedy_search-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam.txt +0 -0
  37. middle_bpe_500/decode_results/greedy_search/log-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam-2023-01-09-01-12-47 +103 -0
  38. middle_bpe_500/decode_results/greedy_search/recogs-test-clean-greedy_search-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam.txt +0 -0
  39. middle_bpe_500/decode_results/greedy_search/recogs-test-other-greedy_search-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam.txt +0 -0
  40. middle_bpe_500/decode_results/greedy_search/wer-summary-test-clean-greedy_search-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam.txt +2 -0
  41. middle_bpe_500/decode_results/greedy_search/wer-summary-test-other-greedy_search-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam.txt +2 -0
  42. middle_bpe_500/decode_results/modified_beam_search/errs-test-clean-beam_size_4-epoch-30-avg-2-modified_beam_search-beam-size-4-uam.txt +0 -0
  43. middle_bpe_500/decode_results/modified_beam_search/errs-test-other-beam_size_4-epoch-30-avg-2-modified_beam_search-beam-size-4-uam.txt +0 -0
  44. middle_bpe_500/decode_results/modified_beam_search/log-epoch-30-avg-2-modified_beam_search-beam-size-4-uam-2023-01-09-01-17-06 +107 -0
  45. middle_bpe_500/decode_results/modified_beam_search/recogs-test-clean-beam_size_4-epoch-30-avg-2-modified_beam_search-beam-size-4-uam.txt +0 -0
  46. middle_bpe_500/decode_results/modified_beam_search/recogs-test-other-beam_size_4-epoch-30-avg-2-modified_beam_search-beam-size-4-uam.txt +0 -0
  47. middle_bpe_500/decode_results/modified_beam_search/wer-summary-test-clean-beam_size_4-epoch-30-avg-2-modified_beam_search-beam-size-4-uam.txt +2 -0
  48. middle_bpe_500/decode_results/modified_beam_search/wer-summary-test-other-beam_size_4-epoch-30-avg-2-modified_beam_search-beam-size-4-uam.txt +2 -0
  49. middle_bpe_500/exp/cpu_jit.pt +3 -0
  50. middle_bpe_500/exp/log/log-train-2023-01-06-07-16-15 +0 -0
data/lang_bpe_500/bpe.model ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5db9f109de8776c78b1a52bc69a9694acc09bb4c11373e61794e20b24f6e244d
3
+ size 244891
data/lang_phone/uniq_lexicon.txt ADDED
The diff for this file is too large to render. See raw diff
 
large_bpe_500/decode_results/fast_beam_search/errs-test-clean-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam.txt ADDED
The diff for this file is too large to render. See raw diff
 
large_bpe_500/decode_results/fast_beam_search/errs-test-other-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam.txt ADDED
The diff for this file is too large to render. See raw diff
 
large_bpe_500/decode_results/fast_beam_search/log-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam-2023-01-16-09-53-08 ADDED
@@ -0,0 +1,107 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ 2023-01-16 09:53:08,412 INFO [decode.py:565] Decoding started
2
+ 2023-01-16 09:53:08,412 INFO [decode.py:571] Device: cuda:0
3
+ 2023-01-16 09:53:08,589 INFO [lexicon.py:168] Loading pre-compiled data/lang_bpe_500/Linv.pt
4
+ 2023-01-16 09:53:08,631 INFO [decode.py:588] { 'activation': 'doubleswish',
5
+ 'avg': 1,
6
+ 'batch_idx_train': 0,
7
+ 'beam': 20.0,
8
+ 'beam_size': 4,
9
+ 'best_train_epoch': -1,
10
+ 'best_train_loss': inf,
11
+ 'best_valid_epoch': -1,
12
+ 'best_valid_loss': inf,
13
+ 'blank_id': 0,
14
+ 'bucketing_sampler': True,
15
+ 'channels': 400,
16
+ 'concatenate_cuts': False,
17
+ 'context_size': 2,
18
+ 'conv_layers': 18,
19
+ 'decoder_dim': 400,
20
+ 'decoding_method': 'fast_beam_search',
21
+ 'drop_last': True,
22
+ 'duration_factor': 1.0,
23
+ 'enable_musan': True,
24
+ 'enable_spec_aug': True,
25
+ 'encoder_dim': 400,
26
+ 'env_info': { 'IP address': '127.0.1.1',
27
+ 'hostname': 'kao-dgxa-f12-u17',
28
+ 'icefall-git-branch': 'tiny',
29
+ 'icefall-git-date': 'Fri Jan 13 07:21:29 2023',
30
+ 'icefall-git-sha1': '5c8e962-dirty',
31
+ 'icefall-path': '/home/jsong/git/icefall',
32
+ 'k2-build-type': 'Release',
33
+ 'k2-git-date': 'Fri Nov 25 08:23:51 2022',
34
+ 'k2-git-sha1': '1feafa064cf3b6c243e6b33b0192601224210937',
35
+ 'k2-path': '/home/jsong/miniconda3/envs/k2/lib/python3.9/site-packages/k2/__init__.py',
36
+ 'k2-version': '1.23.2',
37
+ 'k2-with-cuda': True,
38
+ 'lhotse-path': '/home/jsong/miniconda3/envs/k2/lib/python3.9/site-packages/lhotse/__init__.py',
39
+ 'lhotse-version': '1.7.0',
40
+ 'python-version': '3.9',
41
+ 'torch-cuda-available': True,
42
+ 'torch-cuda-version': '11.3',
43
+ 'torch-version': '1.12.0'},
44
+ 'epoch': 30,
45
+ 'exp_dir': PosixPath('tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug'),
46
+ 'feature_dim': 80,
47
+ 'full_libri': True,
48
+ 'gap': 1.0,
49
+ 'input_strategy': 'PrecomputedFeatures',
50
+ 'iter': 0,
51
+ 'joiner_dim': 400,
52
+ 'lang_dir': PosixPath('data/lang_bpe_500'),
53
+ 'log_interval': 500,
54
+ 'manifest_dir': PosixPath('data/fbank'),
55
+ 'max_contexts': 8,
56
+ 'max_duration': 600,
57
+ 'max_states': 64,
58
+ 'max_sym_per_frame': 1,
59
+ 'nbest_scale': 0.5,
60
+ 'ngram_lm_scale': 0.1,
61
+ 'num_buckets': 30,
62
+ 'num_paths': 100,
63
+ 'num_workers': 2,
64
+ 'on_the_fly_feats': False,
65
+ 'res_dir': PosixPath('tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/fast_beam_search'),
66
+ 'reset_interval': 200,
67
+ 'return_cuts': True,
68
+ 'shuffle': True,
69
+ 'skip_add': True,
70
+ 'spec_aug_time_warp_factor': 80,
71
+ 'subsampling_factor': 4,
72
+ 'suffix': 'epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam',
73
+ 'unk_id': 2,
74
+ 'use_averaged_model': True,
75
+ 'use_double_scores': True,
76
+ 'use_dscnn': True,
77
+ 'valid_interval': 9000,
78
+ 'vocab_size': 500,
79
+ 'warm_step': 5000}
80
+ 2023-01-16 09:53:08,631 INFO [decode.py:590] About to create model
81
+ 2023-01-16 09:53:11,209 INFO [train.py:426] Encoder MAC ops for 10 seconds of audio is 888.44M
82
+ 2023-01-16 09:53:11,213 INFO [decode.py:659] Calculating the averaged model over epoch range from 29 (excluded) to 30
83
+ 2023-01-16 09:53:11,613 INFO [decode.py:697] Number of model parameters: 4821330
84
+ 2023-01-16 09:53:11,614 INFO [decode.py:698] Parameters for transducer decoding: 4219830
85
+ 2023-01-16 09:53:11,614 INFO [asr_datamodule.py:449] About to get test-clean cuts
86
+ 2023-01-16 09:53:11,617 INFO [asr_datamodule.py:456] About to get test-other cuts
87
+ 2023-01-16 09:53:13,953 INFO [decode.py:459] batch 0/?, cuts processed until now is 43
88
+ 2023-01-16 09:53:32,617 INFO [decode.py:459] batch 20/?, cuts processed until now is 1430
89
+ 2023-01-16 09:53:53,226 INFO [decode.py:459] batch 40/?, cuts processed until now is 2561
90
+ 2023-01-16 09:53:53,817 INFO [decode.py:478] The transcripts are stored in tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/fast_beam_search/recogs-test-clean-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam.txt
91
+ 2023-01-16 09:53:53,883 INFO [utils.py:536] [test-clean-beam_20.0_max_contexts_8_max_states_64] %WER 7.91% [4160 / 52576, 482 ins, 411 del, 3267 sub ]
92
+ 2023-01-16 09:53:54,026 INFO [decode.py:491] Wrote detailed error stats to tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/fast_beam_search/errs-test-clean-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam.txt
93
+ 2023-01-16 09:53:54,026 INFO [decode.py:508]
94
+ For test-clean, WER of different settings are:
95
+ beam_20.0_max_contexts_8_max_states_64 7.91 best for test-clean
96
+
97
+ 2023-01-16 09:53:55,228 INFO [decode.py:459] batch 0/?, cuts processed until now is 52
98
+ 2023-01-16 09:54:12,536 INFO [decode.py:459] batch 20/?, cuts processed until now is 1646
99
+ 2023-01-16 09:54:31,673 INFO [decode.py:459] batch 40/?, cuts processed until now is 2870
100
+ 2023-01-16 09:54:32,255 INFO [decode.py:478] The transcripts are stored in tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/fast_beam_search/recogs-test-other-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam.txt
101
+ 2023-01-16 09:54:32,325 INFO [utils.py:536] [test-other-beam_20.0_max_contexts_8_max_states_64] %WER 20.10% [10521 / 52343, 1011 ins, 1417 del, 8093 sub ]
102
+ 2023-01-16 09:54:32,481 INFO [decode.py:491] Wrote detailed error stats to tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/fast_beam_search/errs-test-other-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam.txt
103
+ 2023-01-16 09:54:32,481 INFO [decode.py:508]
104
+ For test-other, WER of different settings are:
105
+ beam_20.0_max_contexts_8_max_states_64 20.1 best for test-other
106
+
107
+ 2023-01-16 09:54:32,481 INFO [decode.py:730] Done!
large_bpe_500/decode_results/fast_beam_search/recogs-test-clean-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam.txt ADDED
The diff for this file is too large to render. See raw diff
 
large_bpe_500/decode_results/fast_beam_search/recogs-test-other-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam.txt ADDED
The diff for this file is too large to render. See raw diff
 
large_bpe_500/decode_results/fast_beam_search/wer-summary-test-clean-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam.txt ADDED
@@ -0,0 +1,2 @@
 
 
 
1
+ settings WER
2
+ beam_20.0_max_contexts_8_max_states_64 7.91
large_bpe_500/decode_results/fast_beam_search/wer-summary-test-other-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam.txt ADDED
@@ -0,0 +1,2 @@
 
 
 
1
+ settings WER
2
+ beam_20.0_max_contexts_8_max_states_64 20.1
large_bpe_500/decode_results/greedy_search/errs-test-clean-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt ADDED
The diff for this file is too large to render. See raw diff
 
large_bpe_500/decode_results/greedy_search/errs-test-other-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt ADDED
The diff for this file is too large to render. See raw diff
 
large_bpe_500/decode_results/greedy_search/log-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam-2023-01-16-09-52-16 ADDED
@@ -0,0 +1,103 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ 2023-01-16 09:52:16,827 INFO [decode.py:565] Decoding started
2
+ 2023-01-16 09:52:16,827 INFO [decode.py:571] Device: cuda:0
3
+ 2023-01-16 09:52:17,018 INFO [lexicon.py:168] Loading pre-compiled data/lang_bpe_500/Linv.pt
4
+ 2023-01-16 09:52:17,057 INFO [decode.py:588] { 'activation': 'doubleswish',
5
+ 'avg': 1,
6
+ 'batch_idx_train': 0,
7
+ 'beam': 20.0,
8
+ 'beam_size': 4,
9
+ 'best_train_epoch': -1,
10
+ 'best_train_loss': inf,
11
+ 'best_valid_epoch': -1,
12
+ 'best_valid_loss': inf,
13
+ 'blank_id': 0,
14
+ 'bucketing_sampler': True,
15
+ 'channels': 400,
16
+ 'concatenate_cuts': False,
17
+ 'context_size': 2,
18
+ 'conv_layers': 18,
19
+ 'decoder_dim': 400,
20
+ 'decoding_method': 'greedy_search',
21
+ 'drop_last': True,
22
+ 'duration_factor': 1.0,
23
+ 'enable_musan': True,
24
+ 'enable_spec_aug': True,
25
+ 'encoder_dim': 400,
26
+ 'env_info': { 'IP address': '127.0.1.1',
27
+ 'hostname': 'kao-dgxa-f12-u17',
28
+ 'icefall-git-branch': 'tiny',
29
+ 'icefall-git-date': 'Fri Jan 13 07:21:29 2023',
30
+ 'icefall-git-sha1': '5c8e962-dirty',
31
+ 'icefall-path': '/home/jsong/git/icefall',
32
+ 'k2-build-type': 'Release',
33
+ 'k2-git-date': 'Fri Nov 25 08:23:51 2022',
34
+ 'k2-git-sha1': '1feafa064cf3b6c243e6b33b0192601224210937',
35
+ 'k2-path': '/home/jsong/miniconda3/envs/k2/lib/python3.9/site-packages/k2/__init__.py',
36
+ 'k2-version': '1.23.2',
37
+ 'k2-with-cuda': True,
38
+ 'lhotse-path': '/home/jsong/miniconda3/envs/k2/lib/python3.9/site-packages/lhotse/__init__.py',
39
+ 'lhotse-version': '1.7.0',
40
+ 'python-version': '3.9',
41
+ 'torch-cuda-available': True,
42
+ 'torch-cuda-version': '11.3',
43
+ 'torch-version': '1.12.0'},
44
+ 'epoch': 30,
45
+ 'exp_dir': PosixPath('tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug'),
46
+ 'feature_dim': 80,
47
+ 'full_libri': True,
48
+ 'gap': 1.0,
49
+ 'input_strategy': 'PrecomputedFeatures',
50
+ 'iter': 0,
51
+ 'joiner_dim': 400,
52
+ 'lang_dir': PosixPath('data/lang_bpe_500'),
53
+ 'log_interval': 500,
54
+ 'manifest_dir': PosixPath('data/fbank'),
55
+ 'max_contexts': 8,
56
+ 'max_duration': 600,
57
+ 'max_states': 64,
58
+ 'max_sym_per_frame': 1,
59
+ 'nbest_scale': 0.5,
60
+ 'ngram_lm_scale': 0.1,
61
+ 'num_buckets': 30,
62
+ 'num_paths': 100,
63
+ 'num_workers': 2,
64
+ 'on_the_fly_feats': False,
65
+ 'res_dir': PosixPath('tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/greedy_search'),
66
+ 'reset_interval': 200,
67
+ 'return_cuts': True,
68
+ 'shuffle': True,
69
+ 'skip_add': True,
70
+ 'spec_aug_time_warp_factor': 80,
71
+ 'subsampling_factor': 4,
72
+ 'suffix': 'epoch-30-avg-1-context-2-max-sym-per-frame-1-uam',
73
+ 'unk_id': 2,
74
+ 'use_averaged_model': True,
75
+ 'use_double_scores': True,
76
+ 'use_dscnn': True,
77
+ 'valid_interval': 9000,
78
+ 'vocab_size': 500,
79
+ 'warm_step': 5000}
80
+ 2023-01-16 09:52:17,058 INFO [decode.py:590] About to create model
81
+ 2023-01-16 09:52:19,589 INFO [train.py:426] Encoder MAC ops for 10 seconds of audio is 888.44M
82
+ 2023-01-16 09:52:19,594 INFO [decode.py:659] Calculating the averaged model over epoch range from 29 (excluded) to 30
83
+ 2023-01-16 09:52:19,977 INFO [decode.py:697] Number of model parameters: 4821330
84
+ 2023-01-16 09:52:19,977 INFO [decode.py:698] Parameters for transducer decoding: 4219830
85
+ 2023-01-16 09:52:19,977 INFO [asr_datamodule.py:449] About to get test-clean cuts
86
+ 2023-01-16 09:52:19,978 INFO [asr_datamodule.py:456] About to get test-other cuts
87
+ 2023-01-16 09:52:21,770 INFO [decode.py:459] batch 0/?, cuts processed until now is 43
88
+ 2023-01-16 09:52:39,401 INFO [decode.py:478] The transcripts are stored in tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/greedy_search/recogs-test-clean-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt
89
+ 2023-01-16 09:52:39,467 INFO [utils.py:536] [test-clean-greedy_search] %WER 8.29% [4359 / 52576, 501 ins, 443 del, 3415 sub ]
90
+ 2023-01-16 09:52:39,617 INFO [decode.py:491] Wrote detailed error stats to tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/greedy_search/errs-test-clean-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt
91
+ 2023-01-16 09:52:39,618 INFO [decode.py:508]
92
+ For test-clean, WER of different settings are:
93
+ greedy_search 8.29 best for test-clean
94
+
95
+ 2023-01-16 09:52:40,281 INFO [decode.py:459] batch 0/?, cuts processed until now is 52
96
+ 2023-01-16 09:52:56,315 INFO [decode.py:478] The transcripts are stored in tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/greedy_search/recogs-test-other-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt
97
+ 2023-01-16 09:52:56,386 INFO [utils.py:536] [test-other-greedy_search] %WER 21.11% [11052 / 52343, 1006 ins, 1534 del, 8512 sub ]
98
+ 2023-01-16 09:52:56,547 INFO [decode.py:491] Wrote detailed error stats to tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/greedy_search/errs-test-other-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt
99
+ 2023-01-16 09:52:56,547 INFO [decode.py:508]
100
+ For test-other, WER of different settings are:
101
+ greedy_search 21.11 best for test-other
102
+
103
+ 2023-01-16 09:52:56,547 INFO [decode.py:730] Done!
large_bpe_500/decode_results/greedy_search/recogs-test-clean-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt ADDED
The diff for this file is too large to render. See raw diff
 
large_bpe_500/decode_results/greedy_search/recogs-test-other-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt ADDED
The diff for this file is too large to render. See raw diff
 
large_bpe_500/decode_results/greedy_search/wer-summary-test-clean-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt ADDED
@@ -0,0 +1,2 @@
 
 
 
1
+ settings WER
2
+ greedy_search 8.29
large_bpe_500/decode_results/greedy_search/wer-summary-test-other-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt ADDED
@@ -0,0 +1,2 @@
 
 
 
1
+ settings WER
2
+ greedy_search 21.11
large_bpe_500/decode_results/modified_beam_search/errs-test-clean-beam_size_4-epoch-30-avg-1-modified_beam_search-beam-size-4-uam.txt ADDED
The diff for this file is too large to render. See raw diff
 
large_bpe_500/decode_results/modified_beam_search/errs-test-other-beam_size_4-epoch-30-avg-1-modified_beam_search-beam-size-4-uam.txt ADDED
The diff for this file is too large to render. See raw diff
 
large_bpe_500/decode_results/modified_beam_search/log-epoch-30-avg-1-modified_beam_search-beam-size-4-uam-2023-01-16-09-54-54 ADDED
@@ -0,0 +1,107 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ 2023-01-16 09:54:54,538 INFO [decode.py:565] Decoding started
2
+ 2023-01-16 09:54:54,538 INFO [decode.py:571] Device: cuda:0
3
+ 2023-01-16 09:54:54,718 INFO [lexicon.py:168] Loading pre-compiled data/lang_bpe_500/Linv.pt
4
+ 2023-01-16 09:54:54,758 INFO [decode.py:588] { 'activation': 'doubleswish',
5
+ 'avg': 1,
6
+ 'batch_idx_train': 0,
7
+ 'beam': 20.0,
8
+ 'beam_size': 4,
9
+ 'best_train_epoch': -1,
10
+ 'best_train_loss': inf,
11
+ 'best_valid_epoch': -1,
12
+ 'best_valid_loss': inf,
13
+ 'blank_id': 0,
14
+ 'bucketing_sampler': True,
15
+ 'channels': 400,
16
+ 'concatenate_cuts': False,
17
+ 'context_size': 2,
18
+ 'conv_layers': 18,
19
+ 'decoder_dim': 400,
20
+ 'decoding_method': 'modified_beam_search',
21
+ 'drop_last': True,
22
+ 'duration_factor': 1.0,
23
+ 'enable_musan': True,
24
+ 'enable_spec_aug': True,
25
+ 'encoder_dim': 400,
26
+ 'env_info': { 'IP address': '127.0.1.1',
27
+ 'hostname': 'kao-dgxa-f12-u17',
28
+ 'icefall-git-branch': 'tiny',
29
+ 'icefall-git-date': 'Fri Jan 13 07:21:29 2023',
30
+ 'icefall-git-sha1': '5c8e962-dirty',
31
+ 'icefall-path': '/home/jsong/git/icefall',
32
+ 'k2-build-type': 'Release',
33
+ 'k2-git-date': 'Fri Nov 25 08:23:51 2022',
34
+ 'k2-git-sha1': '1feafa064cf3b6c243e6b33b0192601224210937',
35
+ 'k2-path': '/home/jsong/miniconda3/envs/k2/lib/python3.9/site-packages/k2/__init__.py',
36
+ 'k2-version': '1.23.2',
37
+ 'k2-with-cuda': True,
38
+ 'lhotse-path': '/home/jsong/miniconda3/envs/k2/lib/python3.9/site-packages/lhotse/__init__.py',
39
+ 'lhotse-version': '1.7.0',
40
+ 'python-version': '3.9',
41
+ 'torch-cuda-available': True,
42
+ 'torch-cuda-version': '11.3',
43
+ 'torch-version': '1.12.0'},
44
+ 'epoch': 30,
45
+ 'exp_dir': PosixPath('tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug'),
46
+ 'feature_dim': 80,
47
+ 'full_libri': True,
48
+ 'gap': 1.0,
49
+ 'input_strategy': 'PrecomputedFeatures',
50
+ 'iter': 0,
51
+ 'joiner_dim': 400,
52
+ 'lang_dir': PosixPath('data/lang_bpe_500'),
53
+ 'log_interval': 500,
54
+ 'manifest_dir': PosixPath('data/fbank'),
55
+ 'max_contexts': 8,
56
+ 'max_duration': 600,
57
+ 'max_states': 64,
58
+ 'max_sym_per_frame': 1,
59
+ 'nbest_scale': 0.5,
60
+ 'ngram_lm_scale': 0.1,
61
+ 'num_buckets': 30,
62
+ 'num_paths': 100,
63
+ 'num_workers': 2,
64
+ 'on_the_fly_feats': False,
65
+ 'res_dir': PosixPath('tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/modified_beam_search'),
66
+ 'reset_interval': 200,
67
+ 'return_cuts': True,
68
+ 'shuffle': True,
69
+ 'skip_add': True,
70
+ 'spec_aug_time_warp_factor': 80,
71
+ 'subsampling_factor': 4,
72
+ 'suffix': 'epoch-30-avg-1-modified_beam_search-beam-size-4-uam',
73
+ 'unk_id': 2,
74
+ 'use_averaged_model': True,
75
+ 'use_double_scores': True,
76
+ 'use_dscnn': True,
77
+ 'valid_interval': 9000,
78
+ 'vocab_size': 500,
79
+ 'warm_step': 5000}
80
+ 2023-01-16 09:54:54,758 INFO [decode.py:590] About to create model
81
+ 2023-01-16 09:54:57,321 INFO [train.py:426] Encoder MAC ops for 10 seconds of audio is 888.44M
82
+ 2023-01-16 09:54:57,325 INFO [decode.py:659] Calculating the averaged model over epoch range from 29 (excluded) to 30
83
+ 2023-01-16 09:54:57,706 INFO [decode.py:697] Number of model parameters: 4821330
84
+ 2023-01-16 09:54:57,706 INFO [decode.py:698] Parameters for transducer decoding: 4219830
85
+ 2023-01-16 09:54:57,706 INFO [asr_datamodule.py:449] About to get test-clean cuts
86
+ 2023-01-16 09:54:57,707 INFO [asr_datamodule.py:456] About to get test-other cuts
87
+ 2023-01-16 09:55:01,888 INFO [decode.py:459] batch 0/?, cuts processed until now is 43
88
+ 2023-01-16 09:55:55,346 INFO [decode.py:459] batch 20/?, cuts processed until now is 1430
89
+ 2023-01-16 09:56:40,761 INFO [decode.py:459] batch 40/?, cuts processed until now is 2561
90
+ 2023-01-16 09:56:42,416 INFO [decode.py:478] The transcripts are stored in tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/modified_beam_search/recogs-test-clean-beam_size_4-epoch-30-avg-1-modified_beam_search-beam-size-4-uam.txt
91
+ 2023-01-16 09:56:42,520 INFO [utils.py:536] [test-clean-beam_size_4] %WER 7.74% [4072 / 52576, 492 ins, 384 del, 3196 sub ]
92
+ 2023-01-16 09:56:42,664 INFO [decode.py:491] Wrote detailed error stats to tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/modified_beam_search/errs-test-clean-beam_size_4-epoch-30-avg-1-modified_beam_search-beam-size-4-uam.txt
93
+ 2023-01-16 09:56:42,664 INFO [decode.py:508]
94
+ For test-clean, WER of different settings are:
95
+ beam_size_4 7.74 best for test-clean
96
+
97
+ 2023-01-16 09:56:45,681 INFO [decode.py:459] batch 0/?, cuts processed until now is 52
98
+ 2023-01-16 09:57:37,910 INFO [decode.py:459] batch 20/?, cuts processed until now is 1646
99
+ 2023-01-16 09:58:20,895 INFO [decode.py:459] batch 40/?, cuts processed until now is 2870
100
+ 2023-01-16 09:58:22,601 INFO [decode.py:478] The transcripts are stored in tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/modified_beam_search/recogs-test-other-beam_size_4-epoch-30-avg-1-modified_beam_search-beam-size-4-uam.txt
101
+ 2023-01-16 09:58:22,672 INFO [utils.py:536] [test-other-beam_size_4] %WER 19.89% [10412 / 52343, 1054 ins, 1301 del, 8057 sub ]
102
+ 2023-01-16 09:58:22,832 INFO [decode.py:491] Wrote detailed error stats to tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/modified_beam_search/errs-test-other-beam_size_4-epoch-30-avg-1-modified_beam_search-beam-size-4-uam.txt
103
+ 2023-01-16 09:58:22,832 INFO [decode.py:508]
104
+ For test-other, WER of different settings are:
105
+ beam_size_4 19.89 best for test-other
106
+
107
+ 2023-01-16 09:58:22,832 INFO [decode.py:730] Done!
large_bpe_500/decode_results/modified_beam_search/recogs-test-clean-beam_size_4-epoch-30-avg-1-modified_beam_search-beam-size-4-uam.txt ADDED
The diff for this file is too large to render. See raw diff
 
large_bpe_500/decode_results/modified_beam_search/recogs-test-other-beam_size_4-epoch-30-avg-1-modified_beam_search-beam-size-4-uam.txt ADDED
The diff for this file is too large to render. See raw diff
 
large_bpe_500/decode_results/modified_beam_search/wer-summary-test-clean-beam_size_4-epoch-30-avg-1-modified_beam_search-beam-size-4-uam.txt ADDED
@@ -0,0 +1,2 @@
 
 
 
1
+ settings WER
2
+ beam_size_4 7.74
large_bpe_500/decode_results/modified_beam_search/wer-summary-test-other-beam_size_4-epoch-30-avg-1-modified_beam_search-beam-size-4-uam.txt ADDED
@@ -0,0 +1,2 @@
 
 
 
1
+ settings WER
2
+ beam_size_4 19.89
large_bpe_500/exp/cpu_jit.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:20897e2653f7130fc2c1a52621cf01bb990010659c886124c961620c58d99d5f
3
+ size 19576790
large_bpe_500/exp/log/log-train-2023-01-09-06-23-27 ADDED
The diff for this file is too large to render. See raw diff
 
large_bpe_500/exp/pretrained.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:be438679835e07dc63718fe490fd838cc532d04f0e51248314f116676bd10851
3
+ size 19539685
large_bpe_500/exp/tensorboard/events.out.tfevents.1673245407.kao-dgxa-f12-u17.4075883.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7694fbf23c8fd399fd58b484a3837fa8f38ac8db1a098dd40e2512677a338e04
3
+ size 618506
middle_bpe_500/decode_results/fast_beam_search/errs-test-clean-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam.txt ADDED
The diff for this file is too large to render. See raw diff
 
middle_bpe_500/decode_results/fast_beam_search/errs-test-other-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam.txt ADDED
The diff for this file is too large to render. See raw diff
 
middle_bpe_500/decode_results/fast_beam_search/log-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam-2023-01-09-01-15-28 ADDED
@@ -0,0 +1,107 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ 2023-01-09 01:15:28,211 INFO [decode.py:565] Decoding started
2
+ 2023-01-09 01:15:28,211 INFO [decode.py:571] Device: cuda:0
3
+ 2023-01-09 01:15:28,390 INFO [lexicon.py:168] Loading pre-compiled data/lang_bpe_500/Linv.pt
4
+ 2023-01-09 01:15:28,425 INFO [decode.py:588] { 'activation': 'doubleswish',
5
+ 'avg': 2,
6
+ 'batch_idx_train': 0,
7
+ 'beam': 20.0,
8
+ 'beam_size': 4,
9
+ 'best_train_epoch': -1,
10
+ 'best_train_loss': inf,
11
+ 'best_valid_epoch': -1,
12
+ 'best_valid_loss': inf,
13
+ 'blank_id': 0,
14
+ 'bucketing_sampler': True,
15
+ 'channels': 300,
16
+ 'concatenate_cuts': False,
17
+ 'context_size': 2,
18
+ 'conv_layers': 18,
19
+ 'decoder_dim': 256,
20
+ 'decoding_method': 'fast_beam_search',
21
+ 'drop_last': True,
22
+ 'duration_factor': 1.0,
23
+ 'enable_musan': True,
24
+ 'enable_spec_aug': True,
25
+ 'encoder_dim': 256,
26
+ 'env_info': { 'IP address': '127.0.1.1',
27
+ 'hostname': 'kao-dgxa-f12-u17',
28
+ 'icefall-git-branch': 'tiny',
29
+ 'icefall-git-date': 'Mon Jan 2 00:08:32 2023',
30
+ 'icefall-git-sha1': '2fd970b-dirty',
31
+ 'icefall-path': '/home/jsong/git/icefall',
32
+ 'k2-build-type': 'Release',
33
+ 'k2-git-date': 'Fri Nov 25 08:23:51 2022',
34
+ 'k2-git-sha1': '1feafa064cf3b6c243e6b33b0192601224210937',
35
+ 'k2-path': '/home/jsong/miniconda3/envs/k2/lib/python3.9/site-packages/k2/__init__.py',
36
+ 'k2-version': '1.23.2',
37
+ 'k2-with-cuda': True,
38
+ 'lhotse-path': '/home/jsong/miniconda3/envs/k2/lib/python3.9/site-packages/lhotse/__init__.py',
39
+ 'lhotse-version': '1.7.0',
40
+ 'python-version': '3.9',
41
+ 'torch-cuda-available': True,
42
+ 'torch-cuda-version': '11.3',
43
+ 'torch-version': '1.12.0'},
44
+ 'epoch': 30,
45
+ 'exp_dir': PosixPath('tiny_transducer_ctc/exp_2m_bpe500_halfdelay'),
46
+ 'feature_dim': 80,
47
+ 'full_libri': True,
48
+ 'gap': 1.0,
49
+ 'input_strategy': 'PrecomputedFeatures',
50
+ 'iter': 0,
51
+ 'joiner_dim': 256,
52
+ 'lang_dir': PosixPath('data/lang_bpe_500'),
53
+ 'log_interval': 500,
54
+ 'manifest_dir': PosixPath('data/fbank'),
55
+ 'max_contexts': 8,
56
+ 'max_duration': 600,
57
+ 'max_states': 64,
58
+ 'max_sym_per_frame': 1,
59
+ 'nbest_scale': 0.5,
60
+ 'ngram_lm_scale': 0.1,
61
+ 'num_buckets': 30,
62
+ 'num_paths': 100,
63
+ 'num_workers': 2,
64
+ 'on_the_fly_feats': False,
65
+ 'res_dir': PosixPath('tiny_transducer_ctc/exp_2m_bpe500_halfdelay/fast_beam_search'),
66
+ 'reset_interval': 200,
67
+ 'return_cuts': True,
68
+ 'shuffle': True,
69
+ 'skip_add': True,
70
+ 'spec_aug_time_warp_factor': 80,
71
+ 'subsampling_factor': 4,
72
+ 'suffix': 'epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam',
73
+ 'unk_id': 2,
74
+ 'use_averaged_model': True,
75
+ 'use_double_scores': True,
76
+ 'use_dscnn': True,
77
+ 'valid_interval': 9000,
78
+ 'vocab_size': 500,
79
+ 'warm_step': 5000}
80
+ 2023-01-09 01:15:28,425 INFO [decode.py:590] About to create model
81
+ 2023-01-09 01:15:30,875 INFO [train.py:425] Encoder MAC ops for 10 seconds of audio is 501.07M
82
+ 2023-01-09 01:15:30,878 INFO [decode.py:659] Calculating the averaged model over epoch range from 28 (excluded) to 30
83
+ 2023-01-09 01:15:31,196 INFO [decode.py:697] Number of model parameters: 2735794
84
+ 2023-01-09 01:15:31,196 INFO [decode.py:698] Parameters for transducer decoding: 2350294
85
+ 2023-01-09 01:15:31,196 INFO [asr_datamodule.py:443] About to get test-clean cuts
86
+ 2023-01-09 01:15:31,197 INFO [asr_datamodule.py:450] About to get test-other cuts
87
+ 2023-01-09 01:15:33,792 INFO [decode.py:459] batch 0/?, cuts processed until now is 43
88
+ 2023-01-09 01:15:52,777 INFO [decode.py:459] batch 20/?, cuts processed until now is 1430
89
+ 2023-01-09 01:16:13,995 INFO [decode.py:459] batch 40/?, cuts processed until now is 2561
90
+ 2023-01-09 01:16:14,590 INFO [decode.py:478] The transcripts are stored in tiny_transducer_ctc/exp_2m_bpe500_halfdelay/fast_beam_search/recogs-test-clean-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam.txt
91
+ 2023-01-09 01:16:14,663 INFO [utils.py:536] [test-clean-beam_20.0_max_contexts_8_max_states_64] %WER 9.69% [5096 / 52576, 599 ins, 487 del, 4010 sub ]
92
+ 2023-01-09 01:16:14,812 INFO [decode.py:491] Wrote detailed error stats to tiny_transducer_ctc/exp_2m_bpe500_halfdelay/fast_beam_search/errs-test-clean-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam.txt
93
+ 2023-01-09 01:16:14,812 INFO [decode.py:508]
94
+ For test-clean, WER of different settings are:
95
+ beam_20.0_max_contexts_8_max_states_64 9.69 best for test-clean
96
+
97
+ 2023-01-09 01:16:16,043 INFO [decode.py:459] batch 0/?, cuts processed until now is 52
98
+ 2023-01-09 01:16:33,449 INFO [decode.py:459] batch 20/?, cuts processed until now is 1646
99
+ 2023-01-09 01:16:53,170 INFO [decode.py:459] batch 40/?, cuts processed until now is 2870
100
+ 2023-01-09 01:16:53,753 INFO [decode.py:478] The transcripts are stored in tiny_transducer_ctc/exp_2m_bpe500_halfdelay/fast_beam_search/recogs-test-other-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam.txt
101
+ 2023-01-09 01:16:53,825 INFO [utils.py:536] [test-other-beam_20.0_max_contexts_8_max_states_64] %WER 23.58% [12345 / 52343, 1208 ins, 1744 del, 9393 sub ]
102
+ 2023-01-09 01:16:53,988 INFO [decode.py:491] Wrote detailed error stats to tiny_transducer_ctc/exp_2m_bpe500_halfdelay/fast_beam_search/errs-test-other-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam.txt
103
+ 2023-01-09 01:16:53,988 INFO [decode.py:508]
104
+ For test-other, WER of different settings are:
105
+ beam_20.0_max_contexts_8_max_states_64 23.58 best for test-other
106
+
107
+ 2023-01-09 01:16:53,988 INFO [decode.py:730] Done!
middle_bpe_500/decode_results/fast_beam_search/recogs-test-clean-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam.txt ADDED
The diff for this file is too large to render. See raw diff
 
middle_bpe_500/decode_results/fast_beam_search/recogs-test-other-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam.txt ADDED
The diff for this file is too large to render. See raw diff
 
middle_bpe_500/decode_results/fast_beam_search/wer-summary-test-clean-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam.txt ADDED
@@ -0,0 +1,2 @@
 
 
 
1
+ settings WER
2
+ beam_20.0_max_contexts_8_max_states_64 9.69
middle_bpe_500/decode_results/fast_beam_search/wer-summary-test-other-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam.txt ADDED
@@ -0,0 +1,2 @@
 
 
 
1
+ settings WER
2
+ beam_20.0_max_contexts_8_max_states_64 23.58
middle_bpe_500/decode_results/greedy_search/errs-test-clean-greedy_search-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam.txt ADDED
The diff for this file is too large to render. See raw diff
 
middle_bpe_500/decode_results/greedy_search/errs-test-other-greedy_search-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam.txt ADDED
The diff for this file is too large to render. See raw diff
 
middle_bpe_500/decode_results/greedy_search/log-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam-2023-01-09-01-12-47 ADDED
@@ -0,0 +1,103 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ 2023-01-09 01:12:47,525 INFO [decode.py:565] Decoding started
2
+ 2023-01-09 01:12:47,525 INFO [decode.py:571] Device: cuda:0
3
+ 2023-01-09 01:12:47,708 INFO [lexicon.py:168] Loading pre-compiled data/lang_bpe_500/Linv.pt
4
+ 2023-01-09 01:12:47,749 INFO [decode.py:588] { 'activation': 'doubleswish',
5
+ 'avg': 2,
6
+ 'batch_idx_train': 0,
7
+ 'beam': 20.0,
8
+ 'beam_size': 4,
9
+ 'best_train_epoch': -1,
10
+ 'best_train_loss': inf,
11
+ 'best_valid_epoch': -1,
12
+ 'best_valid_loss': inf,
13
+ 'blank_id': 0,
14
+ 'bucketing_sampler': True,
15
+ 'channels': 300,
16
+ 'concatenate_cuts': False,
17
+ 'context_size': 2,
18
+ 'conv_layers': 18,
19
+ 'decoder_dim': 256,
20
+ 'decoding_method': 'greedy_search',
21
+ 'drop_last': True,
22
+ 'duration_factor': 1.0,
23
+ 'enable_musan': True,
24
+ 'enable_spec_aug': True,
25
+ 'encoder_dim': 256,
26
+ 'env_info': { 'IP address': '127.0.1.1',
27
+ 'hostname': 'kao-dgxa-f12-u17',
28
+ 'icefall-git-branch': 'tiny',
29
+ 'icefall-git-date': 'Mon Jan 2 00:08:32 2023',
30
+ 'icefall-git-sha1': '2fd970b-dirty',
31
+ 'icefall-path': '/home/jsong/git/icefall',
32
+ 'k2-build-type': 'Release',
33
+ 'k2-git-date': 'Fri Nov 25 08:23:51 2022',
34
+ 'k2-git-sha1': '1feafa064cf3b6c243e6b33b0192601224210937',
35
+ 'k2-path': '/home/jsong/miniconda3/envs/k2/lib/python3.9/site-packages/k2/__init__.py',
36
+ 'k2-version': '1.23.2',
37
+ 'k2-with-cuda': True,
38
+ 'lhotse-path': '/home/jsong/miniconda3/envs/k2/lib/python3.9/site-packages/lhotse/__init__.py',
39
+ 'lhotse-version': '1.7.0',
40
+ 'python-version': '3.9',
41
+ 'torch-cuda-available': True,
42
+ 'torch-cuda-version': '11.3',
43
+ 'torch-version': '1.12.0'},
44
+ 'epoch': 30,
45
+ 'exp_dir': PosixPath('tiny_transducer_ctc/exp_2m_bpe500_halfdelay'),
46
+ 'feature_dim': 80,
47
+ 'full_libri': True,
48
+ 'gap': 1.0,
49
+ 'input_strategy': 'PrecomputedFeatures',
50
+ 'iter': 0,
51
+ 'joiner_dim': 256,
52
+ 'lang_dir': PosixPath('data/lang_bpe_500'),
53
+ 'log_interval': 500,
54
+ 'manifest_dir': PosixPath('data/fbank'),
55
+ 'max_contexts': 8,
56
+ 'max_duration': 600,
57
+ 'max_states': 64,
58
+ 'max_sym_per_frame': 1,
59
+ 'nbest_scale': 0.5,
60
+ 'ngram_lm_scale': 0.1,
61
+ 'num_buckets': 30,
62
+ 'num_paths': 100,
63
+ 'num_workers': 2,
64
+ 'on_the_fly_feats': False,
65
+ 'res_dir': PosixPath('tiny_transducer_ctc/exp_2m_bpe500_halfdelay/greedy_search'),
66
+ 'reset_interval': 200,
67
+ 'return_cuts': True,
68
+ 'shuffle': True,
69
+ 'skip_add': True,
70
+ 'spec_aug_time_warp_factor': 80,
71
+ 'subsampling_factor': 4,
72
+ 'suffix': 'epoch-30-avg-2-context-2-max-sym-per-frame-1-uam',
73
+ 'unk_id': 2,
74
+ 'use_averaged_model': True,
75
+ 'use_double_scores': True,
76
+ 'use_dscnn': True,
77
+ 'valid_interval': 9000,
78
+ 'vocab_size': 500,
79
+ 'warm_step': 5000}
80
+ 2023-01-09 01:12:47,750 INFO [decode.py:590] About to create model
81
+ 2023-01-09 01:12:50,232 INFO [train.py:425] Encoder MAC ops for 10 seconds of audio is 501.07M
82
+ 2023-01-09 01:12:50,235 INFO [decode.py:659] Calculating the averaged model over epoch range from 28 (excluded) to 30
83
+ 2023-01-09 01:12:50,572 INFO [decode.py:697] Number of model parameters: 2735794
84
+ 2023-01-09 01:12:50,572 INFO [decode.py:698] Parameters for transducer decoding: 2350294
85
+ 2023-01-09 01:12:50,572 INFO [asr_datamodule.py:443] About to get test-clean cuts
86
+ 2023-01-09 01:12:50,574 INFO [asr_datamodule.py:450] About to get test-other cuts
87
+ 2023-01-09 01:12:52,369 INFO [decode.py:459] batch 0/?, cuts processed until now is 43
88
+ 2023-01-09 01:13:09,470 INFO [decode.py:478] The transcripts are stored in tiny_transducer_ctc/exp_2m_bpe500_halfdelay/greedy_search/recogs-test-clean-greedy_search-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam.txt
89
+ 2023-01-09 01:13:09,537 INFO [utils.py:536] [test-clean-greedy_search] %WER 10.26% [5394 / 52576, 598 ins, 569 del, 4227 sub ]
90
+ 2023-01-09 01:13:09,683 INFO [decode.py:491] Wrote detailed error stats to tiny_transducer_ctc/exp_2m_bpe500_halfdelay/greedy_search/errs-test-clean-greedy_search-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam.txt
91
+ 2023-01-09 01:13:09,683 INFO [decode.py:508]
92
+ For test-clean, WER of different settings are:
93
+ greedy_search 10.26 best for test-clean
94
+
95
+ 2023-01-09 01:13:10,353 INFO [decode.py:459] batch 0/?, cuts processed until now is 52
96
+ 2023-01-09 01:13:25,888 INFO [decode.py:478] The transcripts are stored in tiny_transducer_ctc/exp_2m_bpe500_halfdelay/greedy_search/recogs-test-other-greedy_search-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam.txt
97
+ 2023-01-09 01:13:25,960 INFO [utils.py:536] [test-other-greedy_search] %WER 25.13% [13156 / 52343, 1217 ins, 1939 del, 10000 sub ]
98
+ 2023-01-09 01:13:26,121 INFO [decode.py:491] Wrote detailed error stats to tiny_transducer_ctc/exp_2m_bpe500_halfdelay/greedy_search/errs-test-other-greedy_search-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam.txt
99
+ 2023-01-09 01:13:26,121 INFO [decode.py:508]
100
+ For test-other, WER of different settings are:
101
+ greedy_search 25.13 best for test-other
102
+
103
+ 2023-01-09 01:13:26,121 INFO [decode.py:730] Done!
middle_bpe_500/decode_results/greedy_search/recogs-test-clean-greedy_search-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam.txt ADDED
The diff for this file is too large to render. See raw diff
 
middle_bpe_500/decode_results/greedy_search/recogs-test-other-greedy_search-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam.txt ADDED
The diff for this file is too large to render. See raw diff
 
middle_bpe_500/decode_results/greedy_search/wer-summary-test-clean-greedy_search-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam.txt ADDED
@@ -0,0 +1,2 @@
 
 
 
1
+ settings WER
2
+ greedy_search 10.26
middle_bpe_500/decode_results/greedy_search/wer-summary-test-other-greedy_search-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam.txt ADDED
@@ -0,0 +1,2 @@
 
 
 
1
+ settings WER
2
+ greedy_search 25.13
middle_bpe_500/decode_results/modified_beam_search/errs-test-clean-beam_size_4-epoch-30-avg-2-modified_beam_search-beam-size-4-uam.txt ADDED
The diff for this file is too large to render. See raw diff
 
middle_bpe_500/decode_results/modified_beam_search/errs-test-other-beam_size_4-epoch-30-avg-2-modified_beam_search-beam-size-4-uam.txt ADDED
The diff for this file is too large to render. See raw diff
 
middle_bpe_500/decode_results/modified_beam_search/log-epoch-30-avg-2-modified_beam_search-beam-size-4-uam-2023-01-09-01-17-06 ADDED
@@ -0,0 +1,107 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ 2023-01-09 01:17:06,332 INFO [decode.py:565] Decoding started
2
+ 2023-01-09 01:17:06,332 INFO [decode.py:571] Device: cuda:0
3
+ 2023-01-09 01:17:06,514 INFO [lexicon.py:168] Loading pre-compiled data/lang_bpe_500/Linv.pt
4
+ 2023-01-09 01:17:06,554 INFO [decode.py:588] { 'activation': 'doubleswish',
5
+ 'avg': 2,
6
+ 'batch_idx_train': 0,
7
+ 'beam': 20.0,
8
+ 'beam_size': 4,
9
+ 'best_train_epoch': -1,
10
+ 'best_train_loss': inf,
11
+ 'best_valid_epoch': -1,
12
+ 'best_valid_loss': inf,
13
+ 'blank_id': 0,
14
+ 'bucketing_sampler': True,
15
+ 'channels': 300,
16
+ 'concatenate_cuts': False,
17
+ 'context_size': 2,
18
+ 'conv_layers': 18,
19
+ 'decoder_dim': 256,
20
+ 'decoding_method': 'modified_beam_search',
21
+ 'drop_last': True,
22
+ 'duration_factor': 1.0,
23
+ 'enable_musan': True,
24
+ 'enable_spec_aug': True,
25
+ 'encoder_dim': 256,
26
+ 'env_info': { 'IP address': '127.0.1.1',
27
+ 'hostname': 'kao-dgxa-f12-u17',
28
+ 'icefall-git-branch': 'tiny',
29
+ 'icefall-git-date': 'Mon Jan 2 00:08:32 2023',
30
+ 'icefall-git-sha1': '2fd970b-dirty',
31
+ 'icefall-path': '/home/jsong/git/icefall',
32
+ 'k2-build-type': 'Release',
33
+ 'k2-git-date': 'Fri Nov 25 08:23:51 2022',
34
+ 'k2-git-sha1': '1feafa064cf3b6c243e6b33b0192601224210937',
35
+ 'k2-path': '/home/jsong/miniconda3/envs/k2/lib/python3.9/site-packages/k2/__init__.py',
36
+ 'k2-version': '1.23.2',
37
+ 'k2-with-cuda': True,
38
+ 'lhotse-path': '/home/jsong/miniconda3/envs/k2/lib/python3.9/site-packages/lhotse/__init__.py',
39
+ 'lhotse-version': '1.7.0',
40
+ 'python-version': '3.9',
41
+ 'torch-cuda-available': True,
42
+ 'torch-cuda-version': '11.3',
43
+ 'torch-version': '1.12.0'},
44
+ 'epoch': 30,
45
+ 'exp_dir': PosixPath('tiny_transducer_ctc/exp_2m_bpe500_halfdelay'),
46
+ 'feature_dim': 80,
47
+ 'full_libri': True,
48
+ 'gap': 1.0,
49
+ 'input_strategy': 'PrecomputedFeatures',
50
+ 'iter': 0,
51
+ 'joiner_dim': 256,
52
+ 'lang_dir': PosixPath('data/lang_bpe_500'),
53
+ 'log_interval': 500,
54
+ 'manifest_dir': PosixPath('data/fbank'),
55
+ 'max_contexts': 8,
56
+ 'max_duration': 600,
57
+ 'max_states': 64,
58
+ 'max_sym_per_frame': 1,
59
+ 'nbest_scale': 0.5,
60
+ 'ngram_lm_scale': 0.1,
61
+ 'num_buckets': 30,
62
+ 'num_paths': 100,
63
+ 'num_workers': 2,
64
+ 'on_the_fly_feats': False,
65
+ 'res_dir': PosixPath('tiny_transducer_ctc/exp_2m_bpe500_halfdelay/modified_beam_search'),
66
+ 'reset_interval': 200,
67
+ 'return_cuts': True,
68
+ 'shuffle': True,
69
+ 'skip_add': True,
70
+ 'spec_aug_time_warp_factor': 80,
71
+ 'subsampling_factor': 4,
72
+ 'suffix': 'epoch-30-avg-2-modified_beam_search-beam-size-4-uam',
73
+ 'unk_id': 2,
74
+ 'use_averaged_model': True,
75
+ 'use_double_scores': True,
76
+ 'use_dscnn': True,
77
+ 'valid_interval': 9000,
78
+ 'vocab_size': 500,
79
+ 'warm_step': 5000}
80
+ 2023-01-09 01:17:06,555 INFO [decode.py:590] About to create model
81
+ 2023-01-09 01:17:08,981 INFO [train.py:425] Encoder MAC ops for 10 seconds of audio is 501.07M
82
+ 2023-01-09 01:17:08,984 INFO [decode.py:659] Calculating the averaged model over epoch range from 28 (excluded) to 30
83
+ 2023-01-09 01:17:09,308 INFO [decode.py:697] Number of model parameters: 2735794
84
+ 2023-01-09 01:17:09,308 INFO [decode.py:698] Parameters for transducer decoding: 2350294
85
+ 2023-01-09 01:17:09,308 INFO [asr_datamodule.py:443] About to get test-clean cuts
86
+ 2023-01-09 01:17:09,309 INFO [asr_datamodule.py:450] About to get test-other cuts
87
+ 2023-01-09 01:17:13,410 INFO [decode.py:459] batch 0/?, cuts processed until now is 43
88
+ 2023-01-09 01:18:06,305 INFO [decode.py:459] batch 20/?, cuts processed until now is 1430
89
+ 2023-01-09 01:18:50,859 INFO [decode.py:459] batch 40/?, cuts processed until now is 2561
90
+ 2023-01-09 01:18:52,480 INFO [decode.py:478] The transcripts are stored in tiny_transducer_ctc/exp_2m_bpe500_halfdelay/modified_beam_search/recogs-test-clean-beam_size_4-epoch-30-avg-2-modified_beam_search-beam-size-4-uam.txt
91
+ 2023-01-09 01:18:52,547 INFO [utils.py:536] [test-clean-beam_size_4] %WER 9.43% [4959 / 52576, 592 ins, 455 del, 3912 sub ]
92
+ 2023-01-09 01:18:52,730 INFO [decode.py:491] Wrote detailed error stats to tiny_transducer_ctc/exp_2m_bpe500_halfdelay/modified_beam_search/errs-test-clean-beam_size_4-epoch-30-avg-2-modified_beam_search-beam-size-4-uam.txt
93
+ 2023-01-09 01:18:52,731 INFO [decode.py:508]
94
+ For test-clean, WER of different settings are:
95
+ beam_size_4 9.43 best for test-clean
96
+
97
+ 2023-01-09 01:18:55,707 INFO [decode.py:459] batch 0/?, cuts processed until now is 52
98
+ 2023-01-09 01:19:46,663 INFO [decode.py:459] batch 20/?, cuts processed until now is 1646
99
+ 2023-01-09 01:20:28,554 INFO [decode.py:459] batch 40/?, cuts processed until now is 2870
100
+ 2023-01-09 01:20:30,170 INFO [decode.py:478] The transcripts are stored in tiny_transducer_ctc/exp_2m_bpe500_halfdelay/modified_beam_search/recogs-test-other-beam_size_4-epoch-30-avg-2-modified_beam_search-beam-size-4-uam.txt
101
+ 2023-01-09 01:20:30,285 INFO [utils.py:536] [test-other-beam_size_4] %WER 23.53% [12315 / 52343, 1237 ins, 1620 del, 9458 sub ]
102
+ 2023-01-09 01:20:30,446 INFO [decode.py:491] Wrote detailed error stats to tiny_transducer_ctc/exp_2m_bpe500_halfdelay/modified_beam_search/errs-test-other-beam_size_4-epoch-30-avg-2-modified_beam_search-beam-size-4-uam.txt
103
+ 2023-01-09 01:20:30,446 INFO [decode.py:508]
104
+ For test-other, WER of different settings are:
105
+ beam_size_4 23.53 best for test-other
106
+
107
+ 2023-01-09 01:20:30,446 INFO [decode.py:730] Done!
middle_bpe_500/decode_results/modified_beam_search/recogs-test-clean-beam_size_4-epoch-30-avg-2-modified_beam_search-beam-size-4-uam.txt ADDED
The diff for this file is too large to render. See raw diff
 
middle_bpe_500/decode_results/modified_beam_search/recogs-test-other-beam_size_4-epoch-30-avg-2-modified_beam_search-beam-size-4-uam.txt ADDED
The diff for this file is too large to render. See raw diff
 
middle_bpe_500/decode_results/modified_beam_search/wer-summary-test-clean-beam_size_4-epoch-30-avg-2-modified_beam_search-beam-size-4-uam.txt ADDED
@@ -0,0 +1,2 @@
 
 
 
1
+ settings WER
2
+ beam_size_4 9.43
middle_bpe_500/decode_results/modified_beam_search/wer-summary-test-other-beam_size_4-epoch-30-avg-2-modified_beam_search-beam-size-4-uam.txt ADDED
@@ -0,0 +1,2 @@
 
 
 
1
+ settings WER
2
+ beam_size_4 23.53
middle_bpe_500/exp/cpu_jit.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:dc21aa9bb4d0233243f57c76f52847e4c55f11b97cd71ae2e18128e30214b16d
3
+ size 11207430
middle_bpe_500/exp/log/log-train-2023-01-06-07-16-15 ADDED
The diff for this file is too large to render. See raw diff