Tiance Wang
commited on
Commit
·
2c2e885
1
Parent(s):
249fb17
upload models and results
Browse filesThis view is limited to 50 files because it contains too many changes.
See raw diff
- data/lang_bpe_500/bpe.model +3 -0
- data/lang_phone/uniq_lexicon.txt +0 -0
- large_bpe_500/decode_results/fast_beam_search/errs-test-clean-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam.txt +0 -0
- large_bpe_500/decode_results/fast_beam_search/errs-test-other-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam.txt +0 -0
- large_bpe_500/decode_results/fast_beam_search/log-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam-2023-01-16-09-53-08 +107 -0
- large_bpe_500/decode_results/fast_beam_search/recogs-test-clean-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam.txt +0 -0
- large_bpe_500/decode_results/fast_beam_search/recogs-test-other-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam.txt +0 -0
- large_bpe_500/decode_results/fast_beam_search/wer-summary-test-clean-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam.txt +2 -0
- large_bpe_500/decode_results/fast_beam_search/wer-summary-test-other-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam.txt +2 -0
- large_bpe_500/decode_results/greedy_search/errs-test-clean-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt +0 -0
- large_bpe_500/decode_results/greedy_search/errs-test-other-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt +0 -0
- large_bpe_500/decode_results/greedy_search/log-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam-2023-01-16-09-52-16 +103 -0
- large_bpe_500/decode_results/greedy_search/recogs-test-clean-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt +0 -0
- large_bpe_500/decode_results/greedy_search/recogs-test-other-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt +0 -0
- large_bpe_500/decode_results/greedy_search/wer-summary-test-clean-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt +2 -0
- large_bpe_500/decode_results/greedy_search/wer-summary-test-other-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt +2 -0
- large_bpe_500/decode_results/modified_beam_search/errs-test-clean-beam_size_4-epoch-30-avg-1-modified_beam_search-beam-size-4-uam.txt +0 -0
- large_bpe_500/decode_results/modified_beam_search/errs-test-other-beam_size_4-epoch-30-avg-1-modified_beam_search-beam-size-4-uam.txt +0 -0
- large_bpe_500/decode_results/modified_beam_search/log-epoch-30-avg-1-modified_beam_search-beam-size-4-uam-2023-01-16-09-54-54 +107 -0
- large_bpe_500/decode_results/modified_beam_search/recogs-test-clean-beam_size_4-epoch-30-avg-1-modified_beam_search-beam-size-4-uam.txt +0 -0
- large_bpe_500/decode_results/modified_beam_search/recogs-test-other-beam_size_4-epoch-30-avg-1-modified_beam_search-beam-size-4-uam.txt +0 -0
- large_bpe_500/decode_results/modified_beam_search/wer-summary-test-clean-beam_size_4-epoch-30-avg-1-modified_beam_search-beam-size-4-uam.txt +2 -0
- large_bpe_500/decode_results/modified_beam_search/wer-summary-test-other-beam_size_4-epoch-30-avg-1-modified_beam_search-beam-size-4-uam.txt +2 -0
- large_bpe_500/exp/cpu_jit.pt +3 -0
- large_bpe_500/exp/log/log-train-2023-01-09-06-23-27 +0 -0
- large_bpe_500/exp/pretrained.pt +3 -0
- large_bpe_500/exp/tensorboard/events.out.tfevents.1673245407.kao-dgxa-f12-u17.4075883.0 +3 -0
- middle_bpe_500/decode_results/fast_beam_search/errs-test-clean-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam.txt +0 -0
- middle_bpe_500/decode_results/fast_beam_search/errs-test-other-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam.txt +0 -0
- middle_bpe_500/decode_results/fast_beam_search/log-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam-2023-01-09-01-15-28 +107 -0
- middle_bpe_500/decode_results/fast_beam_search/recogs-test-clean-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam.txt +0 -0
- middle_bpe_500/decode_results/fast_beam_search/recogs-test-other-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam.txt +0 -0
- middle_bpe_500/decode_results/fast_beam_search/wer-summary-test-clean-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam.txt +2 -0
- middle_bpe_500/decode_results/fast_beam_search/wer-summary-test-other-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam.txt +2 -0
- middle_bpe_500/decode_results/greedy_search/errs-test-clean-greedy_search-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam.txt +0 -0
- middle_bpe_500/decode_results/greedy_search/errs-test-other-greedy_search-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam.txt +0 -0
- middle_bpe_500/decode_results/greedy_search/log-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam-2023-01-09-01-12-47 +103 -0
- middle_bpe_500/decode_results/greedy_search/recogs-test-clean-greedy_search-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam.txt +0 -0
- middle_bpe_500/decode_results/greedy_search/recogs-test-other-greedy_search-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam.txt +0 -0
- middle_bpe_500/decode_results/greedy_search/wer-summary-test-clean-greedy_search-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam.txt +2 -0
- middle_bpe_500/decode_results/greedy_search/wer-summary-test-other-greedy_search-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam.txt +2 -0
- middle_bpe_500/decode_results/modified_beam_search/errs-test-clean-beam_size_4-epoch-30-avg-2-modified_beam_search-beam-size-4-uam.txt +0 -0
- middle_bpe_500/decode_results/modified_beam_search/errs-test-other-beam_size_4-epoch-30-avg-2-modified_beam_search-beam-size-4-uam.txt +0 -0
- middle_bpe_500/decode_results/modified_beam_search/log-epoch-30-avg-2-modified_beam_search-beam-size-4-uam-2023-01-09-01-17-06 +107 -0
- middle_bpe_500/decode_results/modified_beam_search/recogs-test-clean-beam_size_4-epoch-30-avg-2-modified_beam_search-beam-size-4-uam.txt +0 -0
- middle_bpe_500/decode_results/modified_beam_search/recogs-test-other-beam_size_4-epoch-30-avg-2-modified_beam_search-beam-size-4-uam.txt +0 -0
- middle_bpe_500/decode_results/modified_beam_search/wer-summary-test-clean-beam_size_4-epoch-30-avg-2-modified_beam_search-beam-size-4-uam.txt +2 -0
- middle_bpe_500/decode_results/modified_beam_search/wer-summary-test-other-beam_size_4-epoch-30-avg-2-modified_beam_search-beam-size-4-uam.txt +2 -0
- middle_bpe_500/exp/cpu_jit.pt +3 -0
- middle_bpe_500/exp/log/log-train-2023-01-06-07-16-15 +0 -0
data/lang_bpe_500/bpe.model
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5db9f109de8776c78b1a52bc69a9694acc09bb4c11373e61794e20b24f6e244d
|
3 |
+
size 244891
|
data/lang_phone/uniq_lexicon.txt
ADDED
The diff for this file is too large to render.
See raw diff
|
|
large_bpe_500/decode_results/fast_beam_search/errs-test-clean-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam.txt
ADDED
The diff for this file is too large to render.
See raw diff
|
|
large_bpe_500/decode_results/fast_beam_search/errs-test-other-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam.txt
ADDED
The diff for this file is too large to render.
See raw diff
|
|
large_bpe_500/decode_results/fast_beam_search/log-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam-2023-01-16-09-53-08
ADDED
@@ -0,0 +1,107 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
2023-01-16 09:53:08,412 INFO [decode.py:565] Decoding started
|
2 |
+
2023-01-16 09:53:08,412 INFO [decode.py:571] Device: cuda:0
|
3 |
+
2023-01-16 09:53:08,589 INFO [lexicon.py:168] Loading pre-compiled data/lang_bpe_500/Linv.pt
|
4 |
+
2023-01-16 09:53:08,631 INFO [decode.py:588] { 'activation': 'doubleswish',
|
5 |
+
'avg': 1,
|
6 |
+
'batch_idx_train': 0,
|
7 |
+
'beam': 20.0,
|
8 |
+
'beam_size': 4,
|
9 |
+
'best_train_epoch': -1,
|
10 |
+
'best_train_loss': inf,
|
11 |
+
'best_valid_epoch': -1,
|
12 |
+
'best_valid_loss': inf,
|
13 |
+
'blank_id': 0,
|
14 |
+
'bucketing_sampler': True,
|
15 |
+
'channels': 400,
|
16 |
+
'concatenate_cuts': False,
|
17 |
+
'context_size': 2,
|
18 |
+
'conv_layers': 18,
|
19 |
+
'decoder_dim': 400,
|
20 |
+
'decoding_method': 'fast_beam_search',
|
21 |
+
'drop_last': True,
|
22 |
+
'duration_factor': 1.0,
|
23 |
+
'enable_musan': True,
|
24 |
+
'enable_spec_aug': True,
|
25 |
+
'encoder_dim': 400,
|
26 |
+
'env_info': { 'IP address': '127.0.1.1',
|
27 |
+
'hostname': 'kao-dgxa-f12-u17',
|
28 |
+
'icefall-git-branch': 'tiny',
|
29 |
+
'icefall-git-date': 'Fri Jan 13 07:21:29 2023',
|
30 |
+
'icefall-git-sha1': '5c8e962-dirty',
|
31 |
+
'icefall-path': '/home/jsong/git/icefall',
|
32 |
+
'k2-build-type': 'Release',
|
33 |
+
'k2-git-date': 'Fri Nov 25 08:23:51 2022',
|
34 |
+
'k2-git-sha1': '1feafa064cf3b6c243e6b33b0192601224210937',
|
35 |
+
'k2-path': '/home/jsong/miniconda3/envs/k2/lib/python3.9/site-packages/k2/__init__.py',
|
36 |
+
'k2-version': '1.23.2',
|
37 |
+
'k2-with-cuda': True,
|
38 |
+
'lhotse-path': '/home/jsong/miniconda3/envs/k2/lib/python3.9/site-packages/lhotse/__init__.py',
|
39 |
+
'lhotse-version': '1.7.0',
|
40 |
+
'python-version': '3.9',
|
41 |
+
'torch-cuda-available': True,
|
42 |
+
'torch-cuda-version': '11.3',
|
43 |
+
'torch-version': '1.12.0'},
|
44 |
+
'epoch': 30,
|
45 |
+
'exp_dir': PosixPath('tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug'),
|
46 |
+
'feature_dim': 80,
|
47 |
+
'full_libri': True,
|
48 |
+
'gap': 1.0,
|
49 |
+
'input_strategy': 'PrecomputedFeatures',
|
50 |
+
'iter': 0,
|
51 |
+
'joiner_dim': 400,
|
52 |
+
'lang_dir': PosixPath('data/lang_bpe_500'),
|
53 |
+
'log_interval': 500,
|
54 |
+
'manifest_dir': PosixPath('data/fbank'),
|
55 |
+
'max_contexts': 8,
|
56 |
+
'max_duration': 600,
|
57 |
+
'max_states': 64,
|
58 |
+
'max_sym_per_frame': 1,
|
59 |
+
'nbest_scale': 0.5,
|
60 |
+
'ngram_lm_scale': 0.1,
|
61 |
+
'num_buckets': 30,
|
62 |
+
'num_paths': 100,
|
63 |
+
'num_workers': 2,
|
64 |
+
'on_the_fly_feats': False,
|
65 |
+
'res_dir': PosixPath('tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/fast_beam_search'),
|
66 |
+
'reset_interval': 200,
|
67 |
+
'return_cuts': True,
|
68 |
+
'shuffle': True,
|
69 |
+
'skip_add': True,
|
70 |
+
'spec_aug_time_warp_factor': 80,
|
71 |
+
'subsampling_factor': 4,
|
72 |
+
'suffix': 'epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam',
|
73 |
+
'unk_id': 2,
|
74 |
+
'use_averaged_model': True,
|
75 |
+
'use_double_scores': True,
|
76 |
+
'use_dscnn': True,
|
77 |
+
'valid_interval': 9000,
|
78 |
+
'vocab_size': 500,
|
79 |
+
'warm_step': 5000}
|
80 |
+
2023-01-16 09:53:08,631 INFO [decode.py:590] About to create model
|
81 |
+
2023-01-16 09:53:11,209 INFO [train.py:426] Encoder MAC ops for 10 seconds of audio is 888.44M
|
82 |
+
2023-01-16 09:53:11,213 INFO [decode.py:659] Calculating the averaged model over epoch range from 29 (excluded) to 30
|
83 |
+
2023-01-16 09:53:11,613 INFO [decode.py:697] Number of model parameters: 4821330
|
84 |
+
2023-01-16 09:53:11,614 INFO [decode.py:698] Parameters for transducer decoding: 4219830
|
85 |
+
2023-01-16 09:53:11,614 INFO [asr_datamodule.py:449] About to get test-clean cuts
|
86 |
+
2023-01-16 09:53:11,617 INFO [asr_datamodule.py:456] About to get test-other cuts
|
87 |
+
2023-01-16 09:53:13,953 INFO [decode.py:459] batch 0/?, cuts processed until now is 43
|
88 |
+
2023-01-16 09:53:32,617 INFO [decode.py:459] batch 20/?, cuts processed until now is 1430
|
89 |
+
2023-01-16 09:53:53,226 INFO [decode.py:459] batch 40/?, cuts processed until now is 2561
|
90 |
+
2023-01-16 09:53:53,817 INFO [decode.py:478] The transcripts are stored in tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/fast_beam_search/recogs-test-clean-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam.txt
|
91 |
+
2023-01-16 09:53:53,883 INFO [utils.py:536] [test-clean-beam_20.0_max_contexts_8_max_states_64] %WER 7.91% [4160 / 52576, 482 ins, 411 del, 3267 sub ]
|
92 |
+
2023-01-16 09:53:54,026 INFO [decode.py:491] Wrote detailed error stats to tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/fast_beam_search/errs-test-clean-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam.txt
|
93 |
+
2023-01-16 09:53:54,026 INFO [decode.py:508]
|
94 |
+
For test-clean, WER of different settings are:
|
95 |
+
beam_20.0_max_contexts_8_max_states_64 7.91 best for test-clean
|
96 |
+
|
97 |
+
2023-01-16 09:53:55,228 INFO [decode.py:459] batch 0/?, cuts processed until now is 52
|
98 |
+
2023-01-16 09:54:12,536 INFO [decode.py:459] batch 20/?, cuts processed until now is 1646
|
99 |
+
2023-01-16 09:54:31,673 INFO [decode.py:459] batch 40/?, cuts processed until now is 2870
|
100 |
+
2023-01-16 09:54:32,255 INFO [decode.py:478] The transcripts are stored in tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/fast_beam_search/recogs-test-other-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam.txt
|
101 |
+
2023-01-16 09:54:32,325 INFO [utils.py:536] [test-other-beam_20.0_max_contexts_8_max_states_64] %WER 20.10% [10521 / 52343, 1011 ins, 1417 del, 8093 sub ]
|
102 |
+
2023-01-16 09:54:32,481 INFO [decode.py:491] Wrote detailed error stats to tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/fast_beam_search/errs-test-other-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam.txt
|
103 |
+
2023-01-16 09:54:32,481 INFO [decode.py:508]
|
104 |
+
For test-other, WER of different settings are:
|
105 |
+
beam_20.0_max_contexts_8_max_states_64 20.1 best for test-other
|
106 |
+
|
107 |
+
2023-01-16 09:54:32,481 INFO [decode.py:730] Done!
|
large_bpe_500/decode_results/fast_beam_search/recogs-test-clean-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam.txt
ADDED
The diff for this file is too large to render.
See raw diff
|
|
large_bpe_500/decode_results/fast_beam_search/recogs-test-other-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam.txt
ADDED
The diff for this file is too large to render.
See raw diff
|
|
large_bpe_500/decode_results/fast_beam_search/wer-summary-test-clean-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam.txt
ADDED
@@ -0,0 +1,2 @@
|
|
|
|
|
|
|
1 |
+
settings WER
|
2 |
+
beam_20.0_max_contexts_8_max_states_64 7.91
|
large_bpe_500/decode_results/fast_beam_search/wer-summary-test-other-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-1-beam-20.0-max-contexts-8-max-states-64-uam.txt
ADDED
@@ -0,0 +1,2 @@
|
|
|
|
|
|
|
1 |
+
settings WER
|
2 |
+
beam_20.0_max_contexts_8_max_states_64 20.1
|
large_bpe_500/decode_results/greedy_search/errs-test-clean-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt
ADDED
The diff for this file is too large to render.
See raw diff
|
|
large_bpe_500/decode_results/greedy_search/errs-test-other-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt
ADDED
The diff for this file is too large to render.
See raw diff
|
|
large_bpe_500/decode_results/greedy_search/log-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam-2023-01-16-09-52-16
ADDED
@@ -0,0 +1,103 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
2023-01-16 09:52:16,827 INFO [decode.py:565] Decoding started
|
2 |
+
2023-01-16 09:52:16,827 INFO [decode.py:571] Device: cuda:0
|
3 |
+
2023-01-16 09:52:17,018 INFO [lexicon.py:168] Loading pre-compiled data/lang_bpe_500/Linv.pt
|
4 |
+
2023-01-16 09:52:17,057 INFO [decode.py:588] { 'activation': 'doubleswish',
|
5 |
+
'avg': 1,
|
6 |
+
'batch_idx_train': 0,
|
7 |
+
'beam': 20.0,
|
8 |
+
'beam_size': 4,
|
9 |
+
'best_train_epoch': -1,
|
10 |
+
'best_train_loss': inf,
|
11 |
+
'best_valid_epoch': -1,
|
12 |
+
'best_valid_loss': inf,
|
13 |
+
'blank_id': 0,
|
14 |
+
'bucketing_sampler': True,
|
15 |
+
'channels': 400,
|
16 |
+
'concatenate_cuts': False,
|
17 |
+
'context_size': 2,
|
18 |
+
'conv_layers': 18,
|
19 |
+
'decoder_dim': 400,
|
20 |
+
'decoding_method': 'greedy_search',
|
21 |
+
'drop_last': True,
|
22 |
+
'duration_factor': 1.0,
|
23 |
+
'enable_musan': True,
|
24 |
+
'enable_spec_aug': True,
|
25 |
+
'encoder_dim': 400,
|
26 |
+
'env_info': { 'IP address': '127.0.1.1',
|
27 |
+
'hostname': 'kao-dgxa-f12-u17',
|
28 |
+
'icefall-git-branch': 'tiny',
|
29 |
+
'icefall-git-date': 'Fri Jan 13 07:21:29 2023',
|
30 |
+
'icefall-git-sha1': '5c8e962-dirty',
|
31 |
+
'icefall-path': '/home/jsong/git/icefall',
|
32 |
+
'k2-build-type': 'Release',
|
33 |
+
'k2-git-date': 'Fri Nov 25 08:23:51 2022',
|
34 |
+
'k2-git-sha1': '1feafa064cf3b6c243e6b33b0192601224210937',
|
35 |
+
'k2-path': '/home/jsong/miniconda3/envs/k2/lib/python3.9/site-packages/k2/__init__.py',
|
36 |
+
'k2-version': '1.23.2',
|
37 |
+
'k2-with-cuda': True,
|
38 |
+
'lhotse-path': '/home/jsong/miniconda3/envs/k2/lib/python3.9/site-packages/lhotse/__init__.py',
|
39 |
+
'lhotse-version': '1.7.0',
|
40 |
+
'python-version': '3.9',
|
41 |
+
'torch-cuda-available': True,
|
42 |
+
'torch-cuda-version': '11.3',
|
43 |
+
'torch-version': '1.12.0'},
|
44 |
+
'epoch': 30,
|
45 |
+
'exp_dir': PosixPath('tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug'),
|
46 |
+
'feature_dim': 80,
|
47 |
+
'full_libri': True,
|
48 |
+
'gap': 1.0,
|
49 |
+
'input_strategy': 'PrecomputedFeatures',
|
50 |
+
'iter': 0,
|
51 |
+
'joiner_dim': 400,
|
52 |
+
'lang_dir': PosixPath('data/lang_bpe_500'),
|
53 |
+
'log_interval': 500,
|
54 |
+
'manifest_dir': PosixPath('data/fbank'),
|
55 |
+
'max_contexts': 8,
|
56 |
+
'max_duration': 600,
|
57 |
+
'max_states': 64,
|
58 |
+
'max_sym_per_frame': 1,
|
59 |
+
'nbest_scale': 0.5,
|
60 |
+
'ngram_lm_scale': 0.1,
|
61 |
+
'num_buckets': 30,
|
62 |
+
'num_paths': 100,
|
63 |
+
'num_workers': 2,
|
64 |
+
'on_the_fly_feats': False,
|
65 |
+
'res_dir': PosixPath('tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/greedy_search'),
|
66 |
+
'reset_interval': 200,
|
67 |
+
'return_cuts': True,
|
68 |
+
'shuffle': True,
|
69 |
+
'skip_add': True,
|
70 |
+
'spec_aug_time_warp_factor': 80,
|
71 |
+
'subsampling_factor': 4,
|
72 |
+
'suffix': 'epoch-30-avg-1-context-2-max-sym-per-frame-1-uam',
|
73 |
+
'unk_id': 2,
|
74 |
+
'use_averaged_model': True,
|
75 |
+
'use_double_scores': True,
|
76 |
+
'use_dscnn': True,
|
77 |
+
'valid_interval': 9000,
|
78 |
+
'vocab_size': 500,
|
79 |
+
'warm_step': 5000}
|
80 |
+
2023-01-16 09:52:17,058 INFO [decode.py:590] About to create model
|
81 |
+
2023-01-16 09:52:19,589 INFO [train.py:426] Encoder MAC ops for 10 seconds of audio is 888.44M
|
82 |
+
2023-01-16 09:52:19,594 INFO [decode.py:659] Calculating the averaged model over epoch range from 29 (excluded) to 30
|
83 |
+
2023-01-16 09:52:19,977 INFO [decode.py:697] Number of model parameters: 4821330
|
84 |
+
2023-01-16 09:52:19,977 INFO [decode.py:698] Parameters for transducer decoding: 4219830
|
85 |
+
2023-01-16 09:52:19,977 INFO [asr_datamodule.py:449] About to get test-clean cuts
|
86 |
+
2023-01-16 09:52:19,978 INFO [asr_datamodule.py:456] About to get test-other cuts
|
87 |
+
2023-01-16 09:52:21,770 INFO [decode.py:459] batch 0/?, cuts processed until now is 43
|
88 |
+
2023-01-16 09:52:39,401 INFO [decode.py:478] The transcripts are stored in tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/greedy_search/recogs-test-clean-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt
|
89 |
+
2023-01-16 09:52:39,467 INFO [utils.py:536] [test-clean-greedy_search] %WER 8.29% [4359 / 52576, 501 ins, 443 del, 3415 sub ]
|
90 |
+
2023-01-16 09:52:39,617 INFO [decode.py:491] Wrote detailed error stats to tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/greedy_search/errs-test-clean-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt
|
91 |
+
2023-01-16 09:52:39,618 INFO [decode.py:508]
|
92 |
+
For test-clean, WER of different settings are:
|
93 |
+
greedy_search 8.29 best for test-clean
|
94 |
+
|
95 |
+
2023-01-16 09:52:40,281 INFO [decode.py:459] batch 0/?, cuts processed until now is 52
|
96 |
+
2023-01-16 09:52:56,315 INFO [decode.py:478] The transcripts are stored in tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/greedy_search/recogs-test-other-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt
|
97 |
+
2023-01-16 09:52:56,386 INFO [utils.py:536] [test-other-greedy_search] %WER 21.11% [11052 / 52343, 1006 ins, 1534 del, 8512 sub ]
|
98 |
+
2023-01-16 09:52:56,547 INFO [decode.py:491] Wrote detailed error stats to tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/greedy_search/errs-test-other-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt
|
99 |
+
2023-01-16 09:52:56,547 INFO [decode.py:508]
|
100 |
+
For test-other, WER of different settings are:
|
101 |
+
greedy_search 21.11 best for test-other
|
102 |
+
|
103 |
+
2023-01-16 09:52:56,547 INFO [decode.py:730] Done!
|
large_bpe_500/decode_results/greedy_search/recogs-test-clean-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt
ADDED
The diff for this file is too large to render.
See raw diff
|
|
large_bpe_500/decode_results/greedy_search/recogs-test-other-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt
ADDED
The diff for this file is too large to render.
See raw diff
|
|
large_bpe_500/decode_results/greedy_search/wer-summary-test-clean-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt
ADDED
@@ -0,0 +1,2 @@
|
|
|
|
|
|
|
1 |
+
settings WER
|
2 |
+
greedy_search 8.29
|
large_bpe_500/decode_results/greedy_search/wer-summary-test-other-greedy_search-epoch-30-avg-1-context-2-max-sym-per-frame-1-uam.txt
ADDED
@@ -0,0 +1,2 @@
|
|
|
|
|
|
|
1 |
+
settings WER
|
2 |
+
greedy_search 21.11
|
large_bpe_500/decode_results/modified_beam_search/errs-test-clean-beam_size_4-epoch-30-avg-1-modified_beam_search-beam-size-4-uam.txt
ADDED
The diff for this file is too large to render.
See raw diff
|
|
large_bpe_500/decode_results/modified_beam_search/errs-test-other-beam_size_4-epoch-30-avg-1-modified_beam_search-beam-size-4-uam.txt
ADDED
The diff for this file is too large to render.
See raw diff
|
|
large_bpe_500/decode_results/modified_beam_search/log-epoch-30-avg-1-modified_beam_search-beam-size-4-uam-2023-01-16-09-54-54
ADDED
@@ -0,0 +1,107 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
2023-01-16 09:54:54,538 INFO [decode.py:565] Decoding started
|
2 |
+
2023-01-16 09:54:54,538 INFO [decode.py:571] Device: cuda:0
|
3 |
+
2023-01-16 09:54:54,718 INFO [lexicon.py:168] Loading pre-compiled data/lang_bpe_500/Linv.pt
|
4 |
+
2023-01-16 09:54:54,758 INFO [decode.py:588] { 'activation': 'doubleswish',
|
5 |
+
'avg': 1,
|
6 |
+
'batch_idx_train': 0,
|
7 |
+
'beam': 20.0,
|
8 |
+
'beam_size': 4,
|
9 |
+
'best_train_epoch': -1,
|
10 |
+
'best_train_loss': inf,
|
11 |
+
'best_valid_epoch': -1,
|
12 |
+
'best_valid_loss': inf,
|
13 |
+
'blank_id': 0,
|
14 |
+
'bucketing_sampler': True,
|
15 |
+
'channels': 400,
|
16 |
+
'concatenate_cuts': False,
|
17 |
+
'context_size': 2,
|
18 |
+
'conv_layers': 18,
|
19 |
+
'decoder_dim': 400,
|
20 |
+
'decoding_method': 'modified_beam_search',
|
21 |
+
'drop_last': True,
|
22 |
+
'duration_factor': 1.0,
|
23 |
+
'enable_musan': True,
|
24 |
+
'enable_spec_aug': True,
|
25 |
+
'encoder_dim': 400,
|
26 |
+
'env_info': { 'IP address': '127.0.1.1',
|
27 |
+
'hostname': 'kao-dgxa-f12-u17',
|
28 |
+
'icefall-git-branch': 'tiny',
|
29 |
+
'icefall-git-date': 'Fri Jan 13 07:21:29 2023',
|
30 |
+
'icefall-git-sha1': '5c8e962-dirty',
|
31 |
+
'icefall-path': '/home/jsong/git/icefall',
|
32 |
+
'k2-build-type': 'Release',
|
33 |
+
'k2-git-date': 'Fri Nov 25 08:23:51 2022',
|
34 |
+
'k2-git-sha1': '1feafa064cf3b6c243e6b33b0192601224210937',
|
35 |
+
'k2-path': '/home/jsong/miniconda3/envs/k2/lib/python3.9/site-packages/k2/__init__.py',
|
36 |
+
'k2-version': '1.23.2',
|
37 |
+
'k2-with-cuda': True,
|
38 |
+
'lhotse-path': '/home/jsong/miniconda3/envs/k2/lib/python3.9/site-packages/lhotse/__init__.py',
|
39 |
+
'lhotse-version': '1.7.0',
|
40 |
+
'python-version': '3.9',
|
41 |
+
'torch-cuda-available': True,
|
42 |
+
'torch-cuda-version': '11.3',
|
43 |
+
'torch-version': '1.12.0'},
|
44 |
+
'epoch': 30,
|
45 |
+
'exp_dir': PosixPath('tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug'),
|
46 |
+
'feature_dim': 80,
|
47 |
+
'full_libri': True,
|
48 |
+
'gap': 1.0,
|
49 |
+
'input_strategy': 'PrecomputedFeatures',
|
50 |
+
'iter': 0,
|
51 |
+
'joiner_dim': 400,
|
52 |
+
'lang_dir': PosixPath('data/lang_bpe_500'),
|
53 |
+
'log_interval': 500,
|
54 |
+
'manifest_dir': PosixPath('data/fbank'),
|
55 |
+
'max_contexts': 8,
|
56 |
+
'max_duration': 600,
|
57 |
+
'max_states': 64,
|
58 |
+
'max_sym_per_frame': 1,
|
59 |
+
'nbest_scale': 0.5,
|
60 |
+
'ngram_lm_scale': 0.1,
|
61 |
+
'num_buckets': 30,
|
62 |
+
'num_paths': 100,
|
63 |
+
'num_workers': 2,
|
64 |
+
'on_the_fly_feats': False,
|
65 |
+
'res_dir': PosixPath('tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/modified_beam_search'),
|
66 |
+
'reset_interval': 200,
|
67 |
+
'return_cuts': True,
|
68 |
+
'shuffle': True,
|
69 |
+
'skip_add': True,
|
70 |
+
'spec_aug_time_warp_factor': 80,
|
71 |
+
'subsampling_factor': 4,
|
72 |
+
'suffix': 'epoch-30-avg-1-modified_beam_search-beam-size-4-uam',
|
73 |
+
'unk_id': 2,
|
74 |
+
'use_averaged_model': True,
|
75 |
+
'use_double_scores': True,
|
76 |
+
'use_dscnn': True,
|
77 |
+
'valid_interval': 9000,
|
78 |
+
'vocab_size': 500,
|
79 |
+
'warm_step': 5000}
|
80 |
+
2023-01-16 09:54:54,758 INFO [decode.py:590] About to create model
|
81 |
+
2023-01-16 09:54:57,321 INFO [train.py:426] Encoder MAC ops for 10 seconds of audio is 888.44M
|
82 |
+
2023-01-16 09:54:57,325 INFO [decode.py:659] Calculating the averaged model over epoch range from 29 (excluded) to 30
|
83 |
+
2023-01-16 09:54:57,706 INFO [decode.py:697] Number of model parameters: 4821330
|
84 |
+
2023-01-16 09:54:57,706 INFO [decode.py:698] Parameters for transducer decoding: 4219830
|
85 |
+
2023-01-16 09:54:57,706 INFO [asr_datamodule.py:449] About to get test-clean cuts
|
86 |
+
2023-01-16 09:54:57,707 INFO [asr_datamodule.py:456] About to get test-other cuts
|
87 |
+
2023-01-16 09:55:01,888 INFO [decode.py:459] batch 0/?, cuts processed until now is 43
|
88 |
+
2023-01-16 09:55:55,346 INFO [decode.py:459] batch 20/?, cuts processed until now is 1430
|
89 |
+
2023-01-16 09:56:40,761 INFO [decode.py:459] batch 40/?, cuts processed until now is 2561
|
90 |
+
2023-01-16 09:56:42,416 INFO [decode.py:478] The transcripts are stored in tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/modified_beam_search/recogs-test-clean-beam_size_4-epoch-30-avg-1-modified_beam_search-beam-size-4-uam.txt
|
91 |
+
2023-01-16 09:56:42,520 INFO [utils.py:536] [test-clean-beam_size_4] %WER 7.74% [4072 / 52576, 492 ins, 384 del, 3196 sub ]
|
92 |
+
2023-01-16 09:56:42,664 INFO [decode.py:491] Wrote detailed error stats to tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/modified_beam_search/errs-test-clean-beam_size_4-epoch-30-avg-1-modified_beam_search-beam-size-4-uam.txt
|
93 |
+
2023-01-16 09:56:42,664 INFO [decode.py:508]
|
94 |
+
For test-clean, WER of different settings are:
|
95 |
+
beam_size_4 7.74 best for test-clean
|
96 |
+
|
97 |
+
2023-01-16 09:56:45,681 INFO [decode.py:459] batch 0/?, cuts processed until now is 52
|
98 |
+
2023-01-16 09:57:37,910 INFO [decode.py:459] batch 20/?, cuts processed until now is 1646
|
99 |
+
2023-01-16 09:58:20,895 INFO [decode.py:459] batch 40/?, cuts processed until now is 2870
|
100 |
+
2023-01-16 09:58:22,601 INFO [decode.py:478] The transcripts are stored in tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/modified_beam_search/recogs-test-other-beam_size_4-epoch-30-avg-1-modified_beam_search-beam-size-4-uam.txt
|
101 |
+
2023-01-16 09:58:22,672 INFO [utils.py:536] [test-other-beam_size_4] %WER 19.89% [10412 / 52343, 1054 ins, 1301 del, 8057 sub ]
|
102 |
+
2023-01-16 09:58:22,832 INFO [decode.py:491] Wrote detailed error stats to tiny_transducer_ctc/exp_4m_bpe500_halfdelay_specaug/modified_beam_search/errs-test-other-beam_size_4-epoch-30-avg-1-modified_beam_search-beam-size-4-uam.txt
|
103 |
+
2023-01-16 09:58:22,832 INFO [decode.py:508]
|
104 |
+
For test-other, WER of different settings are:
|
105 |
+
beam_size_4 19.89 best for test-other
|
106 |
+
|
107 |
+
2023-01-16 09:58:22,832 INFO [decode.py:730] Done!
|
large_bpe_500/decode_results/modified_beam_search/recogs-test-clean-beam_size_4-epoch-30-avg-1-modified_beam_search-beam-size-4-uam.txt
ADDED
The diff for this file is too large to render.
See raw diff
|
|
large_bpe_500/decode_results/modified_beam_search/recogs-test-other-beam_size_4-epoch-30-avg-1-modified_beam_search-beam-size-4-uam.txt
ADDED
The diff for this file is too large to render.
See raw diff
|
|
large_bpe_500/decode_results/modified_beam_search/wer-summary-test-clean-beam_size_4-epoch-30-avg-1-modified_beam_search-beam-size-4-uam.txt
ADDED
@@ -0,0 +1,2 @@
|
|
|
|
|
|
|
1 |
+
settings WER
|
2 |
+
beam_size_4 7.74
|
large_bpe_500/decode_results/modified_beam_search/wer-summary-test-other-beam_size_4-epoch-30-avg-1-modified_beam_search-beam-size-4-uam.txt
ADDED
@@ -0,0 +1,2 @@
|
|
|
|
|
|
|
1 |
+
settings WER
|
2 |
+
beam_size_4 19.89
|
large_bpe_500/exp/cpu_jit.pt
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:20897e2653f7130fc2c1a52621cf01bb990010659c886124c961620c58d99d5f
|
3 |
+
size 19576790
|
large_bpe_500/exp/log/log-train-2023-01-09-06-23-27
ADDED
The diff for this file is too large to render.
See raw diff
|
|
large_bpe_500/exp/pretrained.pt
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:be438679835e07dc63718fe490fd838cc532d04f0e51248314f116676bd10851
|
3 |
+
size 19539685
|
large_bpe_500/exp/tensorboard/events.out.tfevents.1673245407.kao-dgxa-f12-u17.4075883.0
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:7694fbf23c8fd399fd58b484a3837fa8f38ac8db1a098dd40e2512677a338e04
|
3 |
+
size 618506
|
middle_bpe_500/decode_results/fast_beam_search/errs-test-clean-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam.txt
ADDED
The diff for this file is too large to render.
See raw diff
|
|
middle_bpe_500/decode_results/fast_beam_search/errs-test-other-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam.txt
ADDED
The diff for this file is too large to render.
See raw diff
|
|
middle_bpe_500/decode_results/fast_beam_search/log-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam-2023-01-09-01-15-28
ADDED
@@ -0,0 +1,107 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
2023-01-09 01:15:28,211 INFO [decode.py:565] Decoding started
|
2 |
+
2023-01-09 01:15:28,211 INFO [decode.py:571] Device: cuda:0
|
3 |
+
2023-01-09 01:15:28,390 INFO [lexicon.py:168] Loading pre-compiled data/lang_bpe_500/Linv.pt
|
4 |
+
2023-01-09 01:15:28,425 INFO [decode.py:588] { 'activation': 'doubleswish',
|
5 |
+
'avg': 2,
|
6 |
+
'batch_idx_train': 0,
|
7 |
+
'beam': 20.0,
|
8 |
+
'beam_size': 4,
|
9 |
+
'best_train_epoch': -1,
|
10 |
+
'best_train_loss': inf,
|
11 |
+
'best_valid_epoch': -1,
|
12 |
+
'best_valid_loss': inf,
|
13 |
+
'blank_id': 0,
|
14 |
+
'bucketing_sampler': True,
|
15 |
+
'channels': 300,
|
16 |
+
'concatenate_cuts': False,
|
17 |
+
'context_size': 2,
|
18 |
+
'conv_layers': 18,
|
19 |
+
'decoder_dim': 256,
|
20 |
+
'decoding_method': 'fast_beam_search',
|
21 |
+
'drop_last': True,
|
22 |
+
'duration_factor': 1.0,
|
23 |
+
'enable_musan': True,
|
24 |
+
'enable_spec_aug': True,
|
25 |
+
'encoder_dim': 256,
|
26 |
+
'env_info': { 'IP address': '127.0.1.1',
|
27 |
+
'hostname': 'kao-dgxa-f12-u17',
|
28 |
+
'icefall-git-branch': 'tiny',
|
29 |
+
'icefall-git-date': 'Mon Jan 2 00:08:32 2023',
|
30 |
+
'icefall-git-sha1': '2fd970b-dirty',
|
31 |
+
'icefall-path': '/home/jsong/git/icefall',
|
32 |
+
'k2-build-type': 'Release',
|
33 |
+
'k2-git-date': 'Fri Nov 25 08:23:51 2022',
|
34 |
+
'k2-git-sha1': '1feafa064cf3b6c243e6b33b0192601224210937',
|
35 |
+
'k2-path': '/home/jsong/miniconda3/envs/k2/lib/python3.9/site-packages/k2/__init__.py',
|
36 |
+
'k2-version': '1.23.2',
|
37 |
+
'k2-with-cuda': True,
|
38 |
+
'lhotse-path': '/home/jsong/miniconda3/envs/k2/lib/python3.9/site-packages/lhotse/__init__.py',
|
39 |
+
'lhotse-version': '1.7.0',
|
40 |
+
'python-version': '3.9',
|
41 |
+
'torch-cuda-available': True,
|
42 |
+
'torch-cuda-version': '11.3',
|
43 |
+
'torch-version': '1.12.0'},
|
44 |
+
'epoch': 30,
|
45 |
+
'exp_dir': PosixPath('tiny_transducer_ctc/exp_2m_bpe500_halfdelay'),
|
46 |
+
'feature_dim': 80,
|
47 |
+
'full_libri': True,
|
48 |
+
'gap': 1.0,
|
49 |
+
'input_strategy': 'PrecomputedFeatures',
|
50 |
+
'iter': 0,
|
51 |
+
'joiner_dim': 256,
|
52 |
+
'lang_dir': PosixPath('data/lang_bpe_500'),
|
53 |
+
'log_interval': 500,
|
54 |
+
'manifest_dir': PosixPath('data/fbank'),
|
55 |
+
'max_contexts': 8,
|
56 |
+
'max_duration': 600,
|
57 |
+
'max_states': 64,
|
58 |
+
'max_sym_per_frame': 1,
|
59 |
+
'nbest_scale': 0.5,
|
60 |
+
'ngram_lm_scale': 0.1,
|
61 |
+
'num_buckets': 30,
|
62 |
+
'num_paths': 100,
|
63 |
+
'num_workers': 2,
|
64 |
+
'on_the_fly_feats': False,
|
65 |
+
'res_dir': PosixPath('tiny_transducer_ctc/exp_2m_bpe500_halfdelay/fast_beam_search'),
|
66 |
+
'reset_interval': 200,
|
67 |
+
'return_cuts': True,
|
68 |
+
'shuffle': True,
|
69 |
+
'skip_add': True,
|
70 |
+
'spec_aug_time_warp_factor': 80,
|
71 |
+
'subsampling_factor': 4,
|
72 |
+
'suffix': 'epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam',
|
73 |
+
'unk_id': 2,
|
74 |
+
'use_averaged_model': True,
|
75 |
+
'use_double_scores': True,
|
76 |
+
'use_dscnn': True,
|
77 |
+
'valid_interval': 9000,
|
78 |
+
'vocab_size': 500,
|
79 |
+
'warm_step': 5000}
|
80 |
+
2023-01-09 01:15:28,425 INFO [decode.py:590] About to create model
|
81 |
+
2023-01-09 01:15:30,875 INFO [train.py:425] Encoder MAC ops for 10 seconds of audio is 501.07M
|
82 |
+
2023-01-09 01:15:30,878 INFO [decode.py:659] Calculating the averaged model over epoch range from 28 (excluded) to 30
|
83 |
+
2023-01-09 01:15:31,196 INFO [decode.py:697] Number of model parameters: 2735794
|
84 |
+
2023-01-09 01:15:31,196 INFO [decode.py:698] Parameters for transducer decoding: 2350294
|
85 |
+
2023-01-09 01:15:31,196 INFO [asr_datamodule.py:443] About to get test-clean cuts
|
86 |
+
2023-01-09 01:15:31,197 INFO [asr_datamodule.py:450] About to get test-other cuts
|
87 |
+
2023-01-09 01:15:33,792 INFO [decode.py:459] batch 0/?, cuts processed until now is 43
|
88 |
+
2023-01-09 01:15:52,777 INFO [decode.py:459] batch 20/?, cuts processed until now is 1430
|
89 |
+
2023-01-09 01:16:13,995 INFO [decode.py:459] batch 40/?, cuts processed until now is 2561
|
90 |
+
2023-01-09 01:16:14,590 INFO [decode.py:478] The transcripts are stored in tiny_transducer_ctc/exp_2m_bpe500_halfdelay/fast_beam_search/recogs-test-clean-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam.txt
|
91 |
+
2023-01-09 01:16:14,663 INFO [utils.py:536] [test-clean-beam_20.0_max_contexts_8_max_states_64] %WER 9.69% [5096 / 52576, 599 ins, 487 del, 4010 sub ]
|
92 |
+
2023-01-09 01:16:14,812 INFO [decode.py:491] Wrote detailed error stats to tiny_transducer_ctc/exp_2m_bpe500_halfdelay/fast_beam_search/errs-test-clean-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam.txt
|
93 |
+
2023-01-09 01:16:14,812 INFO [decode.py:508]
|
94 |
+
For test-clean, WER of different settings are:
|
95 |
+
beam_20.0_max_contexts_8_max_states_64 9.69 best for test-clean
|
96 |
+
|
97 |
+
2023-01-09 01:16:16,043 INFO [decode.py:459] batch 0/?, cuts processed until now is 52
|
98 |
+
2023-01-09 01:16:33,449 INFO [decode.py:459] batch 20/?, cuts processed until now is 1646
|
99 |
+
2023-01-09 01:16:53,170 INFO [decode.py:459] batch 40/?, cuts processed until now is 2870
|
100 |
+
2023-01-09 01:16:53,753 INFO [decode.py:478] The transcripts are stored in tiny_transducer_ctc/exp_2m_bpe500_halfdelay/fast_beam_search/recogs-test-other-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam.txt
|
101 |
+
2023-01-09 01:16:53,825 INFO [utils.py:536] [test-other-beam_20.0_max_contexts_8_max_states_64] %WER 23.58% [12345 / 52343, 1208 ins, 1744 del, 9393 sub ]
|
102 |
+
2023-01-09 01:16:53,988 INFO [decode.py:491] Wrote detailed error stats to tiny_transducer_ctc/exp_2m_bpe500_halfdelay/fast_beam_search/errs-test-other-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam.txt
|
103 |
+
2023-01-09 01:16:53,988 INFO [decode.py:508]
|
104 |
+
For test-other, WER of different settings are:
|
105 |
+
beam_20.0_max_contexts_8_max_states_64 23.58 best for test-other
|
106 |
+
|
107 |
+
2023-01-09 01:16:53,988 INFO [decode.py:730] Done!
|
middle_bpe_500/decode_results/fast_beam_search/recogs-test-clean-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam.txt
ADDED
The diff for this file is too large to render.
See raw diff
|
|
middle_bpe_500/decode_results/fast_beam_search/recogs-test-other-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam.txt
ADDED
The diff for this file is too large to render.
See raw diff
|
|
middle_bpe_500/decode_results/fast_beam_search/wer-summary-test-clean-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam.txt
ADDED
@@ -0,0 +1,2 @@
|
|
|
|
|
|
|
1 |
+
settings WER
|
2 |
+
beam_20.0_max_contexts_8_max_states_64 9.69
|
middle_bpe_500/decode_results/fast_beam_search/wer-summary-test-other-beam_20.0_max_contexts_8_max_states_64-epoch-30-avg-2-beam-20.0-max-contexts-8-max-states-64-uam.txt
ADDED
@@ -0,0 +1,2 @@
|
|
|
|
|
|
|
1 |
+
settings WER
|
2 |
+
beam_20.0_max_contexts_8_max_states_64 23.58
|
middle_bpe_500/decode_results/greedy_search/errs-test-clean-greedy_search-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam.txt
ADDED
The diff for this file is too large to render.
See raw diff
|
|
middle_bpe_500/decode_results/greedy_search/errs-test-other-greedy_search-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam.txt
ADDED
The diff for this file is too large to render.
See raw diff
|
|
middle_bpe_500/decode_results/greedy_search/log-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam-2023-01-09-01-12-47
ADDED
@@ -0,0 +1,103 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
2023-01-09 01:12:47,525 INFO [decode.py:565] Decoding started
|
2 |
+
2023-01-09 01:12:47,525 INFO [decode.py:571] Device: cuda:0
|
3 |
+
2023-01-09 01:12:47,708 INFO [lexicon.py:168] Loading pre-compiled data/lang_bpe_500/Linv.pt
|
4 |
+
2023-01-09 01:12:47,749 INFO [decode.py:588] { 'activation': 'doubleswish',
|
5 |
+
'avg': 2,
|
6 |
+
'batch_idx_train': 0,
|
7 |
+
'beam': 20.0,
|
8 |
+
'beam_size': 4,
|
9 |
+
'best_train_epoch': -1,
|
10 |
+
'best_train_loss': inf,
|
11 |
+
'best_valid_epoch': -1,
|
12 |
+
'best_valid_loss': inf,
|
13 |
+
'blank_id': 0,
|
14 |
+
'bucketing_sampler': True,
|
15 |
+
'channels': 300,
|
16 |
+
'concatenate_cuts': False,
|
17 |
+
'context_size': 2,
|
18 |
+
'conv_layers': 18,
|
19 |
+
'decoder_dim': 256,
|
20 |
+
'decoding_method': 'greedy_search',
|
21 |
+
'drop_last': True,
|
22 |
+
'duration_factor': 1.0,
|
23 |
+
'enable_musan': True,
|
24 |
+
'enable_spec_aug': True,
|
25 |
+
'encoder_dim': 256,
|
26 |
+
'env_info': { 'IP address': '127.0.1.1',
|
27 |
+
'hostname': 'kao-dgxa-f12-u17',
|
28 |
+
'icefall-git-branch': 'tiny',
|
29 |
+
'icefall-git-date': 'Mon Jan 2 00:08:32 2023',
|
30 |
+
'icefall-git-sha1': '2fd970b-dirty',
|
31 |
+
'icefall-path': '/home/jsong/git/icefall',
|
32 |
+
'k2-build-type': 'Release',
|
33 |
+
'k2-git-date': 'Fri Nov 25 08:23:51 2022',
|
34 |
+
'k2-git-sha1': '1feafa064cf3b6c243e6b33b0192601224210937',
|
35 |
+
'k2-path': '/home/jsong/miniconda3/envs/k2/lib/python3.9/site-packages/k2/__init__.py',
|
36 |
+
'k2-version': '1.23.2',
|
37 |
+
'k2-with-cuda': True,
|
38 |
+
'lhotse-path': '/home/jsong/miniconda3/envs/k2/lib/python3.9/site-packages/lhotse/__init__.py',
|
39 |
+
'lhotse-version': '1.7.0',
|
40 |
+
'python-version': '3.9',
|
41 |
+
'torch-cuda-available': True,
|
42 |
+
'torch-cuda-version': '11.3',
|
43 |
+
'torch-version': '1.12.0'},
|
44 |
+
'epoch': 30,
|
45 |
+
'exp_dir': PosixPath('tiny_transducer_ctc/exp_2m_bpe500_halfdelay'),
|
46 |
+
'feature_dim': 80,
|
47 |
+
'full_libri': True,
|
48 |
+
'gap': 1.0,
|
49 |
+
'input_strategy': 'PrecomputedFeatures',
|
50 |
+
'iter': 0,
|
51 |
+
'joiner_dim': 256,
|
52 |
+
'lang_dir': PosixPath('data/lang_bpe_500'),
|
53 |
+
'log_interval': 500,
|
54 |
+
'manifest_dir': PosixPath('data/fbank'),
|
55 |
+
'max_contexts': 8,
|
56 |
+
'max_duration': 600,
|
57 |
+
'max_states': 64,
|
58 |
+
'max_sym_per_frame': 1,
|
59 |
+
'nbest_scale': 0.5,
|
60 |
+
'ngram_lm_scale': 0.1,
|
61 |
+
'num_buckets': 30,
|
62 |
+
'num_paths': 100,
|
63 |
+
'num_workers': 2,
|
64 |
+
'on_the_fly_feats': False,
|
65 |
+
'res_dir': PosixPath('tiny_transducer_ctc/exp_2m_bpe500_halfdelay/greedy_search'),
|
66 |
+
'reset_interval': 200,
|
67 |
+
'return_cuts': True,
|
68 |
+
'shuffle': True,
|
69 |
+
'skip_add': True,
|
70 |
+
'spec_aug_time_warp_factor': 80,
|
71 |
+
'subsampling_factor': 4,
|
72 |
+
'suffix': 'epoch-30-avg-2-context-2-max-sym-per-frame-1-uam',
|
73 |
+
'unk_id': 2,
|
74 |
+
'use_averaged_model': True,
|
75 |
+
'use_double_scores': True,
|
76 |
+
'use_dscnn': True,
|
77 |
+
'valid_interval': 9000,
|
78 |
+
'vocab_size': 500,
|
79 |
+
'warm_step': 5000}
|
80 |
+
2023-01-09 01:12:47,750 INFO [decode.py:590] About to create model
|
81 |
+
2023-01-09 01:12:50,232 INFO [train.py:425] Encoder MAC ops for 10 seconds of audio is 501.07M
|
82 |
+
2023-01-09 01:12:50,235 INFO [decode.py:659] Calculating the averaged model over epoch range from 28 (excluded) to 30
|
83 |
+
2023-01-09 01:12:50,572 INFO [decode.py:697] Number of model parameters: 2735794
|
84 |
+
2023-01-09 01:12:50,572 INFO [decode.py:698] Parameters for transducer decoding: 2350294
|
85 |
+
2023-01-09 01:12:50,572 INFO [asr_datamodule.py:443] About to get test-clean cuts
|
86 |
+
2023-01-09 01:12:50,574 INFO [asr_datamodule.py:450] About to get test-other cuts
|
87 |
+
2023-01-09 01:12:52,369 INFO [decode.py:459] batch 0/?, cuts processed until now is 43
|
88 |
+
2023-01-09 01:13:09,470 INFO [decode.py:478] The transcripts are stored in tiny_transducer_ctc/exp_2m_bpe500_halfdelay/greedy_search/recogs-test-clean-greedy_search-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam.txt
|
89 |
+
2023-01-09 01:13:09,537 INFO [utils.py:536] [test-clean-greedy_search] %WER 10.26% [5394 / 52576, 598 ins, 569 del, 4227 sub ]
|
90 |
+
2023-01-09 01:13:09,683 INFO [decode.py:491] Wrote detailed error stats to tiny_transducer_ctc/exp_2m_bpe500_halfdelay/greedy_search/errs-test-clean-greedy_search-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam.txt
|
91 |
+
2023-01-09 01:13:09,683 INFO [decode.py:508]
|
92 |
+
For test-clean, WER of different settings are:
|
93 |
+
greedy_search 10.26 best for test-clean
|
94 |
+
|
95 |
+
2023-01-09 01:13:10,353 INFO [decode.py:459] batch 0/?, cuts processed until now is 52
|
96 |
+
2023-01-09 01:13:25,888 INFO [decode.py:478] The transcripts are stored in tiny_transducer_ctc/exp_2m_bpe500_halfdelay/greedy_search/recogs-test-other-greedy_search-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam.txt
|
97 |
+
2023-01-09 01:13:25,960 INFO [utils.py:536] [test-other-greedy_search] %WER 25.13% [13156 / 52343, 1217 ins, 1939 del, 10000 sub ]
|
98 |
+
2023-01-09 01:13:26,121 INFO [decode.py:491] Wrote detailed error stats to tiny_transducer_ctc/exp_2m_bpe500_halfdelay/greedy_search/errs-test-other-greedy_search-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam.txt
|
99 |
+
2023-01-09 01:13:26,121 INFO [decode.py:508]
|
100 |
+
For test-other, WER of different settings are:
|
101 |
+
greedy_search 25.13 best for test-other
|
102 |
+
|
103 |
+
2023-01-09 01:13:26,121 INFO [decode.py:730] Done!
|
middle_bpe_500/decode_results/greedy_search/recogs-test-clean-greedy_search-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam.txt
ADDED
The diff for this file is too large to render.
See raw diff
|
|
middle_bpe_500/decode_results/greedy_search/recogs-test-other-greedy_search-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam.txt
ADDED
The diff for this file is too large to render.
See raw diff
|
|
middle_bpe_500/decode_results/greedy_search/wer-summary-test-clean-greedy_search-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam.txt
ADDED
@@ -0,0 +1,2 @@
|
|
|
|
|
|
|
1 |
+
settings WER
|
2 |
+
greedy_search 10.26
|
middle_bpe_500/decode_results/greedy_search/wer-summary-test-other-greedy_search-epoch-30-avg-2-context-2-max-sym-per-frame-1-uam.txt
ADDED
@@ -0,0 +1,2 @@
|
|
|
|
|
|
|
1 |
+
settings WER
|
2 |
+
greedy_search 25.13
|
middle_bpe_500/decode_results/modified_beam_search/errs-test-clean-beam_size_4-epoch-30-avg-2-modified_beam_search-beam-size-4-uam.txt
ADDED
The diff for this file is too large to render.
See raw diff
|
|
middle_bpe_500/decode_results/modified_beam_search/errs-test-other-beam_size_4-epoch-30-avg-2-modified_beam_search-beam-size-4-uam.txt
ADDED
The diff for this file is too large to render.
See raw diff
|
|
middle_bpe_500/decode_results/modified_beam_search/log-epoch-30-avg-2-modified_beam_search-beam-size-4-uam-2023-01-09-01-17-06
ADDED
@@ -0,0 +1,107 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
2023-01-09 01:17:06,332 INFO [decode.py:565] Decoding started
|
2 |
+
2023-01-09 01:17:06,332 INFO [decode.py:571] Device: cuda:0
|
3 |
+
2023-01-09 01:17:06,514 INFO [lexicon.py:168] Loading pre-compiled data/lang_bpe_500/Linv.pt
|
4 |
+
2023-01-09 01:17:06,554 INFO [decode.py:588] { 'activation': 'doubleswish',
|
5 |
+
'avg': 2,
|
6 |
+
'batch_idx_train': 0,
|
7 |
+
'beam': 20.0,
|
8 |
+
'beam_size': 4,
|
9 |
+
'best_train_epoch': -1,
|
10 |
+
'best_train_loss': inf,
|
11 |
+
'best_valid_epoch': -1,
|
12 |
+
'best_valid_loss': inf,
|
13 |
+
'blank_id': 0,
|
14 |
+
'bucketing_sampler': True,
|
15 |
+
'channels': 300,
|
16 |
+
'concatenate_cuts': False,
|
17 |
+
'context_size': 2,
|
18 |
+
'conv_layers': 18,
|
19 |
+
'decoder_dim': 256,
|
20 |
+
'decoding_method': 'modified_beam_search',
|
21 |
+
'drop_last': True,
|
22 |
+
'duration_factor': 1.0,
|
23 |
+
'enable_musan': True,
|
24 |
+
'enable_spec_aug': True,
|
25 |
+
'encoder_dim': 256,
|
26 |
+
'env_info': { 'IP address': '127.0.1.1',
|
27 |
+
'hostname': 'kao-dgxa-f12-u17',
|
28 |
+
'icefall-git-branch': 'tiny',
|
29 |
+
'icefall-git-date': 'Mon Jan 2 00:08:32 2023',
|
30 |
+
'icefall-git-sha1': '2fd970b-dirty',
|
31 |
+
'icefall-path': '/home/jsong/git/icefall',
|
32 |
+
'k2-build-type': 'Release',
|
33 |
+
'k2-git-date': 'Fri Nov 25 08:23:51 2022',
|
34 |
+
'k2-git-sha1': '1feafa064cf3b6c243e6b33b0192601224210937',
|
35 |
+
'k2-path': '/home/jsong/miniconda3/envs/k2/lib/python3.9/site-packages/k2/__init__.py',
|
36 |
+
'k2-version': '1.23.2',
|
37 |
+
'k2-with-cuda': True,
|
38 |
+
'lhotse-path': '/home/jsong/miniconda3/envs/k2/lib/python3.9/site-packages/lhotse/__init__.py',
|
39 |
+
'lhotse-version': '1.7.0',
|
40 |
+
'python-version': '3.9',
|
41 |
+
'torch-cuda-available': True,
|
42 |
+
'torch-cuda-version': '11.3',
|
43 |
+
'torch-version': '1.12.0'},
|
44 |
+
'epoch': 30,
|
45 |
+
'exp_dir': PosixPath('tiny_transducer_ctc/exp_2m_bpe500_halfdelay'),
|
46 |
+
'feature_dim': 80,
|
47 |
+
'full_libri': True,
|
48 |
+
'gap': 1.0,
|
49 |
+
'input_strategy': 'PrecomputedFeatures',
|
50 |
+
'iter': 0,
|
51 |
+
'joiner_dim': 256,
|
52 |
+
'lang_dir': PosixPath('data/lang_bpe_500'),
|
53 |
+
'log_interval': 500,
|
54 |
+
'manifest_dir': PosixPath('data/fbank'),
|
55 |
+
'max_contexts': 8,
|
56 |
+
'max_duration': 600,
|
57 |
+
'max_states': 64,
|
58 |
+
'max_sym_per_frame': 1,
|
59 |
+
'nbest_scale': 0.5,
|
60 |
+
'ngram_lm_scale': 0.1,
|
61 |
+
'num_buckets': 30,
|
62 |
+
'num_paths': 100,
|
63 |
+
'num_workers': 2,
|
64 |
+
'on_the_fly_feats': False,
|
65 |
+
'res_dir': PosixPath('tiny_transducer_ctc/exp_2m_bpe500_halfdelay/modified_beam_search'),
|
66 |
+
'reset_interval': 200,
|
67 |
+
'return_cuts': True,
|
68 |
+
'shuffle': True,
|
69 |
+
'skip_add': True,
|
70 |
+
'spec_aug_time_warp_factor': 80,
|
71 |
+
'subsampling_factor': 4,
|
72 |
+
'suffix': 'epoch-30-avg-2-modified_beam_search-beam-size-4-uam',
|
73 |
+
'unk_id': 2,
|
74 |
+
'use_averaged_model': True,
|
75 |
+
'use_double_scores': True,
|
76 |
+
'use_dscnn': True,
|
77 |
+
'valid_interval': 9000,
|
78 |
+
'vocab_size': 500,
|
79 |
+
'warm_step': 5000}
|
80 |
+
2023-01-09 01:17:06,555 INFO [decode.py:590] About to create model
|
81 |
+
2023-01-09 01:17:08,981 INFO [train.py:425] Encoder MAC ops for 10 seconds of audio is 501.07M
|
82 |
+
2023-01-09 01:17:08,984 INFO [decode.py:659] Calculating the averaged model over epoch range from 28 (excluded) to 30
|
83 |
+
2023-01-09 01:17:09,308 INFO [decode.py:697] Number of model parameters: 2735794
|
84 |
+
2023-01-09 01:17:09,308 INFO [decode.py:698] Parameters for transducer decoding: 2350294
|
85 |
+
2023-01-09 01:17:09,308 INFO [asr_datamodule.py:443] About to get test-clean cuts
|
86 |
+
2023-01-09 01:17:09,309 INFO [asr_datamodule.py:450] About to get test-other cuts
|
87 |
+
2023-01-09 01:17:13,410 INFO [decode.py:459] batch 0/?, cuts processed until now is 43
|
88 |
+
2023-01-09 01:18:06,305 INFO [decode.py:459] batch 20/?, cuts processed until now is 1430
|
89 |
+
2023-01-09 01:18:50,859 INFO [decode.py:459] batch 40/?, cuts processed until now is 2561
|
90 |
+
2023-01-09 01:18:52,480 INFO [decode.py:478] The transcripts are stored in tiny_transducer_ctc/exp_2m_bpe500_halfdelay/modified_beam_search/recogs-test-clean-beam_size_4-epoch-30-avg-2-modified_beam_search-beam-size-4-uam.txt
|
91 |
+
2023-01-09 01:18:52,547 INFO [utils.py:536] [test-clean-beam_size_4] %WER 9.43% [4959 / 52576, 592 ins, 455 del, 3912 sub ]
|
92 |
+
2023-01-09 01:18:52,730 INFO [decode.py:491] Wrote detailed error stats to tiny_transducer_ctc/exp_2m_bpe500_halfdelay/modified_beam_search/errs-test-clean-beam_size_4-epoch-30-avg-2-modified_beam_search-beam-size-4-uam.txt
|
93 |
+
2023-01-09 01:18:52,731 INFO [decode.py:508]
|
94 |
+
For test-clean, WER of different settings are:
|
95 |
+
beam_size_4 9.43 best for test-clean
|
96 |
+
|
97 |
+
2023-01-09 01:18:55,707 INFO [decode.py:459] batch 0/?, cuts processed until now is 52
|
98 |
+
2023-01-09 01:19:46,663 INFO [decode.py:459] batch 20/?, cuts processed until now is 1646
|
99 |
+
2023-01-09 01:20:28,554 INFO [decode.py:459] batch 40/?, cuts processed until now is 2870
|
100 |
+
2023-01-09 01:20:30,170 INFO [decode.py:478] The transcripts are stored in tiny_transducer_ctc/exp_2m_bpe500_halfdelay/modified_beam_search/recogs-test-other-beam_size_4-epoch-30-avg-2-modified_beam_search-beam-size-4-uam.txt
|
101 |
+
2023-01-09 01:20:30,285 INFO [utils.py:536] [test-other-beam_size_4] %WER 23.53% [12315 / 52343, 1237 ins, 1620 del, 9458 sub ]
|
102 |
+
2023-01-09 01:20:30,446 INFO [decode.py:491] Wrote detailed error stats to tiny_transducer_ctc/exp_2m_bpe500_halfdelay/modified_beam_search/errs-test-other-beam_size_4-epoch-30-avg-2-modified_beam_search-beam-size-4-uam.txt
|
103 |
+
2023-01-09 01:20:30,446 INFO [decode.py:508]
|
104 |
+
For test-other, WER of different settings are:
|
105 |
+
beam_size_4 23.53 best for test-other
|
106 |
+
|
107 |
+
2023-01-09 01:20:30,446 INFO [decode.py:730] Done!
|
middle_bpe_500/decode_results/modified_beam_search/recogs-test-clean-beam_size_4-epoch-30-avg-2-modified_beam_search-beam-size-4-uam.txt
ADDED
The diff for this file is too large to render.
See raw diff
|
|
middle_bpe_500/decode_results/modified_beam_search/recogs-test-other-beam_size_4-epoch-30-avg-2-modified_beam_search-beam-size-4-uam.txt
ADDED
The diff for this file is too large to render.
See raw diff
|
|
middle_bpe_500/decode_results/modified_beam_search/wer-summary-test-clean-beam_size_4-epoch-30-avg-2-modified_beam_search-beam-size-4-uam.txt
ADDED
@@ -0,0 +1,2 @@
|
|
|
|
|
|
|
1 |
+
settings WER
|
2 |
+
beam_size_4 9.43
|
middle_bpe_500/decode_results/modified_beam_search/wer-summary-test-other-beam_size_4-epoch-30-avg-2-modified_beam_search-beam-size-4-uam.txt
ADDED
@@ -0,0 +1,2 @@
|
|
|
|
|
|
|
1 |
+
settings WER
|
2 |
+
beam_size_4 23.53
|
middle_bpe_500/exp/cpu_jit.pt
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:dc21aa9bb4d0233243f57c76f52847e4c55f11b97cd71ae2e18128e30214b16d
|
3 |
+
size 11207430
|
middle_bpe_500/exp/log/log-train-2023-01-06-07-16-15
ADDED
The diff for this file is too large to render.
See raw diff
|
|