cankeles commited on
Commit
27e3df6
·
1 Parent(s): 896bc11

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +75 -0
README.md ADDED
@@ -0,0 +1,75 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - asteroid
4
+ - audio
5
+ - ConvTasNet
6
+ - audio-to-audio
7
+ datasets:
8
+ - WHAMRmod
9
+ - enh_single
10
+ license: cc-by-sa-4.0
11
+ ---
12
+ ## Asteroid model `cankeles/ConvTasNet_WHAMRmod_enhsingle_16k`
13
+
14
+ Description:
15
+
16
+ This model was fine tuned on a modified version of WHAMR! where the speakers were taken from audiobook recordings.
17
+ The initial model was taken from here: https://huggingface.co/JorisCos/DCCRNet_Libri1Mix_enhsingle_16k
18
+ This model was trained by M. Can Keles using the WHAM recipe in [Asteroid](https://github.com/asteroid-team/asteroid).
19
+ It was trained on the `enh_single` task of the WHAM dataset.
20
+
21
+ Training config:
22
+
23
+ ```yml
24
+ data:
25
+ mode: min
26
+ nondefault_nsrc: null
27
+ sample_rate: 16000
28
+ task: enh_single
29
+ train_dir: wav16k/min/tr/
30
+ valid_dir: wav16k/min/cv/
31
+ filterbank:
32
+ kernel_size: 16
33
+ n_filters: 512
34
+ stride: 8
35
+ main_args:
36
+ exp_dir: exp/tmp
37
+ help: null
38
+ masknet:
39
+ bn_chan: 128
40
+ hid_chan: 512
41
+ mask_act: relu
42
+ n_blocks: 8
43
+ n_repeats: 3
44
+ n_src: 1
45
+ skip_chan: 128
46
+ optim:
47
+ lr: 0.001
48
+ optimizer: adam
49
+ weight_decay: 0.0
50
+ positional arguments: {}
51
+ training:
52
+ batch_size: 2
53
+ early_stop: true
54
+ epochs: 10
55
+ half_lr: true
56
+ num_workers: 4
57
+ ```
58
+
59
+
60
+ Results:
61
+
62
+ On Libri1Mix min test set :
63
+ ```yml
64
+ si_sdr: 14.743051006476085
65
+ si_sdr_imp: 11.293269700616385
66
+ sdr: 15.300522933671061
67
+ sdr_imp: 11.797860134458015
68
+ sir: Infinity
69
+ sir_imp: NaN
70
+ sar: 15.300522933671061
71
+ sar_imp: 11.797860134458015
72
+ stoi: 0.9310514162434267
73
+ stoi_imp: 0.13513159270288563
74
+ ```
75
+