WIP: Speaker ID script improvements #1538

david-ryan-snyder · 2017-04-10T16:29:05Z

This WIP PR is for various improvements to the SID recipes. This is primarily motivated by issues tracked on Git or in the Kaldi forums.

The i-vector scripts that use nnet2 in sid/*_dnn.sh now support using GPU explicitly. This is in response to complaints of slowness in these scripts. This will close Issue SRE10 v2 Improvements #165.
A fix to src/fgmmbin/fgmm-global-init-from-accs.cc so that when a Gaussian has very low occupancy, we don't just crash (See https://groups.google.com/d/msg/kaldi-help/U_L_6IWBN1c/L8oPTcE5AgAJ)
LID and SID scripts now do more cleanup with the --cleanup=true option. This closes Issue cleanup when training iVector extractor #1059.
SID i-vector training scripts now use '--num-threads N' instead of '-pe smp N.' This closes Issue SID and LID recipes should use --num-threads N #1096. (The LID scripts were already doing the right thing).
In sre10/v1/local/plda_scoring.sh added an option for --simple-length-norm (which defaults to 'false' since it gives better performance in SRE10). This closes issue In SID recipes, provide a script-level option for simple-length-normalization #1097.
In egs/sre10/{v1,v2}/run.sh, the PLDA scores are now written to exp instead of local. This is better, since v1 and v2 share the same local directory and would override each other otherwise. Also changing the old-style memory options to the new ones (E.g., --mem 5G))
The scripts to train the DNN for SRE10 have been moved from sre10/v1/local to sre08/v1/sid/nnet2 (this mirrors what we did with lre07/v1/lid/nnet2). This is consist with other setups, and makes them easier to access by new (or user created) SID recipes.
In sre10/v1/local/dnn/run_nnet2_multisplice.sh we now use 8 GPUs to the train the DNN, instead of 18 (which is excessive, and might've been a typo).
Various cosmetic fixes: fixed indentation in several sid and lid scripts. Fixed a typo in src/gmm/full-gmm.cc . Changed wording in egs/sre10/v1/local/dnn/train_dnn.sh so that it, which is an nnet2 pnorm recipe, is no longer referred to as the "current best recipe" but rather as an "older nnet2 recipe," which is now the correct thing to say.

…e10/v1/local to sre08/v1/sid/nnet2

… stage to 0 (kaldi-asr#1416)

…cognition in nnet3. Not fully debugged.

…nt online computation.

…d to allow online computation. Add a basic test for multi-segment computation.

…ed/unnecessary 'request' args for optimization).

…TODO.

…ieces to splice).

… run.

… add tests, and debug to the extent that the tests succeed.

…here possible.

…internal code for shortcut compilation.

…ailing thouth)

…-creation code

…different-sized egs, and different begin/end l/r context

… the num-frames for examples. This code compiles but is not tested.

…NN script. (kaldi-asr#1497)

…aldi-asr#1498)

kaldi-asr#1501)

…asr#1503)

…lstm recipe with -1 delay at lowest lstm layer (kaldi-asr#1505) swbd : Added tdnn_lstm recipe with delay -1 at the lowest lstm layer

)

this improves speed when using hashes.

…nt::Scale() if scale==0.0 (kaldi-asr#1522)

…ectors used in ASR. Results are reported in the default TDNN recipe in AMI. Updating steps/online/nnet2/{train_diag_ubm.sh,train_ivector_extractor.sh} so that they now backup the contents of their destination directory if it already exists. (kaldi-asr#1514)

…(avoid space in version); minor fixes (kaldi-asr#1526)

It appears there may be no good reason to disallow system-wide OpenFst.

…template (kaldi-asr#1530) CuVector::AddColSumMat<float>[no-trans], 16 0.0057 0.0172 3.01x CuVector::AddColSumMat<float>[no-trans], 32 0.0242 0.0668 2.76x CuVector::AddColSumMat<float>[no-trans], 64 0.0992 0.2577 2.60x CuVector::AddColSumMat<float>[no-trans], 128 0.3747 0.9280 2.48x CuVector::AddColSumMat<float>[no-trans], 256 1.4711 3.0541 2.08x CuVector::AddColSumMat<float>[no-trans], 512 5.1709 9.4713 1.83x CuVector::AddColSumMat<float>[no-trans], 1024 12.4352 20.4517 1.64x CuVector::AddColSumMat<double>[no-trans], 16 0.0060 0.0175 2.91x CuVector::AddColSumMat<double>[no-trans], 32 0.0240 0.0672 2.80x CuVector::AddColSumMat<double>[no-trans], 64 0.1006 0.2712 2.70x CuVector::AddColSumMat<double>[no-trans], 128 0.3691 0.9097 2.46x CuVector::AddColSumMat<double>[no-trans], 256 1.4530 3.1044 2.14x CuVector::AddColSumMat<double>[no-trans], 512 4.4524 7.5872 1.70x CuVector::AddColSumMat<double>[no-trans], 1024 11.1212 16.1423 1.45x

…asr#1529)

- we auto-detect the 'compute capability' problems (these appear as the 'invalid device function'), - we also provide guidelines what to try before posting to forum, and which info to send to us,

…true flags at the same time (kaldi-asr#1541)

…h label delay (kaldi-asr#1540) xconfig : Added delay option for FixedAffineLayer. This will be used for ensuring the model specified in ref.config has at least the context required by the model specified in init.config

david-ryan-snyder · 2017-04-12T23:34:03Z

Something went wrong during the rebasing. Going to try this again in another PR (#1543).

David Snyder and others added 30 commits April 12, 2017 18:34

sid-fix-2017-02-11: [egs,scripts]: Moving SRE10 NNET2 scripts from sr…

ff7f51b

…e10/v1/local to sre08/v1/sid/nnet2

[egs] egs/fisher_swbd/s5/local/online/run_nnet2_ms.sh, change default…

3616ff8

… stage to 0 (kaldi-asr#1416)

[egs] Add example scripts for Frisian-Dutch language (FAME! corpus)

e1083d9

Early parts of 'shortcut' compilation

80295c1

Cosmetic changes in nnet3 code

f16da00

Some code refactoring that will make it easier to implement online re…

06bd75a

…cognition in nnet3. Not fully debugged.

Some bug fixes to previous commit (RE refactoring code in nnet3).

871b39a

Further refactoring to nnet3 compilation to make it easier to impleme…

7efa27f

…nt online computation.

Fix a few bugs shown up by valgrind testing

c5f441b

Refactoring generation of the computation 'steps' for more clarity an…

cab8420

…d to allow online computation. Add a basic test for multi-segment computation.

Some minor refactoring to make online computation easier (remove unus…

6d0d9ff

…ed/unnecessary 'request' args for optimization).

Further progress [note, this is partial work, backing up. Search for …

7a53e66

…TODO.

Going some way towards optimization for online decoding (identified p…

b08940a

…ieces to splice).

Get the online optimization code working to the point where the tests…

57a4af9

… run.

Add a couple of previously omitted files

abdd595

Change name from online to looped (less confusable)

10d6a1a

Finishing the decodable objects (not yet for online computatoin), and…

87f695b

… add tests, and debug to the extent that the tests succeed.

Add decoding program nnet3-latgen-faster-looped

ee9b963

Fix bug discovered by testing code

7327c03

Fix bug discovered by TDNN decoding script

f37d422

Adding another optimization to convert row-wise to whole-matrix ops w…

570e82f

…here possible.

Early parts of 'shortcut' compilation

f337886

Add new type of optimization of per-row commands; finish some of the …

d3acb9e

…internal code for shortcut compilation.

Getting shortcut compilation to the point where it's testable (test f…

b1cb7d3

…ailing thouth)

Fix various bugs in shortcut compilation; add further testing code

e44ba77

Small documentation fix

8fd1959

Remove no-longer-used option --cut-zero-frames from chain supervision…

67a8f7a

…-creation code

Some draft code, on the way to changing egs-extraction code to allow …

1e215a8

…different-sized egs, and different begin/end l/r context

Draft of UtteranceSplitter and related code

0129565

Refactoring the example-extraction for nnet3, for more flexibility in…

4ae5e53

… the num-frames for examples. This code compiles but is not tested.

baali and others added 27 commits April 12, 2017 19:08

[egs] Fixes to URLs in vystadial example script.

0ebbc74

[src] nnet1: fixing issue in multi-task training (kaldi-asr#1491)

7d79572

[build] Bump OpenFst version to v1.6.2 (kaldi-asr#1492)

7c171a5

[egs] swbd/chain : added blstm script using fast-LSTM; added BLSTM+TD…

9a61b88

…NN script. (kaldi-asr#1497)

[egs] update fisher_swbd recipe (fixes to how things are installed). (k…

68849f4

…aldi-asr#1498)

[src] sort cuda kernel function declarations to make searching easier. (

bef410c

kaldi-asr#1501)

[build] Android compilation, bug-fixes (kaldi-asr#1502)

041934f

[doc] Add a note to README.md about Android cross compilation (kaldi-…

f8f83ad

…asr#1503)

[egs] ami : Added tdnn_lstm recipe with fast-lstmp layer. Added tdnn_…

d38d067

…lstm recipe with -1 delay at lowest lstm layer (kaldi-asr#1505) swbd : Added tdnn_lstm recipe with delay -1 at the lowest lstm layer

[scripts] prevent failure when final.ie.id doesn't exist (kaldi-asr#1508

0a6b38e

)

[src] Fix exit code of extract-rows.cc (kaldi-asr#1510)

d954b93

[egs] fixes to babel pipeline; thanks to Fred Richardson (kaldi-asr#1509

5316e68

)

[src,scripts]: Several unrelated cosmetic changes

db27674

[misc] remove eXecute permissions where not needed (kaldi-asr#1515)

ae82d90

[egs] Fix to egs/wsj/s5/run.sh (unset variable) (kaldi-asr#1517)

e5d1de3

[src] Adding noexcept to hashing function objects (kaldi-asr#1519)

e1b7916

this improves speed when using hashes.

[src,doc] Fix several unrelated minor problems. Thanks: gaoxinglong

7a73689

[src] (minor) Added missing SetZero() to NaturalGradientAffineCompone…

e72c15c

…nt::Scale() if scale==0.0 (kaldi-asr#1522)

[build,src,doc] Modify get_version.sh to deal better with whitespace …

c1e7b29

…(avoid space in version); minor fixes (kaldi-asr#1526)

[build]: remove openfst check (kaldi-asr#1531)

53d0e88

It appears there may be no good reason to disallow system-wide OpenFst.

[src] Cosmetic change: remove 'train.tra' from usage messages (kaldi-…

5ac45be

…asr#1529)

[src] nnet1: improving the GPU diagnostics, (kaldi-asr#1532)

5765819

- we auto-detect the 'compute capability' problems (these appear as the 'invalid device function'), - we also provide guidelines what to try before posting to forum, and which info to send to us,

[src] Fix copy-feats for using the --write-num-frames and --compress …

19df56a

…true flags at the same time (kaldi-asr#1541)

[scripts] fix to get_egs_targets.sh (thanks: David Pye)

b1e6ec8

david-ryan-snyder force-pushed the sid-fix-2017-02-11 branch from 1ef0b12 to 6599c9b Compare April 12, 2017 23:12

david-ryan-snyder closed this Apr 12, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP: Speaker ID script improvements #1538

WIP: Speaker ID script improvements #1538

david-ryan-snyder commented Apr 10, 2017 •

edited

Loading

david-ryan-snyder commented Apr 12, 2017 •

edited

Loading

WIP: Speaker ID script improvements #1538

WIP: Speaker ID script improvements #1538

Conversation

david-ryan-snyder commented Apr 10, 2017 • edited Loading

david-ryan-snyder commented Apr 12, 2017 • edited Loading

david-ryan-snyder commented Apr 10, 2017 •

edited

Loading

david-ryan-snyder commented Apr 12, 2017 •

edited

Loading