Speaker ID script improvements #1543

david-ryan-snyder · 2017-04-14T04:03:27Z

This PR is for various improvements to the SID recipes. This is primarily motivated by issues tracked on Git or in the Kaldi forums.

The i-vector scripts that use nnet2 in sid/*_dnn.sh now support using GPU explicitly. This is in response to complaints of slowness in these scripts. This will close Issue SRE10 v2 Improvements #165.
A fix to src/fgmmbin/fgmm-global-init-from-accs.cc so that when a Gaussian has very low occupancy, we don't just crash (See https://groups.google.com/d/msg/kaldi-help/U_L_6IWBN1c/L8oPTcE5AgAJ)
LID and SID scripts now do more cleanup with the --cleanup=true option. This closes Issue cleanup when training iVector extractor #1059.
SID i-vector training scripts now use '--num-threads N' instead of '-pe smp N.' This closes Issue SID and LID recipes should use --num-threads N #1096. (The LID scripts were already doing the right thing).
In sre10/v1/local/plda_scoring.sh added an option for --simple-length-norm (which defaults to 'false' since it gives better performance in SRE10). This closes issue In SID recipes, provide a script-level option for simple-length-normalization #1097.
In egs/sre10/{v1,v2}/run.sh, the PLDA scores are now written to exp instead of local. This is better, since v1 and v2 share the same local directory and would override each other otherwise. Also changing the old-style memory options to the new ones (E.g., --mem 5G))
The scripts to train the DNN for SRE10 have been moved from sre10/v1/local to sre08/v1/sid/nnet2 (this mirrors what we did with lre07/v1/lid/nnet2). This is consist with other setups, and makes them easier to access by new (or user created) SID recipes.
In sre10/v1/local/dnn/run_nnet2_multisplice.sh we now use 8 GPUs to the train the DNN, instead of 18 (which is excessive, and might've been a typo).
Various cosmetic fixes: fixed indentation in several sid and lid scripts. Removed trailing whitespace in src/ivectorbin/*cc. Fixed a typo in src/gmm/full-gmm.cc . Changed wording in egs/sre10/v1/local/dnn/train_dnn.sh so that it, which is an nnet2 pnorm recipe, is no longer referred to as the "current best recipe" but rather as an "older nnet2 recipe," which is now the correct thing to say.

…f cosmetic improvements, better error-handling, and various minor fixes. This does not change results.

danpovey · 2017-04-14T04:21:26Z

Thanks-- so this is ready to review?

david-ryan-snyder · 2017-04-14T04:30:12Z

@danpovey, yes! Thanks!

@osadjadi

* 'master' of https://github.com/kaldi-asr/kaldi: (21 commits) [egs] bug-fix in egs/ami/s5/run_ihm.sh (kaldi-asr#1577) [src] Minor bug-fixes in compute-wer-bootci and WSJ run.sh. Thanks: @osadjadi [egs] Add soft link for mini-librispeech setup [egs] adding results and cleanup in mini-librispeech [egs] Add mini-librispeech example scripts [intended as a sanity-checker/tutorial setup] (kaldi-asr#1566) [src] Fix to testing code signal-test.cc, change threshold to resolve failure (kaldi-asr#1565) [src] Add documentation for dropout function. [src,scripts,egs] Add dropout for nnet3 LSTMs, with recipes. (kaldi-asr#1537) [src] nnet3 online silence weighting - adding frame subsampling factor (kaldi-asr#1559) [doc] Small edit to hmm.dox, clarifying something [egs] Added check for kaldi_lm being installed in fisher_swbd recipe. (kaldi-asr#1558) Update travis.yml so PRs to kaldi_52 are built [srcipts] steps/nnet3/report/generate_plots.py: plot 5,50,95th percentile of value and derivative instead of mean+-stddev (kaldi-asr#1472) [egs] AMI TDNN Results Update (kaldi-asr#1545) [src] add template instantiations for ConvertStringToReal, address issue kaldi-asr#1544 [egs,scripts,src] SID and LID tools and scripts: cosmetic improvements, better error-handling, and various minor fixes; results unchanged. (kaldi-asr#1543) [src] Change ConvertStringToReal to be locale-independent (i.e. always-US). Fixes android issue. (kaldi-asr#1513) [scripts] nnet3 : fix issue where LDA estimation failed for LSTMs with label delay (kaldi-asr#1540) [scripts] fix to get_egs_targets.sh (thanks: David Pye) [src] Fix copy-feats for using the --write-num-frames and --compress true flags at the same time (kaldi-asr#1541) ...

…s, better error-handling, and various minor fixes; results unchanged. (kaldi-asr#1543)

[egs,scripts,src] Maintenance on SID and LID scripts. This consists o…

105e494

…f cosmetic improvements, better error-handling, and various minor fixes. This does not change results.

danpovey merged commit 7fcc311 into kaldi-asr:master Apr 14, 2017

Skaiste pushed a commit to Skaiste/idlak that referenced this pull request Sep 26, 2018

[egs,scripts,src] SID and LID tools and scripts: cosmetic improvement…

a61029a

…s, better error-handling, and various minor fixes; results unchanged. (kaldi-asr#1543)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Speaker ID script improvements #1543

Speaker ID script improvements #1543

david-ryan-snyder commented Apr 14, 2017 •

edited

Loading

danpovey commented Apr 14, 2017

david-ryan-snyder commented Apr 14, 2017 •

edited

Loading

Speaker ID script improvements #1543

Speaker ID script improvements #1543

Conversation

david-ryan-snyder commented Apr 14, 2017 • edited Loading

danpovey commented Apr 14, 2017

david-ryan-snyder commented Apr 14, 2017 • edited Loading

david-ryan-snyder commented Apr 14, 2017 •

edited

Loading

david-ryan-snyder commented Apr 14, 2017 •

edited

Loading