Speaker ID script improvements #1543

Merged Apr 14, 2017 (1 commit)


@david-ryan-snyder david-ryan-snyder commented Apr 14, 2017

This PR makes various improvements to the SID recipes. It is primarily motivated by issues tracked on GitHub or raised in the Kaldi forums.

  • The i-vector scripts that use nnet2 in sid/*_dnn.sh now explicitly support using a GPU. This is in response to complaints about slowness in these scripts. This will close issue #165 ("SRE10 v2 Improvements").

  • A fix to src/fgmmbin/fgmm-global-init-from-accs.cc so that it no longer crashes when a Gaussian has very low occupancy (see https://groups.google.com/d/msg/kaldi-help/U_L_6IWBN1c/L8oPTcE5AgAJ).

  • LID and SID scripts now do more cleanup with the --cleanup=true option. This closes issue #1059 ("cleanup when training iVector extractor").

  • SID i-vector training scripts now use '--num-threads N' instead of '-pe smp N'. This closes issue #1096 ("SID and LID recipes should use --num-threads N"). (The LID scripts were already doing the right thing.)

  • In sre10/v1/local/plda_scoring.sh, added a --simple-length-norm option (which defaults to 'false', since that gives better performance in SRE10). This closes issue #1097 ("In SID recipes, provide a script-level option for simple-length-normalization").

  • In egs/sre10/{v1,v2}/run.sh, the PLDA scores are now written to exp instead of local. This is better, since v1 and v2 share the same local directory and would otherwise overwrite each other's scores. Also changed the old-style memory options to the new ones (e.g., --mem 5G).

  • The scripts to train the DNN for SRE10 have been moved from sre10/v1/local to sre08/v1/sid/nnet2 (this mirrors what we did with lre07/v1/lid/nnet2). This is consistent with other setups, and makes them easier to access from new (or user-created) SID recipes.

  • In sre10/v1/local/dnn/run_nnet2_multisplice.sh we now use 8 GPUs to train the DNN, instead of 18 (which is excessive, and might have been a typo).

  • Various cosmetic fixes: fixed indentation in several sid and lid scripts. Removed trailing whitespace in src/ivectorbin/*cc. Fixed a typo in src/gmm/full-gmm.cc. Changed the wording in egs/sre10/v1/local/dnn/train_dnn.sh so that this script, an nnet2 pnorm recipe, is no longer referred to as the "current best recipe" but rather as an "older nnet2 recipe," which is now accurate.
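
For anyone unfamiliar with the --simple-length-norm option mentioned above: as I understand it, simple length normalization rescales each i-vector so that its Euclidean norm equals the square root of its dimension, in place of the default length normalization done inside PLDA. Here is a minimal Python sketch of that idea. It is not Kaldi code; the function name simple_length_norm and the zero-vector handling are assumptions made only for illustration.

```python
import math


def simple_length_norm(ivector):
    """Scale a vector so that its Euclidean norm equals sqrt(dimension).

    Hypothetical helper for illustration only; not part of Kaldi.
    """
    dim = len(ivector)
    norm = math.sqrt(sum(x * x for x in ivector))
    if norm == 0.0:
        # Assumption for this sketch: leave an all-zero vector unchanged.
        return list(ivector)
    scale = math.sqrt(dim) / norm
    return [x * scale for x in ivector]


# Example: a 2-dimensional vector with norm 5 is rescaled to norm sqrt(2).
v = simple_length_norm([3.0, 4.0])
v_norm = math.sqrt(sum(x * x for x in v))
```

The direction of the vector is unchanged; only its length is fixed, so every i-vector ends up on the same sphere before scoring.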

…f cosmetic improvements, better error-handling, and various minor fixes. This does not change results.
@danpovey
Contributor

Thanks-- so this is ready to review?

@david-ryan-snyder
Contributor Author

david-ryan-snyder commented Apr 14, 2017

@danpovey, yes! Thanks!

@danpovey danpovey merged commit 7fcc311 into kaldi-asr:master Apr 14, 2017
kronos-cm added a commit to kronos-cm/kaldi that referenced this pull request Apr 28, 2017
* 'master' of https://github.com/kaldi-asr/kaldi: (21 commits)
  [egs] bug-fix in egs/ami/s5/run_ihm.sh (kaldi-asr#1577)
  [src] Minor bug-fixes in compute-wer-bootci and WSJ run.sh.  Thanks: @osadjadi
  [egs] Add soft link for mini-librispeech setup
  [egs] adding results and cleanup in mini-librispeech
  [egs] Add mini-librispeech example scripts [intended as a sanity-checker/tutorial setup] (kaldi-asr#1566)
  [src] Fix to testing code signal-test.cc, change threshold to resolve failure (kaldi-asr#1565)
  [src] Add documentation for dropout function.
  [src,scripts,egs]  Add dropout for nnet3 LSTMs, with recipes. (kaldi-asr#1537)
  [src] nnet3 online silence weighting - adding frame subsampling factor (kaldi-asr#1559)
  [doc] Small edit to hmm.dox, clarifying something
  [egs] Added check for kaldi_lm being installed in fisher_swbd recipe. (kaldi-asr#1558)
  Update travis.yml so PRs to kaldi_52 are built
  [srcipts] steps/nnet3/report/generate_plots.py: plot 5,50,95th percentile of value and derivative instead of mean+-stddev (kaldi-asr#1472)
  [egs] AMI TDNN Results Update (kaldi-asr#1545)
  [src] add template instantiations for ConvertStringToReal, address issue kaldi-asr#1544
  [egs,scripts,src] SID and LID tools and scripts: cosmetic improvements, better error-handling, and various minor fixes; results unchanged. (kaldi-asr#1543)
  [src] Change ConvertStringToReal to be locale-independent (i.e. always-US).  Fixes android issue. (kaldi-asr#1513)
  [scripts] nnet3 : fix issue where LDA estimation failed for LSTMs with label delay (kaldi-asr#1540)
  [scripts] fix to get_egs_targets.sh (thanks: David Pye)
  [src] Fix copy-feats for using the --write-num-frames and --compress true flags at the same time (kaldi-asr#1541)
  ...
Skaiste pushed a commit to Skaiste/idlak that referenced this pull request Sep 26, 2018
…s, better error-handling, and various minor fixes; results unchanged. (kaldi-asr#1543)