Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fast_beam_search_nbest gives very high WER compared to fast_beam_search and greedy. #1668

Open
chirag-augnito opened this issue Jun 24, 2024 · 2 comments

Comments

@chirag-augnito
Copy link

chirag-augnito commented Jun 24, 2024

Hi,
I am getting around 3% wer in fast-beam-search and greedy-search. However, I am getting 70% WER when I use fast-beam-search-ngram. My decode configuration looks as below. I am using pruned_transducer_stateless7_streaming from librispeech recipe.

Most of my words are getting deleted.

./local/augnito/pruned_transducer_stateless7_streaming/decode.py \
--epoch 19 \
--avg 1 \
--use-averaged-model False \
--exp-dir ${exp_dir} \
--max-duration 200 \
--decode-chunk-len 32 \
--decoding-method fast_beam_search_nbest \
--beam 20.0 \
--max-contexts 8 \
--max-states 16 \
--num-paths 200 \
--ngram-lm-scale 0.01 \
--manifest-dir $manifest_dir
@csukuangfj
Copy link
Collaborator

csukuangfj commented Jun 24, 2024

Could you post the error patterns from errs-xxx file?

@chiragpatel39
Copy link

chiragpatel39 commented Jun 24, 2024

%WER = 73.96
Errors: 1 insertions, 252 deletions, 31 substitutions, over 384 reference words (101 correct)
Search below for sections starting with PER-UTT DETAILS:, SUBSTITUTIONS:, DELETIONS:, INSERTIONS:, PER-WORD STATS:

PER-UTT DETAILS: corr or (ref->hyp)
utt1: (ADC and high T2 and diffusion and FLAIR signal->e two) measuring
utt2: (extension to the prostatic urethra which is displaced posteriorly and to the->*) left side

They are mostly deletions

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants