New recipe: tiny_transducer_ctc #848
Conversation
I get noticeable WER improvements just by enabling TF32 (see https://pytorch.org/docs/stable/notes/cuda.html#tensorfloat-32-tf32-on-ampere-devices). I will update the WERs once all models' results are ready.
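For reference, the TF32 setting linked above is a two-flag configuration fragment in PyTorch (assuming PyTorch 1.7+ running on an Ampere-or-newer GPU):

```python
import torch

# Allow TF32 in fp32 matmuls and cuDNN convolutions (Ampere+ GPUs).
# Trades a small amount of mantissa precision for a large speedup.
torch.backends.cuda.matmul.allow_tf32 = True
torch.backends.cudnn.allow_tf32 = True
```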
Thanks. Could you also support streaming decoding with cached left context for the causal convolutional modules, like
Looks like there's some work involved; I will get to it when I have the time!
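Not part of this PR, but the cached-left-context idea can be sketched in a few lines: a causal Conv1d pads only on the left, so streaming inference just needs to carry the last `kernel_size - 1` input frames from one chunk to the next. A minimal pure-Python illustration (hypothetical names, single channel, not the recipe's actual code):

```python
def causal_conv1d(x, weights, cache):
    """Causal 1-D convolution over one chunk.

    x:       list of input frames for this chunk
    weights: kernel taps, oldest first (len = kernel size)
    cache:   last len(weights) - 1 frames of the previous chunk
    Returns (outputs, new_cache) so decoding can continue chunk by chunk.
    """
    k = len(weights)
    context = cache + x                      # prepend cached left context
    out = [
        sum(w * context[i + j] for j, w in enumerate(weights))
        for i in range(len(x))
    ]
    return out, context[-(k - 1):]           # carry new left context forward

# Streaming in two chunks matches one full left-padded pass:
w = [0.25, 0.5, 1.0]
full, _ = causal_conv1d([1, 2, 3, 4], w, [0.0, 0.0])
y1, c = causal_conv1d([1, 2], w, [0.0, 0.0])
y2, _ = causal_conv1d([3, 4], w, c)
assert full == y1 + y2
```

The same bookkeeping generalizes to multi-channel Conv1d: the cache is simply a `(channels, kernel_size - 1)` slice of the previous chunk's input.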
Hi wangtiance, it looks like some of the files need to be formatted with black to match the requirements; could you please look into that? Thank you!
Reformatted. All checks have passed now.
LGTM
This recipe is intended for streaming ASR on very low-cost devices, with 1-2M model parameters and fewer than 0.1 GOPS. It uses a small convolutional network as the encoder, is trained with combined transducer and CTC losses, and supports both phone and BPE lexicons. The encoder consists of 2 subsampling layers followed by a stack of Conv1d-batchnorm-activation-causal-squeeze-excite blocks, with an optional skip-add. It's somewhat similar to CitriNet and ContextNet, but even smaller.
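The "causal" squeeze-excite in those blocks replaces the usual global average pooling with a running, past-only average, so the gate at time t depends only on frames up to t and the module stays streamable. A toy single-channel sketch of that idea (a hypothetical simplification, not the recipe's actual module, which would use small FC layers per channel):

```python
import math

def causal_squeeze_excite(x, w=1.0, b=0.0):
    """Gate each frame by a sigmoid of the running mean of past frames.

    x:    list of single-channel activations
    w, b: toy gate parameters (a real module uses two small FC layers)
    """
    out, running_sum = [], 0.0
    for t, v in enumerate(x):
        running_sum += v
        pooled = running_sum / (t + 1)        # causal average pooling
        gate = 1.0 / (1.0 + math.exp(-(w * pooled + b)))
        out.append(v * gate)                  # re-scale the frame
    return out

y = causal_squeeze_excite([1.0, -1.0, 2.0])
```

Because the pooled statistic only accumulates past frames, each output can be emitted as soon as its input frame arrives, which is what makes the block usable for streaming.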
For WERs and more details, see README.md. Note that transducer decoding does NOT use an external LM, so its WER looks higher than CTC's.