Add Conformer RNN-T LibriSpeech training recipe #2329

hwangjeff · 2022-04-12T04:08:11Z

Adds Conformer RNN-T LibriSpeech training recipe to examples directory.

Produces 30M-parameter model that achieves the following WER:

	WER
test-clean	0.0310
test-other	0.0805
dev-clean	0.0314
dev-other	0.0827

facebook-github-bot · 2022-04-12T14:34:32Z

@hwangjeff has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

examples/asr/librispeech_conformer_rnnt/train.py

examples/asr/librispeech_conformer_rnnt/lightning.py

facebook-github-bot · 2022-04-12T20:28:56Z

@hwangjeff has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

examples/asr/librispeech_conformer_rnnt/lightning.py

examples/asr/librispeech_conformer_rnnt/README.md

xiaohui-zhang · 2022-04-13T02:49:14Z

examples/asr/librispeech_conformer_rnnt/lightning.py

+    ):
+        super().__init__()
+
+        self.model = conformer_rnnt_base()


I just realized num_symbols is hardcoded as 1024 inside conformer_rnnt_base. IMO conformer_rnnt_base should only be used in test cases. Here we should initiate the model explicitly via conformer_rnnt_model, and pass self.sp_model.get_piece_size() to "num_symbols", and consider exposing more inputs to the input of LightningModule later on.

examples/asr/librispeech_conformer_rnnt/lightning.py

examples/asr/librispeech_conformer_rnnt/train.py

examples/asr/librispeech_conformer_rnnt/lightning.py

facebook-github-bot · 2022-04-13T19:57:08Z

@hwangjeff has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2022-04-13T22:12:09Z

@hwangjeff has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2022-04-13T22:27:03Z

@hwangjeff has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

mthrok

Stamp.

github-actions · 2022-04-13T23:57:42Z

Hey @hwangjeff.
You merged this PR, but labels were not properly added. Please add a primary and secondary label (See https://github.com/pytorch/audio/blob/main/.github/process_commit.py)

Summary: Adds Conformer RNN-T LibriSpeech training recipe to examples directory. Produces 30M-parameter model that achieves the following WER: | | WER | |:-------------------:|-------------:| | test-clean | 0.0310 | | test-other | 0.0805 | | dev-clean | 0.0314 | | dev-other | 0.0827 | Pull Request resolved: pytorch#2329 Reviewed By: xiaohui-zhang Differential Revision: D35578727 Pulled By: hwangjeff fbshipit-source-id: afa9146c5b647727b8605d104d928110a1d3976d

hwangjeff added 8 commits April 12, 2022 03:34

Add Conformer RNN-T LibriSpeech training recipe

c893f18

conformer tweaks

87fe3e7

some recipe mods

6bc3f2c

manual optimization mods

e1cc251

conformer mods

a86d0d8

modify recipe to use revised conformer

f14ddf4

remove comments

87c2e5c

Add LibriSpeech Conformer RNN-T example recipe

a088e51

facebook-github-bot added the CLA Signed label Apr 12, 2022

lint

dda2ce3

hwangjeff force-pushed the librispeech_conformer_rnnt branch from 745e36d to dda2ce3 Compare April 12, 2022 04:23

add eval wer

c747eca

hwangjeff marked this pull request as ready for review April 12, 2022 14:33

hwangjeff requested review from mthrok, xiaohui-zhang, nateanl and carolineechen April 12, 2022 14:33

nateanl reviewed Apr 12, 2022

View reviewed changes

address comments

30459cc

xiaohui-zhang approved these changes Apr 13, 2022

View reviewed changes

address feedback

854ece9

hwangjeff force-pushed the librispeech_conformer_rnnt branch from 6fd974d to 854ece9 Compare April 13, 2022 19:45

add assert and comment to lightning module

3437eef

hwangjeff force-pushed the librispeech_conformer_rnnt branch from da751eb to 3437eef Compare April 13, 2022 22:26

mthrok approved these changes Apr 13, 2022

View reviewed changes

facebook-github-bot closed this in c262758 Apr 13, 2022

hwangjeff added new feature example module: models and removed new feature labels Apr 14, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Conformer RNN-T LibriSpeech training recipe #2329

Add Conformer RNN-T LibriSpeech training recipe #2329

hwangjeff commented Apr 12, 2022 •

edited

Loading

facebook-github-bot commented Apr 12, 2022

facebook-github-bot commented Apr 12, 2022

xiaohui-zhang Apr 13, 2022

facebook-github-bot commented Apr 13, 2022

facebook-github-bot commented Apr 13, 2022

facebook-github-bot commented Apr 13, 2022

mthrok left a comment

github-actions bot commented Apr 13, 2022

Add Conformer RNN-T LibriSpeech training recipe #2329

Add Conformer RNN-T LibriSpeech training recipe #2329

Conversation

hwangjeff commented Apr 12, 2022 • edited Loading

facebook-github-bot commented Apr 12, 2022

facebook-github-bot commented Apr 12, 2022

xiaohui-zhang Apr 13, 2022

Choose a reason for hiding this comment

facebook-github-bot commented Apr 13, 2022

facebook-github-bot commented Apr 13, 2022

facebook-github-bot commented Apr 13, 2022

mthrok left a comment

Choose a reason for hiding this comment

github-actions bot commented Apr 13, 2022

hwangjeff commented Apr 12, 2022 •

edited

Loading