add conformer configs for hat model #6372
Conversation
Signed-off-by: andrusenkoau <andrusenkoau@gmail.com>
It looks fine. I wonder if we want to create another directory for it, such as asr/conf/conformer_hat, since it is technically a different model.
I am fine with this approach too, but would like @VahidooX to review and comment on what his preferences are for Conformer HAT.
It might be preferable to have a subdirectory: conf/conformer/hat/conformer_hat_*.yaml?
Looks good to me as long as they have their own separate folder under conformer.
Signed-off-by: andrusenkoau <andrusenkoau@gmail.com>
Signed-off-by: andrusenkoau <andrusenkoau@gmail.com>
Signed-off-by: andrusenkoau <andrusenkoau@gmail.com>
Signed-off-by: andrusenkoau <andrusenkoau@gmail.com>
Signed-off-by: andrusenkoau <andrusenkoau@gmail.com>
@@ -75,7 +75,7 @@ Key Features
 * Speech processing
 * `HuggingFace Space for Audio Transcription (File, Microphone and YouTube) <https://huggingface.co/spaces/smajumdar/nemo_multilingual_language_id>`_
 * `Automatic Speech Recognition (ASR) <https://docs.nvidia.com/deeplearning/nemo/user-guide/docs/en/main/asr/intro.html>`_
-* Supported models: Jasper, QuartzNet, CitriNet, Conformer-CTC, Conformer-Transducer, Squeezeformer-CTC, Squeezeformer-Transducer, ContextNet, LSTM-Transducer (RNNT), LSTM-CTC, FastConformer-CTC, FastConformer-Transducer...
+* Supported models: Jasper, QuartzNet, CitriNet, Conformer-CTC, Conformer-Transducer, Squeezeformer-CTC, Squeezeformer-Transducer, ContextNet, LSTM-Transducer (RNNT), LSTM-CTC, FastConformer-CTC, FastConformer-Transducer, Conformer-HAT...
Would you please also update the following statement to include Hybrid ASR:
Supports CTC, Transducer/RNNT and Hybrid losses/decoders
Done
LGTM! Just left two minor comments.
docs/source/asr/models.rst
Outdated
.. _Conformer-HAT_model:

Conformer-HAT (Hybrid Autoregressive Transducer)
--------------------------------------
The underline should be the same length as the title.
Done
The Conformer HAT model (not to be confused with Hybrid-Transducer-CTC) is a modification of the Conformer-Transducer model based on the `Google paper <https://arxiv.org/abs/2003.07705>`_.
The main idea is to separate the label and blank score predictions, which makes it possible to estimate the internal LM probabilities during decoding.
When an external LM is available for inference, the internal LM can be subtracted from the HAT model prediction during beam search decoding to improve the effectiveness of the external LM.
This can be helpful for text-only adaptation to new domains.
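As a rough illustration of how that subtraction can combine with an external LM score during beam search (a minimal sketch, not the NeMo implementation; `ilm_weight` and `lm_weight` are hypothetical interpolation weights chosen only for the example):

```python
import math

def hat_beam_score(log_p_hat: float, log_p_ilm: float, log_p_ext_lm: float,
                   ilm_weight: float = 0.3, lm_weight: float = 0.5) -> float:
    """Score one candidate label expansion during beam search.

    log_p_hat    -- label log-probability from the HAT model
    log_p_ilm    -- internal LM log-probability estimated from the HAT decoder/joint
    log_p_ext_lm -- log-probability from the external (e.g. n-gram) LM
    """
    # Subtract the internal LM estimate and add the external LM score,
    # as described in the paragraph above; the weights are illustrative only.
    return log_p_hat - ilm_weight * log_p_ilm + lm_weight * log_p_ext_lm

# Example: a candidate with strong external-LM support gets a higher score.
print(hat_beam_score(math.log(0.4), math.log(0.3), math.log(0.6)))
```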
How can users use this feature?
Do the current LM scripts support it?
By default, the Conformer HAT model works at decoding time as a standard Transducer model with the same interface. However, if you have an external n-gram LM, you can use the scripts/asr_language_modeling/ngram_lm/eval_beamsearch_ngram_transducer.py script. An updated version of that script is under review in #6370.
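For plain inference without an external LM, a HAT checkpoint can be used like any other Transducer model; a minimal sketch, assuming a trained checkpoint and an audio file (both file names are hypothetical, not from this PR):

```python
import nemo.collections.asr as nemo_asr

# Restore a trained Conformer-HAT checkpoint (path is hypothetical).
model = nemo_asr.models.EncDecRNNTBPEModel.restore_from("conformer_hat_bpe.nemo")

# The standard Transducer transcription interface applies unchanged.
transcripts = model.transcribe(["sample_audio.wav"])
print(transcripts)
```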
@VahidooX -- could you approve the PR if everything is OK?
* add conformer configs for hat model

Signed-off-by: andrusenkoau <andrusenkoau@gmail.com>
Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu>
What does this PR do?
Add conformer char and bpe configs for hat model (https://arxiv.org/abs/2003.07705)
Collection: [ASR]
Before your PR is "Ready for review"
Pre checks:
PR Type: