Use Label-Looping algorithm for RNN-T decoding by default #8831

artbataev · 2024-04-05T13:57:56Z

What does this PR do ?

Enable Label-Looping algorithm introduced in #8286 and #7926 (loop_labels=True) by default for RNN-T greedy decoding.

Collection: [ASR]

Changelog

Enable Label-Looping algorithm by default (loop_labels=true)
fix Label Looping algorithm for RNN-T/TDT + Stateless network
fix tests with custom RNNT Decoder
parametrize tests for batched greedy decoding to test both algorithms (Frame-/Label-Looping)

Usage

Label-Looping algorithm is used by default now for batched greedy decoding.

For Frame-Looping algorithm one can use:

python examples/asr/speech_to_text_eval.py  <...> \
 rnnt_decoding.greedy.loop_labels=false

Jenkins CI

To run Jenkins, a NeMo User with write access must comment jenkins on the PR.

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
Did you add or update any necessary documentation?
Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
- Reviewer: Does the PR have correct import guards for all optional libraries?

PR Type:

New Feature
Bugfix
Documentation

If you haven't finished some of the above items you can still open "Draft" PR.

Who can review?

Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.

Additional Information

Related to # (issue)

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

artbataev · 2024-04-05T19:03:49Z

Not ready yet

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

tests/collections/asr/test_asr_rnnt_encdec_model.py

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

artbataev · 2024-04-08T18:25:37Z

jenkins

artbataev · 2024-04-09T07:08:12Z

nemo/collections/asr/modules/rnnt.py

-        state = [torch.ones([batch, self.context_size], dtype=torch.long, device=y.device) * self.blank_idx]
+        # state contains context_size - 1 elements for each utterance in batch,
+        # consistent with the state returned from StatelessNet.forward
+        state = [torch.ones([batch, self.context_size - 1], dtype=torch.long, device=y.device) * self.blank_idx]


@hainan-xv, please, confirm that I broke nothing when fixing state for the Stateless decoder.
We need the state with the constant size (to allow replacements when we found the end of utterance), and forward returns the state of size [batch_size, context_size - 1]

FYI, you can also use torch.full instead of torch.ones followed by multiplication. No need to change it though.

titu1994

Overall looks great, nice work !

titu1994 · 2024-04-09T19:23:05Z

tests/collections/asr/test_asr_rnnt_encdec_model.py

@@ -73,7 +73,7 @@ def predict(
                return (
                    output,
                    [
-                        torch.tensor([0] * self.vocab_size + [1], dtype=torch.float32)[None, None, :].exand(
+                        torch.tensor([0] * self.vocab_size + [1], dtype=torch.float32)[None, None, :].expand(


How how did this test pass with this error ?

I think this case is redundant and never executed in decoding, but we need to implement this to conform the interface where y is optional (see AbstractRNNTDecoder.predict)

galv

Late review with a few FYI comments. Do we test the cuda graphs implementation with stateless transducers yet?

galv · 2024-04-10T21:19:38Z

nemo/collections/asr/modules/rnnt.py

-        state = [torch.ones([batch, self.context_size], dtype=torch.long, device=y.device) * self.blank_idx]
+        # state contains context_size - 1 elements for each utterance in batch,
+        # consistent with the state returned from StatelessNet.forward
+        state = [torch.ones([batch, self.context_size - 1], dtype=torch.long, device=y.device) * self.blank_idx]


FYI, you can also use torch.full instead of torch.ones followed by multiplication. No need to change it though.

galv · 2024-04-10T21:24:21Z

tests/collections/asr/test_asr_rnnt_encdec_model.py

+            return [
+                torch.tensor([0] * self.vocab_size + [1], dtype=torch.float32)[None, None, :]
+                .expand([1, batch_size, -1])
+                .clone()


torch.repeat or torch.repeat_interleave is probably the better way to do it than expand followed by clone.

artbataev · 2024-04-11T13:15:09Z

Late review with a few FYI comments. Do we test the cuda graphs implementation with stateless transducers yet?

Thanks for the comments, @galv! I will make fixes in next PRs.
Cuda graphs implementation with stateless prediction network is not tested on CI for now (no production model), but I tested loop labels + cuda graphs locally with a custom model, and it works.

* Use Label-Looping algorithm for RNN-T decoding by default * Fix loop labels + stateless decoding --------- Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

* Use Label-Looping algorithm for RNN-T decoding by default * Fix loop labels + stateless decoding --------- Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: jxin <jxin@nvidia.com>

* Use Label-Looping algorithm for RNN-T decoding by default * Fix loop labels + stateless decoding --------- Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Ao Tang <aot@nvidia.com>

* Use Label-Looping algorithm for RNN-T decoding by default * Fix loop labels + stateless decoding --------- Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

Use Label-Looping algorithm for RNN-T decoding by default

7f1c73f

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

artbataev requested review from titu1994 and galv April 5, 2024 13:57

github-actions bot added the ASR label Apr 5, 2024

artbataev marked this pull request as draft April 5, 2024 18:12

Parametrize decoding tests with loop_labels

0b68c73

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

artbataev added 4 commits April 8, 2024 21:02

Fix loop labels + stateless decoding

64cf99a

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

Fix tests with custom RNNT decoder

a884cb6

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

Merge branch 'main' into rnnt_decoding_loop_labels_default

d86b209

Unify RNN-T and TDT decoding code

158e08c

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

github-advanced-security bot found potential problems Apr 8, 2024

View reviewed changes

tests/collections/asr/test_asr_rnnt_encdec_model.py Fixed Show fixed Hide fixed

Fix classmethod

4a977c4

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

artbataev marked this pull request as ready for review April 9, 2024 07:06

artbataev requested a review from hainan-xv April 9, 2024 07:06

artbataev commented Apr 9, 2024

View reviewed changes

titu1994 approved these changes Apr 9, 2024

View reviewed changes

artbataev merged commit b33af25 into main Apr 10, 2024
127 checks passed

artbataev deleted the rnnt_decoding_loop_labels_default branch April 10, 2024 17:03

galv reviewed Apr 10, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use Label-Looping algorithm for RNN-T decoding by default #8831

Use Label-Looping algorithm for RNN-T decoding by default #8831

artbataev commented Apr 5, 2024 •

edited

Loading

artbataev commented Apr 5, 2024

artbataev commented Apr 8, 2024

artbataev Apr 9, 2024

hainan-xv Apr 10, 2024

galv Apr 10, 2024

titu1994 left a comment

titu1994 Apr 9, 2024

artbataev Apr 10, 2024

galv left a comment

galv Apr 10, 2024

galv Apr 10, 2024

artbataev commented Apr 11, 2024 •

edited

Loading

Use Label-Looping algorithm for RNN-T decoding by default #8831

Use Label-Looping algorithm for RNN-T decoding by default #8831

Conversation

artbataev commented Apr 5, 2024 • edited Loading

What does this PR do ?

Changelog

Usage

Jenkins CI

Before your PR is "Ready for review"

Who can review?

Additional Information

artbataev commented Apr 5, 2024

artbataev commented Apr 8, 2024

artbataev Apr 9, 2024

Choose a reason for hiding this comment

hainan-xv Apr 10, 2024

Choose a reason for hiding this comment

galv Apr 10, 2024

Choose a reason for hiding this comment

titu1994 left a comment

Choose a reason for hiding this comment

titu1994 Apr 9, 2024

Choose a reason for hiding this comment

artbataev Apr 10, 2024

Choose a reason for hiding this comment

galv left a comment

Choose a reason for hiding this comment

galv Apr 10, 2024

Choose a reason for hiding this comment

galv Apr 10, 2024

Choose a reason for hiding this comment

artbataev commented Apr 11, 2024 • edited Loading

artbataev commented Apr 5, 2024 •

edited

Loading

artbataev commented Apr 11, 2024 •

edited

Loading