
make sure inputs live on CPU for ctc decoder #2289

Closed

Conversation

@xiaohui-zhang (Contributor) commented Mar 24, 2022

Addressing issue #2274:
Raise a RuntimeError when the input tensors to the CTC decoder are GPU tensors, since the CTC decoder only runs on CPU. Also update the data type check to use "raise" rather than "assert".


Pull Request resolved: #2289
GitHub Author: xiaohui-zhang xiaohuizhang@fb.com
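
For context, a hypothetical user-side sketch of the behavior this PR targets. The `ctc_decoder` factory and import path follow later torchaudio releases, and the lexicon/token files are placeholders, not the repro from #2274:

import torch
from torchaudio.models.decoder import ctc_decoder  # import path varies by torchaudio version

decoder = ctc_decoder(lexicon="lexicon.txt", tokens="tokens.txt")  # placeholder files

emissions = torch.rand(1, 100, 32, device="cuda")  # acoustic-model output on GPU
# The decoder runs on CPU only; after this PR, passing a GPU tensor raises a
# RuntimeError up front instead of failing deep inside the decoder.
hypotheses = decoder(emissions.cpu())  # moving inputs to CPU is the caller-side fix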

B, T, N = emissions.size()
if lengths is None:
    lengths = torch.full((B,), T)

assert not emissions.is_cuda
assert not lengths.is_cuda
Collaborator:

The logic itself looks good. A couple of suggestions (a sketch combining all three follows the list).

  1. For better UX, could you provide an error message that tells users what to do (as opposed to what was wrong)?
  2. Please perform the input validation as early as possible, before any other operation. Otherwise the work done before validation could be wasteful.
  3. This is not a written rule, but assert is more for internal assertions (although it is used in L140, which I guess was missed at review time). So can you replace it with if <condition>: raise <something>?
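
A minimal sketch of what the three suggestions amount to together; the error wording here is illustrative, not the final text of the PR:

def __call__(self, emissions, lengths=None):
    # (2) validate first, before any other work touches the inputs
    # (3) plain `if ...: raise`, reserving `assert` for internal invariants
    if emissions.dtype != torch.float32:
        raise ValueError("emissions must be float32.")
    # (1) actionable message: tell the user what to do, not just what was wrong
    if emissions.is_cuda:
        raise RuntimeError("emissions must be a CPU tensor; call emissions.cpu() first.")
    if lengths is not None and lengths.is_cuda:
        raise RuntimeError("lengths must be a CPU tensor; call lengths.cpu() first.")

    B, T, N = emissions.size()
    if lengths is None:
        lengths = torch.full((B,), T)
    ...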

@@ -129,20 +129,24 @@ def __call__(self, emissions: torch.FloatTensor, lengths: Optional[torch.Tensor]

Args:
emissions (torch.FloatTensor): tensor of shape `(batch, frame, num_tokens)` storing sequences of
probability distribution over labels; output of acoustic model
probability distribution over labels; output of acoustic model. It must lives on CPU.
Contributor:

Suggested change
probability distribution over labels; output of acoustic model. It must lives on CPU.
probability distribution over labels; output of acoustic model. It must live on CPU.

lengths (Tensor or None, optional): tensor of shape `(batch, )` storing the valid length of
in time axis of the output Tensor in each batch
in time axis of the output Tensor in each batch. It must lives on CPU.
Contributor:

Suggested change
in time axis of the output Tensor in each batch. It must lives on CPU.
in time axis of the output Tensor in each batch. It must live on CPU.

@xiaohui-zhang (Contributor, Author) commented:

Thanks @carolineechen and @mthrok for the thorough review of my first PR!

    raise ValueError('emissions must be float32.')

if emissions.is_cuda:
    raise Exception('emissions must live on CPU.')
Contributor:

It'd be better to raise a more specific exception, e.g. RuntimeError at minimum.
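
For illustration, the practical difference on the caller side (hypothetical `decoder` and `emissions`, not part of this diff): a specific exception type can be handled without swallowing unrelated failures, which a bare `Exception` would force.

try:
    hypotheses = decoder(emissions)
except RuntimeError:
    # Catchable without a blanket `except Exception`, which would also
    # hide unrelated bugs; retry with inputs moved to CPU.
    hypotheses = decoder(emissions.cpu())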

@@ -129,20 +129,28 @@ def __call__(self, emissions: torch.FloatTensor, lengths: Optional[torch.Tensor]

Args:
emissions (torch.FloatTensor): tensor of shape `(batch, frame, num_tokens)` storing sequences of
probability distribution over labels; output of acoustic model
probability distribution over labels; output of acoustic model. It must live on CPU.
Contributor:

perhaps just "CPU tensor of shape (batch, frame, num_tokens) storing sequences of probability distribution over labels; output of acoustic model."? sounds a little more concrete than "live on CPU"

and so forth below

B, T, N = emissions.size()
if lengths is None:
    lengths = torch.full((B,), T)

Contributor:

can remove extra whitespace

    raise ValueError("emissions must be float32.")

if emissions.is_cuda:
    raise RuntimeError("emissions must live on CPU.")
Contributor:

similar nit as above

Suggested change
raise RuntimeError("emissions must live on CPU.")
raise RuntimeError("emissions must be a CPU tensor.")

    raise RuntimeError("emissions must live on CPU.")

if lengths is not None and lengths.is_cuda:
    raise RuntimeError("lengths must live on CPU.")
Contributor:

and here

Suggested change
raise RuntimeError("lengths must live on CPU.")
raise RuntimeError("lengths must be a CPU tensor.")
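
Putting the review feedback together, the validation block would read roughly like this; a sketch of the end state, not the verbatim merged diff:

if emissions.dtype != torch.float32:
    raise ValueError("emissions must be float32.")

if emissions.is_cuda:
    raise RuntimeError("emissions must be a CPU tensor.")

if lengths is not None and lengths.is_cuda:
    raise RuntimeError("lengths must be a CPU tensor.")

B, T, N = emissions.size()
if lengths is None:
    lengths = torch.full((B,), T)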

@facebook-github-bot (Contributor):

@xiaohui-zhang has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@github-actions:

Hey @xiaohui-zhang.
You merged this PR, but labels were not properly added. Please add a primary and secondary label (See https://github.com/pytorch/audio/blob/main/.github/process_commit.py)

xiaohui-zhang added a commit to xiaohui-zhang/audio that referenced this pull request May 4, 2022
Summary:
Addressing issue pytorch#2274:
Raise a RuntimeError when the input tensors to the CTC decoder are GPU tensors, since the CTC decoder only runs on CPU. Also update the data type check to use "raise" rather than "assert".

 ---
Pull Request resolved: pytorch#2289

Reviewed By: mthrok

Differential Revision: D35255630

Pulled By: xiaohui-zhang

fbshipit-source-id: d6c6e88d9ad4b9690bb741557fa9a9504e60872e