
Fix prelu_backward TensorIterator split #36134

Closed
zasdfgbnm wants to merge 7 commits into pytorch:master from zasdfgbnm:zasdfgbnm-patch-6

Conversation

zasdfgbnm
Collaborator

@zasdfgbnm zasdfgbnm commented Apr 7, 2020

We should have

    for (auto& sub_iter : iter.with_32bit_indexing()) {
      launch_prelu_cuda_backward_share_weights_kernel(sub_iter, weight_data);
    }

But I mistakenly wrote it as

    for (auto& sub_iter : iter.with_32bit_indexing()) {
      launch_prelu_cuda_backward_share_weights_kernel(iter, weight_data);
    }

in my previous PR, which leads to infinite recursion.
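
For context, here is a minimal sketch of the launcher's structure (simplified: the real function is templated on the scalar type and takes more parameters than shown). The recursion is supposed to bottom out once a sub-iterator fits 32-bit indexing, so passing the unsplit `iter` back in never terminates:

    #include <ATen/native/TensorIterator.h>

    // Simplified sketch, not the actual ATen source.
    void launch_prelu_cuda_backward_share_weights_kernel(
        at::TensorIterator& iter, const float* weight_data) {
      if (!iter.can_use_32bit_indexing()) {
        for (auto& sub_iter : iter.with_32bit_indexing()) {
          // Must recurse on `sub_iter`; recursing on `iter` re-enters this
          // branch with the same too-large iterator, forever.
          launch_prelu_cuda_backward_share_weights_kernel(sub_iter, weight_data);
        }
        return;
      }
      // ... launch the CUDA kernel; `iter` is now 32-bit indexable ...
    }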

I found this bug while working on #34004.

I also add a `TORCH_INTERNAL_ASSERT_DEBUG_ONLY` to catch this in debug builds.

Besides, the caller already guarantees contiguous operands, so we don't need to handle non-contiguous tensors.
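
Roughly, the checks described above sit in the non-splitting branch of the launcher; treating the debug-only assert as guarding the 32-bit-indexing condition is my assumption, while the contiguity assert matches the diff shown later in this conversation:

    // Both macros come from c10/util/Exception.h.
    // Debug-only: catches a regressed split without any cost in release builds.
    TORCH_INTERNAL_ASSERT_DEBUG_ONLY(iter.can_use_32bit_indexing());
    // The caller builds the iterator from contiguous tensors, so contiguity is
    // an invariant here rather than a case to handle.
    TORCH_INTERNAL_ASSERT(iter.is_contiguous());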

@dr-ci

dr-ci bot commented Apr 7, 2020

💊 Build failures summary and remediations

As of commit dcdbc28 (more details on the Dr. CI page):


None of the build failures appear to be your fault 💚



❄️ 1 tentatively flaky failure

1 failure tentatively classified as flaky but reruns have not yet been triggered to confirm:

See CircleCI build pytorch_linux_xenial_cuda10_2_cudnn7_py3_gcc7_test (1/1)

Step: "Set Up CI Environment After attach_workspace" (full log | pattern match details | 🔁 rerun) ❄️

WARNING: infoROM is corrupted at gpu 0000:00:1D.0
|   1  Tesla M60           Off  | 00000000:00:1E.0 Off |                    0 | 
| N/A   28C    P0    37W / 150W |      0MiB /  7618MiB |     98%      Default | 
+-------------------------------+----------------------+----------------------+ 
                                                                                
+-----------------------------------------------------------------------------+ 
| Processes:                                                       GPU Memory | 
|  GPU       PID   Type   Process name                             Usage      | 
|=============================================================================| 
|  No running processes found                                                 | 
+-----------------------------------------------------------------------------+ 
WARNING: infoROM is corrupted at gpu 0000:00:1D.0 


@zasdfgbnm
Collaborator Author

Wait, please hold on for now. It seems the tests for prelu backward are missing some cases; I will add them later in this PR.

@ailzhang
Contributor

ailzhang commented Apr 7, 2020

@zasdfgbnm If you rebase on top of pytorch/master, the XLA failure should be gone as well. Thanks!

@gchanan gchanan added the triaged label (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module) Apr 8, 2020
@zasdfgbnm zasdfgbnm changed the title Fix prelu_backward TensorIterator split [WIP] Fix prelu_backward TensorIterator split Apr 14, 2020
@zasdfgbnm zasdfgbnm changed the title [WIP] Fix prelu_backward TensorIterator split Fix prelu_backward TensorIterator split Apr 22, 2020
    OffsetCalculator<2>(iter.ndim(), iter.shape().data(), out_strides.data())
    );
    }
    TORCH_INTERNAL_ASSERT(iter.is_contiguous());
Collaborator Author


The caller of `launch_prelu_cuda_backward_share_weights_kernel` already guarantees contiguous operands.
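
As a hedged illustration of that guarantee (not the actual ATen call site; the tensor names and the `TensorIteratorConfig` usage are only for illustration), the backward entry point can materialize contiguous operands before building the iterator, so the launcher never sees non-contiguous memory:

    // Hypothetical call site; relevant headers would be <ATen/ATen.h> and
    // <ATen/native/TensorIterator.h>.
    at::Tensor input_c = input.contiguous();
    at::Tensor grad_out_c = grad_out.contiguous();
    auto iter = at::TensorIteratorConfig()
                    .add_output(input_grad)   // pre-allocated contiguous output
                    .add_input(input_c)
                    .add_input(grad_out_c)
                    .build();
    // With contiguous operands, TORCH_INTERNAL_ASSERT(iter.is_contiguous())
    // in the kernel launcher cannot fire in normal use.
    launch_prelu_cuda_backward_share_weights_kernel(iter, weight_data);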

@zasdfgbnm
Collaborator Author

@VitalyFedyunin This PR is now ready. This is a bug fix for my previous prelu CUDA_tensor_apply4 PR.

Contributor

@facebook-github-bot facebook-github-bot left a comment


@VitalyFedyunin has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@zasdfgbnm zasdfgbnm deleted the zasdfgbnm-patch-6 branch April 23, 2020 17:46
@facebook-github-bot
Contributor

@VitalyFedyunin merged this pull request in 438aed6.

Labels: Merged, open source, triaged
7 participants