fix out_of_range bug of multinomial op's cuda kernel #36511

pangyoki · 2021-10-18T08:50:53Z

PR types

Bug fixes

PR changes

OPs

Describe

A bug in sample method (multinomial op) of Categorical API is mentioned in issue #36401.

issue

test case

import paddle
with paddle.no_grad():
    actor=paddle.nn.Sequential(paddle.nn.Linear(20, 2800))
    logits=actor(paddle.rand([20]))
    cat=paddle.distribution.Categorical(logits.exp())
    print(cat.sample([1]))

error message

Error: /paddle/paddle/fluid/operators/multinomial_op.cu:42 Assertion `in_data[id] >= 0.0` failed. The input of multinomial distribution should be >= 0, but got -0.038249.
（省略类似错误）
Error: /paddle/paddle/fluid/operators/multinomial_op.cu:42 Assertion `in_data[id] >= 0.0` failed. The input of multinomial distribution should be >= 0, but got -0.044257.
Traceback (most recent call last):
  File "bug.py", line 6, in <module>
    print(cat.sample([1]))
  File "/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/paddle/distribution.py", line 771, in sample
    sample_index = multinomial(logits, num_samples, True)
  File "/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/paddle/tensor/random.py", line 133, in multinomial
    'replacement', replacement)
SystemError: (Fatal) Operator multinomial raises an thrust::system::system_error exception.
The exception content is
:transform: failed to synchronize: cudaErrorLaunchFailure: unspecified launch failure. (at /paddle/paddle/fluid/imperative/tracer.cc:192)

Reason for error

In the cuda kernel implementation, the number of threads exceeding the size of the input array will be used to perform calculations (the reason is that the block size is limited, when setting the grid size, more threads will be set for rounding).

However, when the cuda kernel calculates, it does not limit the array subscripts. As a result, when calculating in the thread, the space exceeding the size of the array is accessed, causing an error.

bug fix

Restrictions on the subscripts of the accessed arrays.

paddle-bot-old · 2021-10-18T08:51:13Z

✅ This PR's description meets the template requirements!
Please wait for other CI results.

paddle-bot-old · 2021-10-18T08:51:18Z

Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

zhiqiu

LGTM

paddle-bot-old · 2021-10-26T02:35:11Z

Sorry to inform you that 0641bbc's CIs have passed for more than 7 days. To prevent PR conflicts, you need to re-run all CIs manually.

… fix-multinomial-op

chenwhql

LGTM for PADDLE_ENFORCE

Avin0323

LGTM for PR-CI-OP-benchmark

…ernel (#36511) (#36808) Cherry-pick PR #36511

add unittest

0641bbc

zhiqiu approved these changes Oct 22, 2021

View reviewed changes

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

16c8194

… fix-multinomial-op

chenwhql approved these changes Oct 27, 2021

View reviewed changes

Avin0323 approved these changes Oct 27, 2021

View reviewed changes

pangyoki merged commit 51a3396 into PaddlePaddle:develop Oct 27, 2021

pangyoki added a commit to pangyoki/Paddle that referenced this pull request Oct 27, 2021

add unittest (PaddlePaddle#36511)

e0874eb

pangyoki mentioned this pull request Oct 27, 2021

【Cherry-pick PR 36511】fix out_of_range bug of multinomial op's cuda kernel #36808

Merged

lanxianghit pushed a commit that referenced this pull request Oct 28, 2021

【Cherry-pick PR 36511】fix out_of_range bug of multinomial op's cuda k…

d8ffb26

…ernel (#36511) (#36808) Cherry-pick PR #36511

ghost pushed a commit to piotrekobi/Paddle that referenced this pull request Nov 3, 2021

add unittest (PaddlePaddle#36511)

393eaa5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix out_of_range bug of multinomial op's cuda kernel #36511

fix out_of_range bug of multinomial op's cuda kernel #36511

pangyoki commented Oct 18, 2021 •

edited

Loading

paddle-bot-old bot commented Oct 18, 2021 •

edited

Loading

paddle-bot-old bot commented Oct 18, 2021

zhiqiu left a comment

paddle-bot-old bot commented Oct 26, 2021

chenwhql left a comment

Avin0323 left a comment

fix out_of_range bug of multinomial op's cuda kernel #36511

fix out_of_range bug of multinomial op's cuda kernel #36511

Conversation

pangyoki commented Oct 18, 2021 • edited Loading

PR types

PR changes

Describe

issue

Reason for error

bug fix

paddle-bot-old bot commented Oct 18, 2021 • edited Loading

paddle-bot-old bot commented Oct 18, 2021

zhiqiu left a comment

Choose a reason for hiding this comment

paddle-bot-old bot commented Oct 26, 2021

chenwhql left a comment

Choose a reason for hiding this comment

Avin0323 left a comment

Choose a reason for hiding this comment

pangyoki commented Oct 18, 2021 •

edited

Loading

paddle-bot-old bot commented Oct 18, 2021 •

edited

Loading