-
Notifications
You must be signed in to change notification settings - Fork 5.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
cudaSuccess == cudaStat (0 vs. 77) Cuda Error: an illegal memory access was encountered #1399
Labels
Comments
问题不能解决,代码用的最新的master下编译的 依然有这个问题 |
感觉这个额问题主要跟batch size有关系,越大越不容易出现,但是最终一定会出现。。 |
存在一个gpu卡分到的数据是0就会报这个错误吧, 比如4卡, 最后按照batchsize算只有3个sequence, 就会这样, 试试改改dataprovider |
wangxicoding
pushed a commit
to wangxicoding/Paddle
that referenced
this issue
Dec 9, 2021
Co-authored-by: smallv0221 <33639025+smallv0221@users.noreply.github.com>
lizexu123
pushed a commit
to lizexu123/Paddle
that referenced
this issue
Feb 23, 2024
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
多GPU训练的时候提示错误:
问题的具体表现形式是:
神经网络很简单,就是一个600000 * 256的fc layer。
trainer_config:
dataprovider:
The text was updated successfully, but these errors were encountered: