【Bug-Fix】 Fix compile bug of cudaxxxAsync in put_along_axis #60934
Your PR has been submitted successfully. Thank you for contributing to this open-source project!
cudaMemsetAsync(shared_mem, 0, sizeof(int) * grad_size, stream);
int64_t grid_memset = (grad_size + block - 1) / block;
CudaMemsetAsync<<<grid_memset, block, 0, stream>>>(
    shared_mem + grad_size, 1, sizeof(int) * grad_size);
When computing the gradient for mean, the element count should be initialized to 1; that is the shared_mem + grad_size part further down. It doesn't look like it is being set to 1 here.
Fixed.
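For context on the review comment above: `cudaMemsetAsync` (like `memset`) writes a byte value, so it can zero an `int` buffer but cannot set each `int` to 1. The following is a minimal host-side sketch of that difference, not the PR's code. The helper names `FillInts`, `CountsViaMemset`, and `CountsViaFill` are hypothetical, and a 4-byte `int` is assumed.

```cpp
#include <cstddef>
#include <cstring>
#include <vector>

// Hypothetical helper (not from the PR): set every int to `value`,
// element by element. On the GPU this role would be played by a fill kernel.
void FillInts(int* data, std::size_t n, int value) {
  for (std::size_t i = 0; i < n; ++i) data[i] = value;
}

// Byte-wise initialization, mirroring memset semantics: each BYTE becomes
// 0x01, so each 4-byte int becomes 0x01010101 (16843009), not 1.
std::vector<int> CountsViaMemset(std::size_t grad_size) {
  std::vector<int> counts(grad_size);
  std::memset(counts.data(), 1, sizeof(int) * grad_size);
  return counts;
}

// Element-wise initialization: each int really is 1, which is what a
// per-element count used for a mean needs as its starting value.
std::vector<int> CountsViaFill(std::size_t grad_size) {
  std::vector<int> counts(grad_size);
  FillInts(counts.data(), grad_size, 1);
  return counts;
}
```

Calling `CountsViaMemset(4)` and `CountsViaFill(4)` side by side shows why a byte-wise memset is fine for the zeroed half of a buffer but an element-wise fill is needed for the half that must start at 1.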
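The rest should be fine. Many thanks to the developers for helping fix this bug (my server's network is down and I can't pull the submodules, so I'm very grateful).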
5ef490b to 77a804f
LGTM, thanks to the developers for the help.
LGTM
* 【Hackathon 5th No.6】 Enhance the put_along_axis API for Paddle -part (#59674)
* fix bug of put_along_axis (#60551)
* Improve the performence of put_along_axis (#60618)
* fix bug of put_along_axis
* improve performence of put_along_axis
* [Bug-Fix] fix compile bug of cudaxxxAsync (#60934)
---------
Co-authored-by: YibLiu <68105073+YibinLiu666@users.noreply.github.com>
PR types
Bug fixes
PR changes
OPs
Description
card-79490
Resolves the issue reported in #60673 and fixes the compile bug introduced by #60551.