Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

【Bug-Fix】 Fix compile bug of cudaxxxAsync in put_along_axis #60934

Merged
merged 1 commit into from
Jan 23, 2024

Conversation

zhwesky2010
Copy link
Contributor

@zhwesky2010 zhwesky2010 commented Jan 18, 2024

PR types

Bug fixes

PR changes

OPs

Description

card-79490

解决 #60673 的问题,修复 #60551 引入的编译Bug

Copy link

paddle-bot bot commented Jan 18, 2024

你的PR提交成功,感谢你对开源项目的贡献!
请关注后续CI自动化测试结果,详情请参考Paddle-CI手册
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

cudaMemsetAsync(shared_mem, 0, sizeof(int) * grad_size, stream);
int64_t grid_memset = (grad_size + block - 1) / block;
CudaMemsetAsync<<<grid_memset, block, 0, stream>>>(
shared_mem + grad_size, 1, sizeof(int) * grad_size);
Copy link
Contributor

@YibinLiu666 YibinLiu666 Jan 18, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

求mean的梯度的时候元素个数初始化为1,也就是后面的shared_mem + grad_size部分,这里好像没有设置为1

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

已修改

@YibinLiu666
Copy link
Contributor

其他地方应该没有啥问题了。然后非常感谢研发大哥帮忙修bug(服务器的网挂了,拉不下submodule,给您磕一个

@zhwesky2010 zhwesky2010 force-pushed the fix_bug1 branch 2 times, most recently from 5ef490b to 77a804f Compare January 19, 2024 03:46
@YibinLiu666
Copy link
Contributor

LGTM,麻烦研发大哥了

Copy link
Contributor

@From00 From00 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@zhwesky2010 zhwesky2010 changed the title [Bug-Fix] fix compile bug of cudaxxxAsync due to put_along_axis [Bug-Fix] fix compile bug of cudaxxxAsync in put_along_axis Jan 19, 2024
@zhwesky2010 zhwesky2010 merged commit 064c414 into PaddlePaddle:develop Jan 23, 2024
29 checks passed
zhwesky2010 added a commit to zhwesky2010/Paddle that referenced this pull request Feb 26, 2024
@zhwesky2010 zhwesky2010 changed the title [Bug-Fix] fix compile bug of cudaxxxAsync in put_along_axis 【Bug-Fix】 fix compile bug of cudaxxxAsync in put_along_axis Feb 28, 2024
@zhwesky2010 zhwesky2010 changed the title 【Bug-Fix】 fix compile bug of cudaxxxAsync in put_along_axis 【Bug-Fix】 Fix compile bug of cudaxxxAsync in put_along_axis Feb 28, 2024
XiaoguangHu01 pushed a commit that referenced this pull request Feb 28, 2024
* 【Hackathon 5th No.6】 为 Paddle 增强put_along_axis API -part (#59674)

* fix bug of put_along_axis (#60551)

* Improve the performence of put_along_axis (#60618)

* fix bug of put_along_axis

* improve performence of put_along_axis

* [Bug-Fix] fix compile bug of cudaxxxAsync (#60934)

---------

Co-authored-by: YibLiu <68105073+YibinLiu666@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants