[NPU] add broadcast supporting for elementwise_add_op_npu #34057
Thanks for your contribution!
ops::ElementwiseAddGradNPUKernel<float>,
ops::ElementwiseAddGradNPUKernel<plat::float16>);
ops::ElementwiseAddGradWithAxisNPUKernel<float>,
ops::ElementwiseAddGradWithAxisNPUKernel<plat::float16>);
Is the original ElementwiseAddGradNPUKernel no longer called? If so, it can be deleted to avoid redundant code.
Deleted.
See the License for the specific language governing permissions and
limitations under the License. */
#pragma once
#ifdef PADDLE_WITH_ASCEND_CL
The #ifdef PADDLE_WITH_ASCEND_CL can be removed.
Updated.
self.y = np.random.rand(10, 1, 12, 1).astype(self.dtype)
self.out = self.x + self.y
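For reference, the expected output in this test case relies on NumPy-style broadcasting between self.x and self.y. The following is a minimal sketch of that shape rule in plain Python. The x shape used here is an assumption, since the line defining self.x is not shown in this hunk, and `broadcast_shape` is a hypothetical helper for illustration, not part of Paddle.

```python
def broadcast_shape(x_shape, y_shape):
    """Return the NumPy-style broadcast shape of two shapes, or raise ValueError."""
    rank = max(len(x_shape), len(y_shape))
    # Left-pad the shorter shape with 1s so both shapes have the same rank.
    x = (1,) * (rank - len(x_shape)) + tuple(x_shape)
    y = (1,) * (rank - len(y_shape)) + tuple(y_shape)
    out = []
    for dx, dy in zip(x, y):
        if dx == dy or dy == 1:
            out.append(dx)
        elif dx == 1:
            out.append(dy)
        else:
            raise ValueError(f"incompatible dims {dx} and {dy}")
    return tuple(out)


# Assumed x shape (hypothetical); the y shape is taken from the test case.
out_shape = broadcast_shape((10, 3, 1, 4), (10, 1, 12, 1))
print(out_shape)  # (10, 3, 12, 4)
```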
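The unit tests look mostly fine. If possible, please follow the test cases in test_elementwise_add_op.py and add all of them here. For any case that is not supported, leave a comment explaining why, to make future debugging easier.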
Updated.
LGTM
LGTM
LGTM for ShareDataWith
LGTM for skip_check_grad_ci
PR types
Others
PR changes
OPs
Describe
Add broadcast support for elementwise_add_op_npu.
When elementwise_add_grad used the NPU operator ReduceSumD, training resnet50 was found to crash after a few epochs (acl error code 507015).
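For context on why the gradient involves a reduce-sum at all: the gradient of a broadcast add with respect to an input is dout summed over every axis along which that input was broadcast. The following is a minimal sketch in plain Python of how those reduction axes can be derived from the shapes. `reduce_axes_for_grad` is a hypothetical helper for illustration, not the kernel's actual code, and it assumes NumPy-style trailing alignment rather than Paddle's axis attribute.

```python
def reduce_axes_for_grad(out_shape, in_shape):
    """Axes of dout to sum over so the result matches in_shape (up to size-1 dims)."""
    rank_diff = len(out_shape) - len(in_shape)
    if rank_diff < 0:
        raise ValueError("input rank exceeds output rank")
    # Leading axes that the input does not have at all are always reduced.
    axes = list(range(rank_diff))
    for i, d in enumerate(in_shape):
        o = out_shape[i + rank_diff]
        if d == 1 and o != 1:
            # The input was stretched along this axis, so its gradient is summed here.
            axes.append(i + rank_diff)
        elif d != o:
            raise ValueError(f"shape mismatch at axis {i + rank_diff}: {d} vs {o}")
    return axes


# dout has the broadcast output shape; y has shape (10, 1, 12, 1) as in the unit test.
# The output shape (10, 3, 12, 4) is an assumed example.
print(reduce_axes_for_grad((10, 3, 12, 4), (10, 1, 12, 1)))  # [1, 3]
```

After summing over these axes, the result still needs to be reshaped to the input's shape if the reduction dropped the size-1 dimensions.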