[XPU][PHI Kernels] add scatter_nd_add_grad kernel & bf16 support for slice OPs #58580

lj970926 · 2023-11-01T10:42:30Z

PR types

New features

PR changes

OPs

Description

Add scatter_nd_add_grad kernel for XPU.
Add bf16 support for slice, slice_grad, strided_slice, and strided_slice_grad.
Fix a bug in scatter_nd_add when index.numel() == 0.
Add int64 support for reduce_prod and nonzero.
Fix the assertion in the full_like kernel when the input is nan/inf. The kernel needs to handle these special values, and the XPU API can deal with them properly.

lj970926 · 2023-11-01T12:14:11Z

paddle/phi/kernels/xpu/scatter_nd_add_kernel.cc

+    int loop_time = static_cast<int>(
+        index_dims_size == 0 ? 1
+                             : phi::product(phi::slice_ddim(
+                                   index.dims(), 0, index_dims_size - 1)));


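When the last dimension of the index tensor has length 0, the updates array must be indexed over all of the preceding dimensions and accumulated into output, rather than accumulating over the first dimension only.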
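
The empty-index case described here can be sketched in plain C++ as follows. This is only an illustration under stated assumptions: `LeadingDimsProduct` and `AccumulateAllSlices` are hypothetical helpers, not Paddle or XPU APIs, and tensors are modeled as flat `std::vector`s. It shows that the number of update slices is the product of every index dimension except the last, and that each slice is added onto the whole input.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper (not a Paddle API): product of all index dims except
// the last one, i.e. the number of update slices to accumulate.
// A rank-0 index is treated as a single slice, matching the diff above.
int64_t LeadingDimsProduct(const std::vector<int64_t>& index_dims) {
  if (index_dims.empty()) return 1;
  int64_t product = 1;
  for (std::size_t i = 0; i + 1 < index_dims.size(); ++i) {
    product *= index_dims[i];
  }
  return product;
}

// Hypothetical reference for the case where the last index dim is 0:
// `updates` holds loop_time contiguous slices, each x.size() elements long,
// and every slice is added element-wise onto a copy of x.
std::vector<float> AccumulateAllSlices(const std::vector<float>& x,
                                       const std::vector<float>& updates,
                                       int64_t loop_time) {
  std::vector<float> out = x;
  for (int64_t s = 0; s < loop_time; ++s) {
    const std::size_t offset = static_cast<std::size_t>(s) * x.size();
    for (std::size_t j = 0; j < x.size(); ++j) {
      out[j] += updates[offset + j];
    }
  }
  return out;
}
```

For an index of shape (2, 2, 0), as in the unit test of this PR, the helper yields 4 slices, so all four leading positions of updates contribute to the output instead of only the first two.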

lj970926 · 2023-11-01T12:14:42Z

test/xpu/test_scatter_nd_add_op_xpu.py

+            self.index_np = np.array([[[], []], [[], []]]).astype("int32")
+            self.updates_np = np.random.random((2, 2, 10, 10)).astype(
+                self.dtype
+            )


Unit test change covering the scatter_nd_add forward bug.

RuohengMa · 2023-11-02T03:33:32Z

paddle/phi/kernels/xpu/full_kernel.cc

+  bool is_out_range = true;
+  if (std::isinf(value) || std::isnan(value)) {
+    is_out_range = false;
+  }
+  if ((common_type_value >=
+       static_cast<CommonType>(std::numeric_limits<T>::lowest())) &&
+      (common_type_value <=
+       static_cast<CommonType>(std::numeric_limits<T>::max()))) {
+    is_out_range = false;
+  }


I think this needs to be changed: once common_type_value satisfies the condition, is_out_range is false even if value contains inf or nan.

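This is the expected behavior: a user can call paddle.full_like to create a Tensor filled with NaN. The logic here is that the passed value is valid if it is either within the range representable by the data type or NaN/inf. This follows the GPU implementation.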

Paddle/paddle/phi/kernels/gpu/full_kernel.cu

Line 87 in 5d4320b

bool is_out_range = true;
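
The acceptance rule discussed in this thread can be sketched as a small standalone function. This is a hedged illustration, not the kernel code: `IsFillValueAcceptable` is a hypothetical name, and `double` is assumed as the common type for the comparison. A value is accepted when it is inf or NaN, or when it lies within the representable range of the target type `T`.

```cpp
#include <cmath>
#include <limits>

// Hypothetical helper (not a Paddle API) mirroring the check discussed above.
// Assumption: the comparison is done in double, standing in for CommonType.
template <typename T>
bool IsFillValueAcceptable(double value) {
  // inf and NaN are accepted as-is; the thread states that the XPU API
  // can handle these special values.
  if (std::isinf(value) || std::isnan(value)) return true;
  // Otherwise the value must fit in the representable range of T.
  return value >= static_cast<double>(std::numeric_limits<T>::lowest()) &&
         value <= static_cast<double>(std::numeric_limits<T>::max());
}
```

Under this rule a finite value such as 1e40 is rejected for `float`, while NaN and inf pass, which matches the reply above that filling a tensor with NaN is a legitimate use.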

RuohengMa · 2023-11-02T03:36:19Z

paddle/phi/kernels/xpu/nonzero_kernel.cc

 }

 }  // namespace phi

 PD_REGISTER_KERNEL(
-    nonzero, XPU, ALL_LAYOUT, phi::NonZeroKernel, int, bool, float) {
+    nonzero, XPU, ALL_LAYOUT, phi::NonZeroKernel, int, bool, float, int64_t) {


Does the data type need to be registered in op_list here?

In the op list, nonzero is named where_index, and that entry has already been added.

RuohengMa · 2023-11-02T03:36:28Z

paddle/phi/kernels/xpu/prod_kernel.cc

@@ -50,4 +50,5 @@ void ProdKernel(const Context& dev_ctx,

 }  // namespace phi

-PD_REGISTER_KERNEL(prod, XPU, ALL_LAYOUT, phi::ProdKernel, float) {}
+PD_REGISTER_KERNEL(
+    prod, XPU, ALL_LAYOUT, phi::ProdKernel, float, int, int64_t) {}


In the op list, prod is named reduce_prod, and that entry has already been added.

RuohengMa

LGTM

lj970926 added 5 commits November 1, 2023 10:34

bevformer and bf16 support

2b76a2c

refine format

34ae9b5

refine format

a303459

refine format

f4de5d7

refine format

e478d35

fix bugs in compilation

12c6f59

paddle-bot bot added the contributor External developers label Nov 1, 2023

RuohengMa reviewed Nov 2, 2023

RuohengMa approved these changes Nov 2, 2023

QingshuChen approved these changes Nov 2, 2023

QingshuChen merged commit 038d4b4 into PaddlePaddle:develop Nov 2, 2023

paddle-bot bot removed the contributor External developers label Nov 3, 2023
