Implement SparseConv3d kernel #39784

zkh2016 · 2022-02-21T09:25:43Z

PR types

New features

PR changes

OPs

Describe

实现SparseConv3d的kenrel，算法采用Second，参考串行代码PR.

性能情况：与spconv的native实现对比gpu kernel耗时：v100上，对比单测，其中 X(2, 400,400, 15, 17), kernel(3, 3, 3, 17, 19) : fp32 1.3x加速比，fp16 1.4x加速比。

paddle-bot-old · 2022-02-21T09:26:16Z

Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

xingfeng01 · 2022-02-22T02:23:25Z

paddle/phi/kernels/sparse/gpu/convolution_kernel.cu

+  }
+  __syncthreads();
+  for (int i = threadIdx.x; i < kernel_size; i += blockDim.x) {
+    atomicAdd(&counter[i], counter_buf[i]);


这里为什么需要原子操作呢？

这里分块进行计数，然后在atomicAdd到global中，后面考虑进一步优化。

xingfeng01 · 2022-02-22T02:24:49Z

paddle/phi/tests/kernels/test_sparse_conv3d_dev_api.cc

+      0.7473, 0.5403, 0.5391, 0.0796, 0.4734, 0.9097, 0.1712, 0.6237, 0.8837};
+
+  std::vector<std::vector<int>> out_indices = {
+      // {0, 0, 0, 0},


建议删除无用的注释

xingfeng01 · 2022-02-22T02:25:16Z

paddle/phi/tests/kernels/test_sparse_conv3d_dev_api.cc

+
+      0.7473, 0.5403, 0.5391, 0.0796, 0.4734, 0.9097, 0.1712, 0.6237, 0.8837};
+
+  std::vector<std::vector<int>> out_indices = {// {0, 0, 0, 0},


建议删除无用的注释，如下

AnnaTrainingG · 2022-02-28T09:22:52Z

paddle/phi/kernels/sparse/gpu/convolution_kernel.cu

+namespace sparse {
+
+// TODO(zhangkaihuo) replace this kernel with KP::InitWithDataIndex
+__global__ void InitByIndexKernel(const int n, int* out1, int* out2) {


可以直接复用现有代码： https://github.com/PaddlePaddle/Paddle/pull/39666/files

等这个代码迁移到phi下面后再进行复用。

已在这个PR中替换使用kp：https://github.com/PaddlePaddle/Paddle/pull/40143/files#diff-0bd729a3c3ccb75aa95c719fd0f632533d2629b0822d3d972da0abbdc3923b58R100

AnnaTrainingG · 2022-03-02T03:46:16Z

paddle/phi/kernels/sparse/gpu/convolution_kernel.cu

+// this kernel with phi::GatherCUDAKernel;
+template <typename T, typename IndexT = int>
+__global__ void GatherKernel(const T* params,
+                             const IndexT* indices,


加点注释

zkh2016 added 10 commits February 17, 2022 09:24

fix incorrect dims settings

6d4f2fa

sparse conv3d

ec6eed3

fix out dims

dc8d707

test performance

fa365cb

test large shape success

bb1c375

opt scatter, double performance

99c3c41

test float16

621fae1

remove profiling code

2832f05

merge upstream develop

c413e96

remove pten

271eea6

xingfeng01 reviewed Feb 22, 2022

View reviewed changes

zkh2016 added 2 commits February 22, 2022 07:59

opt code lines

904d664

correct boundary judgment

2eea16b

zkh2016 mentioned this pull request Feb 24, 2022

Add sparse conv3d kernel #39879

Merged

merge upstream

a0c8714

AnnaTrainingG reviewed Feb 28, 2022

View reviewed changes

zkh2016 added 2 commits March 1, 2022 12:25

fix:used wrong place

4798f56

adaptive rocm

199f013

AnnaTrainingG reviewed Mar 2, 2022

View reviewed changes

add comments to code

4f2291d

xingfeng01 approved these changes Mar 3, 2022

View reviewed changes

AnnaTrainingG approved these changes Mar 3, 2022

View reviewed changes

zkh2016 merged commit 6bf85ea into PaddlePaddle:develop Mar 3, 2022

zkh2016 deleted the conv3d branch August 19, 2022 04:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement SparseConv3d kernel #39784

Implement SparseConv3d kernel #39784

zkh2016 commented Feb 21, 2022 •

edited

Loading

paddle-bot-old bot commented Feb 21, 2022

xingfeng01 Feb 22, 2022

zkh2016 Mar 2, 2022

xingfeng01 Feb 22, 2022

zkh2016 Mar 2, 2022

xingfeng01 Feb 22, 2022

zkh2016 Mar 2, 2022

AnnaTrainingG Feb 28, 2022

zkh2016 Mar 2, 2022

zkh2016 Mar 9, 2022

AnnaTrainingG Mar 2, 2022

zkh2016 Mar 2, 2022


		0.7473, 0.5403, 0.5391, 0.0796, 0.4734, 0.9097, 0.1712, 0.6237, 0.8837};

		std::vector<std::vector<int>> out_indices = {// {0, 0, 0, 0},

Implement SparseConv3d kernel #39784

Implement SparseConv3d kernel #39784

Conversation

zkh2016 commented Feb 21, 2022 • edited Loading

PR types

PR changes

Describe

paddle-bot-old bot commented Feb 21, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

zkh2016 commented Feb 21, 2022 •

edited

Loading