implement conv3d op #4400
Conversation
python/tvm/contrib/cudnn.py
Outdated
filter_d,
filter_h,
filter_w):
    """Get weight shape for a 2D convolution
3D
will fix next commit
python/tvm/contrib/cudnn.py
Outdated
    filter height
filter_w: int
    filter width
filter_d
will fix next commit
python/tvm/contrib/cudnn.py
Outdated
    0: CUDNN_CONVOLUTION
    1: CUDNN_CROSS_CORRELATION
tensor_format: int
    0: CUDNN_TENSOR_NCHW
Is this value compatible with 3D convolution?
Yes. cuDNN doesn't have a specific tensor format for 3D and higher-dimensional convolutions; the leading format is the same "NCHW", and the additional dimensions are appended to the tail for the 'channel first' format. Please see the citation below from the NVIDIA DL SDK:
format
Input. Type of the filter layout format. If this input is set to CUDNN_TENSOR_NCHW, which is one of the enumerant values allowed by cudnnTensorFormat_t descriptor, then the layout of the filter is as follows:
For N=4, a 4D filter descriptor, the filter layout is in the form of KCRS:
K represents the number of output feature maps
C is the number of input feature maps
R is the number of rows per filter
S is the number of columns per filter
For N=3, a 3D filter descriptor, the number S (number of columns per filter) is omitted.
For N=5 and greater, the layout of the higher dimensions immediately follow RS.
On the other hand, if this input is set to CUDNN_TENSOR_NHWC, then the layout of the filter is as follows:
For N=4, a 4D filter descriptor, the filter layout is in the form of KRSC.
For N=3, a 3D filter descriptor, the number S (number of columns per filter) is omitted and the layout of C immediately follows R.
For N=5 and greater, the layout of the higher dimensions are inserted between S and C. For more information, see cudnnTensorFormat_t.
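The layout rule quoted above can be made concrete with a small helper (a hypothetical illustration for this discussion, not code from the PR): for a channel-first format the filter shape always starts with (K, C), and the spatial kernel dimensions are simply appended to the tail, so 2D and 3D filters differ only in how many trailing dimensions they carry.

```python
def filter_shape(out_channels, in_channels, kernel_dims):
    """Filter shape for a channel-first (CUDNN_TENSOR_NCHW-style) layout.

    K (output feature maps) and C (input feature maps) lead, and the
    spatial kernel dimensions are appended to the tail in order.
    """
    return (out_channels, in_channels) + tuple(kernel_dims)

# 4D filter descriptor (conv2d): KCRS
print(filter_shape(32, 16, (3, 3)))      # (32, 16, 3, 3)
# 5D filter descriptor (conv3d, NCDHW): depth is appended before rows/cols
print(filter_shape(32, 16, (3, 3, 3)))   # (32, 16, 3, 3, 3)
```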
- **weight**: (channels, in_channels, kernel_size[0], kernel_size[1])
- **out**: This depends on the `layout` parameter. Output is 4D array of shape
  (batch_size, channels, out_height, out_width) if `layout` is `NCHW`.
need to update the documentation above for 3D
@optima2005 thank you very much for working on this. Can we use cuDNN's ND API like cudnnSetConvolutionNdDescriptor for 2D convolution? It would be great if we could unify the 2D and 3D implementations.
@optima2005 can you add test cases to topi/tests/python too?
@masahi, glad to contribute! Many thanks for the review! Regarding unifying 2D and 3D cuDNN convolution, I wonder whether it is better to keep them separate. The APIs form two groups in the cuDNN library (2D vs. ND), and I suspect there may be optimizations specific to the 2D group. What do you think? I have revised the other comments in the latest commit, please check again, thanks a lot!
So I think we can go with the ND API for both 2D and 3D convolution. If you are worried about performance you can always do benchmarks. Can you first send a PR that refactors our cuDNN convolution to use the ND API? It will make reviewing this PR easier.
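To illustrate why a single ND code path can serve both cases, here is a rank-generic output-shape computation in the spirit of cuDNN's ND descriptors (a sketch written for this discussion, not code from the PR). The same loop over spatial dimensions handles 2D and 3D without modification; only the length of the shape tuples changes.

```python
def conv_output_shape(x_shape, w_shape, pad, stride, dilation):
    """Output shape of an N-D convolution in a channel-first layout.

    x_shape: (N, C, *spatial), w_shape: (K, C, *kernel).
    pad/stride/dilation: one entry per spatial dimension.
    """
    batch, out_channels = x_shape[0], w_shape[0]
    spatial = []
    for i, (dim, ker) in enumerate(zip(x_shape[2:], w_shape[2:])):
        eff = dilation[i] * (ker - 1) + 1  # dilated kernel extent
        spatial.append((dim + 2 * pad[i] - eff) // stride[i] + 1)
    return (batch, out_channels) + tuple(spatial)

# 2D (NCHW): 3x3 kernel, pad 1, stride 1 -> spatial size preserved
print(conv_output_shape((1, 3, 224, 224), (64, 3, 3, 3),
                        (1, 1), (1, 1), (1, 1)))        # (1, 64, 224, 224)
# 3D (NCDHW): the identical code path, one extra dimension
print(conv_output_shape((1, 3, 16, 224, 224), (64, 3, 3, 3, 3),
                        (1, 1, 1), (2, 2, 2), (1, 1, 1)))  # (1, 64, 8, 112, 112)
```

Benchmarking, as suggested above, would then show whether the ND path loses anything against the dedicated 2D API.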
Force-pushed from 83e05ac to 828127e
@masahi, I have rebased onto master. Please continue the review. Thanks!
Thanks @optima2005 this is merged.
* implement conv3d op
* add back missed conv2d_output_shape by mistake
* fix typo and docs, add topi test
* rebase to master and merge 2d/3d unification
* use cudnn.conv_forward
Hi @optima2005, @masahi,
This is a first attempt to implement #4009.
The implementation in this PR is a very basic version of conv3d (NCDHW layout only). I am proposing this version to confirm that I am heading in the right direction. If so, I can continue, or others can pick up the work from here.
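For readers unfamiliar with the op: a basic NCDHW conv3d with "valid" padding and stride 1 can be sketched as a naive reference (this is an illustrative reference written for this summary, not the PR's implementation; like cuDNN's CUDNN_CROSS_CORRELATION mode, it computes cross-correlation, i.e. the kernel is not flipped).

```python
def conv3d_ref(x, w):
    """Naive stride-1, valid-padding conv3d (cross-correlation).

    x: nested lists indexed [N][C][D][H][W]
    w: nested lists indexed [K][C][KD][KH][KW]
    returns: nested lists indexed [N][K][OD][OH][OW]
    """
    N, C = len(x), len(x[0])
    D, H, W = len(x[0][0]), len(x[0][0][0]), len(x[0][0][0][0])
    K = len(w)
    KD, KH, KW = len(w[0][0]), len(w[0][0][0]), len(w[0][0][0][0])
    OD, OH, OW = D - KD + 1, H - KH + 1, W - KW + 1
    out = [[[[[0.0] * OW for _ in range(OH)] for _ in range(OD)]
            for _ in range(K)] for _ in range(N)]
    for n in range(N):
        for k in range(K):
            for od in range(OD):
                for oh in range(OH):
                    for ow in range(OW):
                        acc = 0.0
                        # accumulate over input channels and the 3D kernel window
                        for c in range(C):
                            for kd in range(KD):
                                for kh in range(KH):
                                    for kw in range(KW):
                                        acc += (x[n][c][od + kd][oh + kh][ow + kw]
                                                * w[k][c][kd][kh][kw])
                        out[n][k][od][oh][ow] = acc
    return out
```

A 2x2x2 all-ones input convolved with a 2x2x2 all-ones kernel yields a single output element equal to 8.0, which is the kind of check the topi/relay tests perform against a real reference.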
I can run the tests below on an x86 CPU and an NVIDIA GPU server, both on Linux:
TVM_FFI=ctypes python -m pytest -v tests/python/relay/test_op_level2.py -k test_conv3d_run
TVM_FFI=ctypes python -m pytest -v tests/python/contrib/test_cudnn.py -k test_conv3d