
refine the mkldnn logic #4953

Merged 6 commits on Oct 23, 2017

Conversation

@tensor-tang (Contributor) commented Oct 20, 2017:

  1. Removed the resetInValue-style functions from the MKLDNNLayer subclasses; they are now implemented in the parent class, and the subclasses only call them.
  2. Slightly adjusted the MKLDNNMatrix::create interface: moved the MatrixPtr argument to the end, since it is not a required parameter.
  3. Created the MKLDNNLayer.cpp file and moved the necessary function implementations into it.
  4. Adjusted the vlog order, and changed the gtest parameter sizes to make the gtests run faster.

@luotao1 (Contributor) left a comment:

The CHECK logic needs to be reorganized (this can wait for the next PR, since this one already covers a lot):

  • Some of the checks are duplicated; a macro could be defined to simplify them.
  • Each CHECK should verify only one condition. Right now many CHECKs contain &&, so when such a line fails, you cannot tell which condition went wrong.
  • CHECK(outVal_ != nullptr) can simply be written as CHECK(outVal_).

Once again, please keep each PR smaller; that makes it easier to understand, review, and merge quickly. For example, the fourth commit here (refine the gtest log info and vlog order, and change the size of test to make unit test faster refine comment and log of mkldnnlaye) is unrelated to the others and should be split out.

bias, biases_->getWGrad(), wgtPD->diff_bias_primitive_desc());
CHECK(bias && biasVal_ &&
bias->getPrimitiveDesc() == biasVal_->getPrimitiveDesc())
<< "primitive desc of bias grad should equal the bias value";
Contributor:

Lines 313-315 and 321-323: can these checks also be extracted into a function?
Looking at the MKLDNN-related code, there are many CHECKs. Could a macro be written to define the different kinds of CHECK? That would save quite a bit of code.
See PADDLE_ENFORCE for reference.

Contributor Author:

OK, worth considering; later we can look at whether a macro can be abstracted out for this.

if (biases_ == nullptr || biases_->getW() == nullptr) {
return;
}
resetWithMatrix(bias, biases_->getW(), pd->bias_primitive_desc());
Contributor:

This would be shorter written as:

if (biases_ && biases_->getW()) {
  resetWithMatrix(bias, biases_->getW(), pd->bias_primitive_desc());
}

Contributor Author:

Sure, I can change these together later; the bias handling also needs some tidying up.


resetOutValue(pd, out);
bias = nullptr;
Contributor:

Judging from the resetWithMatrix implementation, line 213 can be removed.

@tensor-tang (Contributor Author) commented Oct 20, 2017:

OK, it can be handled with if/else; by default it is best to clear it to zero.

if (dataPD == nullptr) {
return;
}
resetInGrad(in, dataPD->diff_src_primitive_desc());
resetWgtValBwdData(dataPD, wgtValBwdData_);
Contributor:

Lines 326-330 can be written as:

if (dataPD) {
  resetInGrad(in, dataPD->diff_src_primitive_desc());
  resetWgtValBwdData(dataPD, wgtValBwdData_);
}

Contributor Author:

I think the original form is a bit better: if there is nothing to do, just return.
The logic is clearer, and there is less indentation.
If using nullptr looks bad, it can be written as if (!dataPD). Or we can leave a dedicated PR later to clean these up.

@@ -431,99 +350,13 @@ void MKLDNNConvLayer::resetBwdPipeline(
if (dataPD == nullptr) {
return;
}
Contributor:

Same as above; write it as:

if (dataPD) {
  XXX;
} 

Contributor Author:

Same as above.

<< "only support input is MKLDNN layer or only have one output layer";
// when input is a mkldnn branch node,
// this layer will save input grad to a internal buffer,
// and the mkldnn input layer will merge them to actual prev->output_.grad
Contributor:

Regarding "the mkldnn input layer will merge them": is the merge done in the input layer?

Contributor Author:

Yes, inside resetInGrad.

Argument& arg = input->getOutput(this->getName());
arg.grad = std::dynamic_pointer_cast<Matrix>(in);
CHECK(inVal_ != nullptr && inVal_->getPrimitiveDesc() == intPD)
<< "should have internal input value and primitive desc must equal";
Contributor:

This can be written as:

CHECK(inVal_);
CHECK_EQ(inVal_->getPrimitiveDesc(), intPD) << "internal input value and primitive desc must equal";

One CHECK per value is better; otherwise, when this line fails, you cannot tell whether inVal_ was null or the descs were unequal.

Contributor Author:

OK, I also think that way of writing it is better.

/// The output_.value and output_.grad always save the external data,
/// when mixed with cpu device.
/// When all layers are mkldnn layers, they could save internal data.
/// below MKLDNNMatrix buffers are all internal buffers
Contributor:

  • The first word of each line should be capitalized: Value and Each both need capitals.
  • In "is always nchw of nc", should that be "of" or "or"?
  • There should be a blank line between lines 67 and 68.
  • Lines 61-67 could use a /* */ comment.

* reset output grad from internal primitive desc.
* merge grad if necessary.
* reset both internal and external buffer and create reorder if necessary.
* note: about merge grad, when this layer has serval outputs,
Contributor:

serval → several


/**
* Print the mkldnn memory format flow of value
* reset the merge grad primitive if necessary.
* note: do not support the grads are mixed with cpu device,
Contributor:

"do not support the grads mixed with cpu device"
Remove "are"; a sentence should have only one main verb.

refine comments and bias
fix typo and todo
@tensor-tang (Contributor Author):
Done.
The CHECK macro will be finished in a follow-up.

@luotao1 (Contributor) left a comment:

LGTM

@luotao1 luotao1 merged commit abce9eb into PaddlePaddle:develop Oct 23, 2017
@tensor-tang tensor-tang deleted the merge_grad_gtest branch October 23, 2017 01:55