Softmax grad op #3164

jacquesqiao · 2017-08-02T07:59:06Z

reyoung · 2017-08-02T08:44:11Z

paddle/operators/softmax_op.cc

+    PADDLE_ENFORCE(ctx.InputVar(1) != nullptr, "Input(1) should not be null");
+    PADDLE_ENFORCE(ctx.Input<Tensor>(0)->dims() == ctx.Input<Tensor>(1)->dims(),
+                   "the shape of Input(0) and Input(1) should be the same");
+    ctx.Output<Tensor>(0)->ResizeLike(*ctx.Input<Tensor>(0));


I do not think ResizeLike method is needed. Because it does not simply our implementation.

reyoung · 2017-08-02T08:45:38Z

paddle/operators/softmax_op.h

+    for (int i = 0; i < batch_size; ++i) {
+      for (int j = 0; j < class_num; ++j) {
+        auto index = i * batch_size + j;
+        scale_->data<T>()[i] += Y->data<T>()[index] * dY->data<T>()[index];


That code obviously cannot be run on GPU.

reyoung · 2017-08-02T08:46:11Z

python/paddle/v2/framework/tests/test_softmax_op.py

@@ -19,5 +19,30 @@ def setUp(self):
        self.Y = np.apply_along_axis(stable_softmax, 1, self.X)


+class TestSoftmaxGradOp(unittest.TestCase):


The Gradient of Operator does not need unit test like this.

… into softmax_grad_op

… softmax_grad_op

emailweixu · 2017-08-03T04:53:23Z

paddle/operators/softmax_op.h

+            Y->data<T>()[index] * (dY->data<T>()[index] - scale_->data<T>()[i]);
+      }
+    }
+  }


Is this only for CPU?

I am now using Eigen to rewrite this compute function and will support GPU.

@emailweixu support GPU and add python unit test done

gangliao · 2017-08-03T08:13:40Z

paddle/operators/softmax_op.h

+    auto dX_eigen = EigenMatrix<T>::From(*dX);
+    auto place = context.GetEigenDevice<Place>();
+
+    dX_eigen.device(place) = dY_eigen;


auto dot = (Y_eigen * dY_eigen) .sum(along_class) .eval() .reshape(batch_by_one) .broadcast(one_by_class); dX_eigen.device(place) = (dY_eigen - dot) * Y_eigen;

.device will evaluate the expression so it's better to remove it.

reyoung

LGTM, except some tiny problems

reyoung · 2017-08-03T12:13:15Z

paddle/operators/softmax_op.cc

-    LOG(INFO) << "SoftmaxOpGrad";
-    return "";
+  void InferShape(const InferShapeContext &ctx) const override {
+    PADDLE_ENFORCE(ctx.InputSize() == 3,


Maybe change ctx.InputSize() == 3Ul, to prevent warning.

reyoung · 2017-08-03T12:13:58Z

paddle/operators/softmax_op.cc

+                   "Input of SoftmaxOpGrad should be 3, X, Y, YG");
+    PADDLE_ENFORCE(ctx.OutputSize() == 1,
+                   "Output of SoftmaxOpGrad should be 1");
+    PADDLE_ENFORCE(ctx.InputVar("Y") != nullptr, "Input(0) should not be null");


"Input("Y") should not be null"

jacquesqiao added 2 commits August 2, 2017 07:37

init softmax grad op

627a546

add compute code

79a1e8c

reyoung reviewed Aug 2, 2017

View reviewed changes

jacquesqiao added 8 commits August 2, 2017 17:16

export Backward to python

2fabd7b

update unit test

c9c8b3b

update test ,export op.type to python

3e07d6f

update unit test

e02adb9

Merge branch 'export-backward' of https://github.com/jacquesqiao/Paddle…

8d8c049

… into softmax_grad_op

update python test, fix compute bug

c95d916

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

b49e498

… softmax_grad_op

update unit test

b234f6f

jacquesqiao changed the title ~~[wip]Softmax grad op~~ Softmax grad op Aug 3, 2017

emailweixu reviewed Aug 3, 2017

View reviewed changes

use eigen

2744be8

jacquesqiao requested review from gangliao and QiJune August 3, 2017 07:53

gangliao reviewed Aug 3, 2017

View reviewed changes

jacquesqiao added 4 commits August 3, 2017 16:26

optimize eigen code

65aba8f

add gpu test

3d02e6f

register softmax_grad GPU kernel and fix test bug

121289d

typo

4dae133

jacquesqiao requested a review from wangkuiyi August 3, 2017 11:32

reyoung approved these changes Aug 3, 2017

View reviewed changes

follow comments

b31f2da

jacquesqiao merged commit d953611 into PaddlePaddle:develop Aug 3, 2017

heavengate pushed a commit to heavengate/Paddle that referenced this pull request Aug 16, 2021

update ce (PaddlePaddle#3164)

3a31265

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Softmax grad op #3164

Softmax grad op #3164

jacquesqiao commented Aug 2, 2017 •

edited

Loading

reyoung Aug 2, 2017

jacquesqiao Aug 3, 2017

reyoung Aug 2, 2017 •

edited

Loading

reyoung Aug 2, 2017

emailweixu Aug 3, 2017

jacquesqiao Aug 3, 2017

jacquesqiao Aug 3, 2017

gangliao Aug 3, 2017

gangliao Aug 3, 2017

reyoung left a comment

reyoung Aug 3, 2017

jacquesqiao Aug 3, 2017

reyoung Aug 3, 2017

		@@ -19,5 +19,30 @@ def setUp(self):
		self.Y = np.apply_along_axis(stable_softmax, 1, self.X)


		class TestSoftmaxGradOp(unittest.TestCase):

Softmax grad op #3164

Softmax grad op #3164

Conversation

jacquesqiao commented Aug 2, 2017 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

reyoung Aug 2, 2017 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

reyoung left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jacquesqiao commented Aug 2, 2017 •

edited

Loading

reyoung Aug 2, 2017 •

edited

Loading