
Add CPU and GPU eigh op implementation #34990

Merged (56 commits, Sep 16, 2021)
Conversation

@Zjq9409 (Contributor) commented Aug 18, 2021

PR types

New features

PR changes

OPs

Describe

Add the paddle.linalg.eigh() API, supporting computation on both CPU and GPU.
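As a rough illustration of the intended contract, NumPy's `np.linalg.eigh` follows the same convention (ascending eigenvalues, orthonormal eigenvectors stored one per column); this sketch uses NumPy rather than Paddle itself:

```python
import numpy as np

# np.linalg.eigh illustrates the same contract as the proposed
# paddle.linalg.eigh: for a symmetric/Hermitian input, return the
# eigenvalues in ascending order and orthonormal eigenvectors
# (one per column of v).
A = np.array([[2.0, 1.0],
              [1.0, 2.0]])
w, v = np.linalg.eigh(A)
print(w)                                     # ascending: [1. 3.]
# reconstruction check: A == V diag(w) V^T
assert np.allclose(v @ np.diag(w) @ v.T, A)
```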

@paddle-bot-old commented:

Thanks for your contribution!
Please wait for the CI results first. See the Paddle CI Manual for details.

@Zjq9409 changed the title from "add CPU Eigh op" to "add Eigh op" Aug 23, 2021
@Zjq9409 changed the title from "add Eigh op" to "Add CPU and GPU eigh op implementation" Aug 30, 2021
@@ -946,53 +946,42 @@ def __check_input(x, vec):
def matrix_power(x, n, name=None):
r"""
Computes the n-th power of a square matrix or a batch of square matrices.

Contributor:

Do not modify other APIs.

Contributor Author:

OK.

op_inputs[name] = name_vector;
}
auto op =
framework::OpRegistry::CreateOp(type, op_inputs, op_outputs, attrs);
Contributor:

Does this create other ops inside the eigh operator? This approach is not recommended.

Contributor Author:

OK, working on it.

self.x_np = np.random.random(self.x_shape).astype(self.x_type)

def test_check_output(self):
self.check_output(no_check_set=['Eigenvectors'])
Contributor Author:

Explanation for excluding 'Eigenvectors' here:
1. Eigenvalues are unique, but eigenvectors are not.
2. NumPy uses the LAPACK library, while Paddle currently uses the Eigen library.
3. Eigenvectors computed by Eigen can differ from LAPACK's by a sign. For example, Eigen may return [[2, 3], [4, 5]] while LAPACK returns [[-2, 3], [-4, 5]]; the first columns differ in sign, but both results are correct.
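The sign ambiguity is easy to demonstrate: both v and -v satisfy the eigenvector equation, so a test should verify A·v = λ·v (or compare columns up to sign) rather than compare raw entries. A NumPy sketch (the helper `same_up_to_column_sign` is illustrative, not part of the test suite):

```python
import numpy as np

A = np.array([[4.0, 1.0],
              [1.0, 4.0]])
w, v = np.linalg.eigh(A)

# Both v[:, k] and -v[:, k] are valid unit eigenvectors for w[k],
# so a backend is free to return either sign.
for k in range(2):
    assert np.allclose(A @ v[:, k], w[k] * v[:, k])
    assert np.allclose(A @ (-v[:, k]), w[k] * (-v[:, k]))

# A sign-insensitive comparison a unit test could use instead:
def same_up_to_column_sign(u, v):
    # columns match up to sign iff their dot products are +/-1
    return np.allclose(np.abs(np.sum(u * v, axis=0)), 1.0)

assert same_up_to_column_sign(v, -v)
```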

Contributor:

Please explain why the sign difference arises, and note that Eigenvectors are checked in other unit tests.

Xreki previously approved these changes Sep 15, 2021

@Xreki (Contributor) left a comment:

LGTM. This PR is mostly done, and other operators depend on it, so I suggest merging it first and addressing the remaining review comments in follow-up PRs.

std::vector<int64_t> v_dim = {input_dim[1]};
if (rank > 2) {
v_dim = {batch_size, input_dim[1]};
}
Contributor:

There is no need for the if/else branches here either.

OP_INOUT_CHECK(ctx->HasInput("Eigenvectors"), "Input", "Eigenvectors",
"EighGrad");
OP_INOUT_CHECK(ctx->HasInputs(framework::GradVarName("Eigenvalues")),
"Input", "Eigenvalues@GRAD", "EighGrad");
Contributor:

Then this should use ctx->HasInput rather than ctx->HasInputs (no 's').

template <typename ValueType, typename T>
class EighGPUKernel : public framework::OpKernel<T> {
public:
void Compute(const framework::ExecutionContext &ctx) const override {
Contributor:

The implementation is identical to EighKernel; the GPU can register EighKernel directly, so a separate EighGPUKernel is unnecessary.

void Compute(const framework::ExecutionContext& ctx) const override {
auto& x_grad = *ctx.Output<framework::Tensor>(framework::GradVarName("X"));
x_grad.mutable_data<T>(ctx.GetPlace());
auto& output_w_var = *ctx.Input<Tensor>("Eigenvalues");
Contributor:

Why add the _var suffix to the variable name? Just use output_w.
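For reference on what this backward kernel computes: the eigenvalue contribution to the eigh gradient is V diag(w̄) Vᵀ (the full kernel also handles eigenvector gradients). A NumPy sketch of just that part, with a finite-difference check; the names `rng`, `wbar`, and `D` are illustrative, not Paddle code:

```python
import numpy as np

# For symmetric A = V diag(w) V^T and a loss L = sum(wbar * w),
# the eigenvalue part of the eigh gradient is dL/dA = V diag(wbar) V^T.
rng = np.random.default_rng(0)
A = rng.standard_normal((3, 3))
A = (A + A.T) / 2                          # symmetrize the input
w, v = np.linalg.eigh(A)
wbar = np.array([1.0, 2.0, 3.0])           # upstream grad on the eigenvalues
grad = v @ np.diag(wbar) @ v.T             # analytic gradient w.r.t. A

# finite-difference check along a symmetric direction D
D = np.zeros_like(A)
D[0, 1] = D[1, 0] = 1.0
eps = 1e-6
f = lambda M: np.linalg.eigh(M)[0] @ wbar  # scalar loss
num = (f(A + eps * D) - f(A - eps * D)) / (2 * eps)
assert np.isclose(num, (grad * D).sum(), atol=1e-5)
```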

using EigenTensor = framework::EigenTensor<T, D, MajorType, IndexType>;
template <typename T, int MajorType = Eigen::RowMajor,
typename IndexType = Eigen::DenseIndex>
using EigenVector = framework::EigenVector<T, MajorType, IndexType>;
Contributor:

Can these Eigen declarations be removed now?


} // namespace math
} // namespace operators
} // namespace paddle
Contributor:

Is all of the code implemented in header files? Compilation may be slow.

// symmetric matrices, and uses the variable compute_vectors to
// control whether to return the eigenvectors.
template <typename DeviceContext, typename ValueType, typename T>
struct MatrixEighFunctorCPU {
Contributor:

Name both the CPU and GPU functors MatrixEighFunctor, specialized for CPUDeviceContext and CUDADeviceContext respectively.

}

// Support x and y are different data types
Tensor Div_(const Tensor& x, const Tensor& y) {
Contributor:

A function name should not use a trailing _ to distinguish functionality. Also, could this be implemented directly in Div by checking whether x and y have the same dtype? In fact, the dtypes of x and y are constrained anyway: one is real and the other is the corresponding complex type?
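The mixed-dtype case the reviewer mentions arises in the eigh backward pass, where a complex matrix is divided elementwise by real eigenvalue gaps. NumPy's type-promotion rules handle this directly, as the following sketch shows (the names `w`, `M`, `gap` are illustrative, not Paddle code):

```python
import numpy as np

w = np.array([1.0, 3.0])                     # real eigenvalues
M = np.array([[1 + 2j, 3 - 1j],
              [0 + 1j, 2 + 0j]])             # complex matrix
gap = w[None, :] - w[:, None]                # real pairwise gaps w_j - w_i
np.fill_diagonal(gap, 1.0)                   # avoid divide-by-zero on the diagonal
out = M / gap                                # complex / real promotes to complex
assert out.dtype == np.complex128
```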

return out;
}

framework::Tensor Sub_(const framework::Tensor& x,
Contributor:

Why is this Sub reimplemented? If it is because the original Sub does not call InverseFunctor, I would consider that a bug in Sub, and the original function should be fixed directly. Also, model the Sub change on Div; the GPU does not need InverseFunctor.


cryoco previously approved these changes Sep 15, 2021

@cryoco (Contributor) left a comment:

LGTM for no_check_set

jzhang533 previously approved these changes Sep 15, 2021

@jzhang533 (Contributor) left a comment:

lgtm

@Zjq9409 dismissed stale reviews from jzhang533, cryoco, and Xreki via 823a50f September 15, 2021 07:00
cryoco previously approved these changes Sep 15, 2021
jzhang533 previously approved these changes Sep 15, 2021
@jzhang533 (Contributor) left a comment:

lgtm

Xreki previously approved these changes Sep 15, 2021
@Zjq9409 dismissed stale reviews from Xreki, jzhang533, and cryoco via d478e09 September 16, 2021 02:09
@jiangjiajun (Collaborator) left a comment:

LGTM

@jzhang533 (Contributor) left a comment:

LGTM

@Xreki Xreki merged commit 07d0b83 into PaddlePaddle:develop Sep 16, 2021
AnnaTrainingG pushed a commit to AnnaTrainingG/Paddle that referenced this pull request Sep 29, 2021