add flatten weight of lstm #27192
Conversation
Thanks for your contribution!
force-pushed from 29cef36 to ab8d51c
@@ -271,6 +363,8 @@ class CudnnLSTMGPUGradKernel : public framework::OpKernel<T> {
          "of cudnn is larger than 7.2.1"));
#endif
    }
    weight_to_tensor_list<T>(place, stream, &weight_grad_list, weight_list,
Is there a better way to handle this? It seems the grad only needs to be copied when the weights are non-contiguous; when the weights are contiguous, the grad should preferably be contiguous too, so ShareData is probably the better choice.
Done: the grad now follows the same strategy as the weights, and is copied only when the memory is non-contiguous.
force-pushed from ab8d51c to 9a7388e
force-pushed from 166a514 to 8414988
    }

    bool grad_continuous =
        is_continuous<T, std::vector<Tensor *>>(weight_grad_list);
Since each weight grad in weight_grad_list is allocated by weight_grad_list[i]->mutable_data<T>(place), the list is very likely non-contiguous, so the gradients would still be copied every time. Could we allocate one large weight-grad block directly and let each small weight grad ShareDataWith a slice of it?
I tried that. Because the input is a weight list, on the C++ side the grads can only be produced as a weight list as well. On the Python side, however, it should be possible to allocate the grad memory as one contiguous block.
Changed: the input weight list now shares memory with the big W.
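The shared-memory arrangement discussed above can be sketched in isolation. This is a minimal illustration, not Paddle's actual Tensor/ShareDataWith API: one large contiguous buffer plays the role of the big W, and non-owning views into it play the role of the per-gate weight tensors, so a write through any view lands directly in the flat buffer and no copy is needed before the cudnn call.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Non-owning view into a slice of a larger buffer (stand-in for a
// sub-tensor created via ShareDataWith).
struct View {
  float* data;
  std::size_t size;
};

// Carve the big flattened buffer into adjacent views, one per weight.
// The views alias big_w's storage; nothing is copied.
std::vector<View> MakeSharedViews(std::vector<float>& big_w,
                                  const std::vector<std::size_t>& sizes) {
  std::vector<View> views;
  std::size_t offset = 0;
  for (std::size_t s : sizes) {
    views.push_back(View{big_w.data() + offset, s});
    offset += s;
  }
  assert(offset == big_w.size());  // the slices must exactly tile big_w
  return views;
}
```

Because every view aliases the same storage, filling the per-weight views also fills the flat buffer, which is the property the PR relies on to avoid a gather-copy.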
force-pushed from a99032a to ebd3b35
force-pushed from f3dfa83 to 8cf5cc7
force-pushed from 1511ea5 to 92f6ea6
    auto weight_list = ctx.MultiInput<framework::Tensor>("WeightList");
    W->mutable_data<T>({weight_numel}, place);
    weight_to_tensor<T>(place, stream, weight_list, W);
  }
When is_test == true, does this copy on every run? Could the copy be done only when W has not been initialized?
During Python-side inference, W is initialized, so W is used directly and no data is copied.
During C++-side inference, W is not initialized, so weight_list is used, and weight_list is copied into W.
LGTM
LGTM
* add flatten weight of lstm
PR types
New features
PR changes
OPs
Describe
Change the cudnn LSTM input from one big weight block to a weight list. If the Python side allocates the weights in adjacent memory, the C++ side calls cudnn directly with the first pointer and the total size; otherwise the C++ side must copy the list into one large contiguous block.
In test mode, if W is provided and initialized, W is used preferentially; otherwise WeightList is used and the converted parameters are saved into W.
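The contiguity test this description relies on can be illustrated with a small stand-alone check (analogous to, but not, the kernel's actual is_continuous helper): the weight list counts as contiguous exactly when each slice's data pointer starts where the previous slice ends, in which case the first pointer plus the total size can be handed straight to cudnn and no copy is required.

```cpp
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

// Each entry models a tensor in the weight list as (data pointer, numel).
// The list is contiguous iff every slice begins exactly where the
// previous slice ends; only a non-contiguous list must be gather-copied.
bool IsContinuous(
    const std::vector<std::pair<const float*, std::size_t>>& list) {
  for (std::size_t i = 1; i < list.size(); ++i) {
    if (list[i - 1].first + list[i - 1].second != list[i].first) return false;
  }
  return true;
}
```

When IsContinuous returns true, list.front().first together with the summed sizes already describes one flat block, which is the fast path the description mentions.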
Items requiring approval
You must have one RD (cyj1986, Superjomn) approval for the changes of Inputs/Output/Attrs of OPs. The changes of OPs will cause that the new version inference fails to load model trained by the old version. Please modify your code.
You must have one RD (zhiqiu (Recommend) or phlrain) approval for the api change for the operator-related api without 'core.ops'.
The fluid.layers.lstm interface will be deprecated next; the new interface defined in PR27217 should be used instead.
You must have one RD (XiaoguangHu01,Xreki,luotao1) approval for the usage (either add or delete) of const_cast.
Using ShareDataWith or ShareBufferWith is not recommended. You must have one RD's (zhhsplendid (Recommend), zhiqiu or luotao1 or lanxianghit) approval to use these methods. For more information, please refer to https://github.com/PaddlePaddle/Paddle/wiki/ShareDataWith-is-prohibited-in-OP. The error lines are as follows:
It is an Op accuracy problem, please take care of it. You must have one RD (zhangting2020 (Recommend), luotao1 or phlrain) approval for the usage (either add or delete) of @skip_check_grad_ci. For more information, please refer to: https://github.com/PaddlePaddle/Paddle/wiki/Gradient-Check-Is-Required-for-Op-Test.
Justification of the interface compatibility issues
The added Reserve and StateOut outputs are needed to support the cudnn lstm C++ kernel in dynamic graph mode.
The bidirectional mode of the original lstm interface produces results with wrong dimensions.
The multi-layer results of the original lstm interface are wrong: the interface has always taken padded input, but the cudnn routine it calls expects unpadded data, so although the original API can still be invoked, both its results and its accuracy are problematic. Currently only one model in our model zoo calls the original API (a multi-layer case); it should be migrated to the new interface.
External users cannot actually be relying on it, given that the API computes wrong results. Two issues filed by external users: "lstm error" #24300 and "the is_bidirec parameter of fluid.layers.lstm does not make it bidirectional" #22979.
Therefore the lstm op is planned for a major overhaul in 2.0. Going forward, the old API will no longer be recommended; the newly added API should be used instead.