
【Hackathon No.46】Add float16 data type support for the Paddle gumbel_softmax operator #50923

Merged (5 commits, Mar 17, 2023)

Conversation

denglianbin (Contributor) commented Feb 26, 2023

PR types

Others

PR changes

Others

Describe

Performance data (op benchmark):

| shape | hard | axis | fp32 | fp16 |
| --- | --- | --- | --- | --- |
| 128, 128 | TRUE | -1 | 0.088926481 | 0.070158559 |
| 100 | TRUE | -1 | 0.059882962 | 0.052535534 |
| 20, 10, 5 | TRUE | 1 | 0.064530178 | 0.057325801 |
| 20, 10, 5 | FALSE | 1 | 0.051758971 | 0.047544071 |
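For context, a minimal usage sketch of what this PR enables: calling paddle.nn.functional.gumbel_softmax on a float16 tensor. The shape and temperature below are illustrative only (not taken from the benchmark), and the float16 path assumes a GPU device.

```python
import paddle
import paddle.nn.functional as F

# Illustrative sketch: the float16 kernels added by this PR run on GPU.
paddle.set_device("gpu")

logits = paddle.randn([128, 128]).astype("float16")

# Soft (differentiable) samples and hard one-hot samples along the last axis.
soft = F.gumbel_softmax(logits, temperature=1.0, hard=False, axis=-1)
hard = F.gumbel_softmax(logits, temperature=1.0, hard=True, axis=-1)

print(soft.dtype)         # paddle.float16
print(hard.sum(axis=-1))  # each row of the hard output sums to 1
```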

@paddle-bot commented Feb 26, 2023

Your PR has been submitted. Thanks for your contribution!
Please wait for the CI results first. See the Paddle CI Manual for details.

@CLAassistant commented Feb 26, 2023

CLA assistant check: all committers have signed the CLA.

@paddle-bot added labels contributor (External developers) and status: proposed on Feb 26, 2023
@denglianbin changed the title from 【Hackathon + No.task-number】Add float16 data type support for the Paddle gumbel_softmax operator to 【Hackathon No.46】Add float16 data type support for the Paddle gumbel_softmax operator on Feb 26, 2023
@zhangting2020 self-requested a review on March 6, 2023
```cpp
index_sequence_begin + size,
thrust::device_ptr<T>(random_data),
UniformCUDAGenerator<T>(static_cast<phi::dtype::float16>(0.00001),
                        static_cast<phi::dtype::float16>(1),
```
Contributor: Is this change necessary? Whatever type T is, the bounds here are always cast to FP16.

Contributor Author: done

```python
        self.count_expected = 100
        self.dtype = np.float16
```


Contributor Author: done

@zhangting2020 (Contributor): Also, remember to submit the Chinese documentation update PR. You can reference it in this PR's description, or assign it to me after you submit it and I will review it.

@denglianbin (Contributor Author): @zhangting2020 Hi, here is the Chinese documentation update PR: PaddlePaddle/docs#5707

```python
        self.check_output()

    def test_check_grad(self):
        self.check_grad(["X"], "Out")
```
Contributor: The FP16 unit tests should inherit from TestGumbelSoftmaxOp; in fact you only need to override init_attrs for the fp16 cases, which reduces redundant code.
TestGumbelSoftmax_ZeroDim_FP16OP -> TestGumbelSoftmaxFP16OP

Contributor Author: Hi, this follows the original pattern in the test file: the ZeroDim case inherits OpTest directly and is tested on its own, while the remaining tests inherit TestGumbelSoftmaxOp and override init_attrs(). I handled ZeroDim separately in the same way, inheriting OpTest directly; the four subsequent FP16 tests all inherit TestGumbelSoftmaxOp and override init_attrs().

Contributor: I don't think the original pattern is optimal. TestGumbelSoftmax_ZeroDim could also just override init_attrs, couldn't it?

Contributor Author: Hi, I tried having TestGumbelSoftmax_ZeroDim inherit directly from the TestGumbelSoftmaxOp base class, but the base class replaces check_output with a check_output_customized written for the multi-dimensional cases, which does not apply to ZeroDim. So I added an init_attrs method to TestGumbelSoftmax_ZeroDim and made TestGumbelSoftmax_ZeroDim_FP16OP inherit from it with the dtype changed.
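To make the resolved structure concrete, here is a minimal sketch of the class layout being discussed. The class and method names follow the conversation; the import path and the attribute values in the method bodies are assumptions, not the actual test file.

```python
import numpy as np
from op_test import OpTest  # Paddle's in-tree operator test base; path assumed


class TestGumbelSoftmaxOp(OpTest):
    # Multi-dim base case; its output check is a customized
    # check_output_customized that does not fit 0-d tensors.
    def init_attrs(self):
        self.shape = [20, 10]              # illustrative values
        self.attrs = {"hard": True, "axis": -1}
        self.count_expected = 100
        self.dtype = np.float32


class TestGumbelSoftmaxFP16OP(TestGumbelSoftmaxOp):
    # FP16 case: only the dtype changes, as the reviewer suggested.
    def init_attrs(self):
        super().init_attrs()
        self.dtype = np.float16


class TestGumbelSoftmax_ZeroDim(OpTest):
    # ZeroDim keeps inheriting OpTest directly, with its own init_attrs,
    # because the base class's customized check does not apply to it.
    def init_attrs(self):
        self.dtype = np.float32


class TestGumbelSoftmax_ZeroDim_FP16OP(TestGumbelSoftmax_ZeroDim):
    def init_attrs(self):
        self.dtype = np.float16
```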

```python
        self.attrs = {"hard": True, "axis": 1}
        self.count_expected = 100
        self.dtype = np.float16
```

Contributor: These four unit tests should inherit from TestGumbelSoftmaxFP16OP.

Contributor Author: Hi, since TestGumbelSoftmax_ZeroDim_FP16OP above targets the ZeroDim case, it has no init_attrs() function and cannot simply be renamed to TestGumbelSoftmaxFP16OP, so these four tests inherit directly from TestGumbelSoftmaxOp.
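Under the same assumptions as the earlier sketch, each of the four variants then reduces to one init_attrs override on TestGumbelSoftmaxOp. For the snippet above, that looks roughly like this (the class name here is hypothetical; the attribute values come from the snippet):

```python
class TestGumbelSoftmaxFP16OP2(TestGumbelSoftmaxOp):
    # One of the four FP16 variants; shape, hard, and axis vary per test.
    def init_attrs(self):
        self.shape = [20, 10, 5]
        self.attrs = {"hard": True, "axis": 1}
        self.count_expected = 100
        self.dtype = np.float16
```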

@denglianbin (Contributor Author):

[screenshot of the CI error]

@zhangting2020 Hi, the static_check CI job reports this error, which looks like an issue with the CI environment image. I tried the failing sample locally and it runs without problems. Does this need any handling on my side?

@luotao1 (Contributor) commented Mar 16, 2023

> The static_check CI job reports this error, which looks like an issue with the CI environment image.

Resolved; see #51512 (comment). The pipeline is working normally now. @denglianbin

@denglianbin (Contributor Author):

> Resolved; see #51512 (comment). The pipeline is working normally now.

Got it. I have re-run the CI; the only remaining failure is "find RD for approval first", so it should be in order now.

@zhangting2020 (Contributor) left a review:

LGTM

@luotao1 merged commit e0007f3 into PaddlePaddle:develop on Mar 17, 2023