Implement Fused BN + Add + Relu with cudnnFusedOps API. #35955
Conversation
Thanks for your contribution!
Force-pushed from 99b1a72 to 1aed85f
LGTM; the code can be further polished in follow-up PRs.
    fwd_workspace_byte_);
  }

  void Backward(const platform::CUDADeviceContext &ctx, T *dy_ptr, T *x_ptr,
It might be better to implement Forward and Backward in separate classes, since these two are not exact counterparts of each other.
@@ -0,0 +1,292 @@
/* Copyright (c) 2021 PaddlePaddle Authors. All Rights Reserved.
Consider merging cudnn_bn_stats_finalize.cu.h and cudnn_scale_bias_add_relu.cu.h into a single file.
PR types
New features
PR changes
OPs
Describe
Add bn_add_relu test
For the original batchnorm, all intermediate results are computed in float; only the final output is cast once from fp32 to fp16. In the fused computation, although mean, std, and the other statistics are identical to the original, bn_finalize emits an fp16 eq_scale and eq_bias that are used for the final multiply-add, so the final result carries some additional error; the unit-test tolerance is therefore raised to 2e-3.
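The precision gap described above can be reproduced with a small NumPy sketch (a hypothetical stand-in, not the actual Paddle test code): the reference path keeps every intermediate in fp32 and casts once at the end, while the fused path folds the statistics into eq_scale/eq_bias, casts those to fp16, and does the final multiply-add in fp16. The tensor shapes, random data, and eps value here are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.standard_normal((1000, 64)).astype(np.float32)
z = rng.standard_normal((1000, 64)).astype(np.float32)  # residual "add" input
gamma = rng.standard_normal(64).astype(np.float32)
beta = rng.standard_normal(64).astype(np.float32)
eps = 1e-5

mean = x.mean(axis=0)
var = x.var(axis=0)

# Reference batchnorm + add + relu: all intermediates in fp32,
# a single fp32 -> fp16 cast on the final output.
y_ref = np.maximum((x - mean) / np.sqrt(var + eps) * gamma + beta + z, 0)
y_ref = y_ref.astype(np.float16)

# Fused path: fold the statistics into per-channel scale/bias
# (y = x * eq_scale + eq_bias), then cast them to fp16 *before*
# the final multiply-add, as bn_finalize does.
eq_scale = (gamma / np.sqrt(var + eps)).astype(np.float16)
eq_bias = (beta - mean * gamma / np.sqrt(var + eps)).astype(np.float16)
y_fused = np.maximum(
    x.astype(np.float16) * eq_scale + eq_bias + z.astype(np.float16), 0)

# The two outputs agree only to roughly fp16 resolution, which is why
# the unit-test tolerance needs to be loosened.
max_err = np.abs(y_ref.astype(np.float32) - y_fused.astype(np.float32)).max()
print(max_err)
```

The error is dominated by rounding eq_scale/eq_bias to fp16 before the multiply-add rather than by the statistics themselves, which matches the explanation in the PR description.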