Refactor the organization of layer_norm cuda impl. #34883

limin2021 · 2021-08-13T05:39:57Z

PR types

Performance optimization

PR changes

OPs

Describe

Refactor the organization of layer_norm cuda impl so that it can be reused in fused attention op.

Extract the layer_norm cuda impl form layer_norm_op.cu to layer_norm_kernel.cu.h.
Define fused/attention_layer_norm.h, which can be used in fused attention op in next PR.

Test the correctness of layer_norm op after refactoring：

V100, cuda10.1

xingfeng01 · 2021-08-17T05:33:41Z

LGTM

lanxianghit

LGTM

limin2021 added 2 commits August 13, 2021 05:19

Refactor the organization of layer_norm cuda impl.

1353e9d

Update attention_layer_norm.h

67ecdd7

xingfeng01 approved these changes Aug 19, 2021

View reviewed changes

lanxianghit approved these changes Aug 20, 2021

View reviewed changes

lanxianghit merged commit 7f5eb53 into PaddlePaddle:develop Aug 23, 2021

limin2021 mentioned this pull request Sep 14, 2021

Add fused_attention_op #35727

Closed

This was referenced Sep 23, 2021

Fused attention op forward #35905

Merged

Fused attention op backward #35935

Closed

This was referenced Oct 18, 2021

Add fused attention op backward and python layer. #36498

Merged

[cherry-pick] Cherry pick fused attn fw #36636

Closed

[cherry-pick] Cherry pick fused attn fw #36677

Closed

This was referenced Oct 25, 2021

[cherry-pick-2.2] Fused attention op forward #36708

Merged

[cherry-pick-2.2]Add fused attention op backward and python layer. #36752

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor the organization of layer_norm cuda impl. #34883

Refactor the organization of layer_norm cuda impl. #34883

limin2021 commented Aug 13, 2021 •

edited

Loading

xingfeng01 commented Aug 17, 2021

lanxianghit left a comment

Refactor the organization of layer_norm cuda impl. #34883

Refactor the organization of layer_norm cuda impl. #34883

Conversation

limin2021 commented Aug 13, 2021 • edited Loading

PR types

PR changes

Describe

Test the correctness of layer_norm op after refactoring：

xingfeng01 commented Aug 17, 2021

lanxianghit left a comment

Choose a reason for hiding this comment

limin2021 commented Aug 13, 2021 •

edited

Loading