[Example] Use .fuse() primitive when possible #42

chhzh123 · 2023-02-07T03:31:44Z

Description

This PR fixes some issues for the example models:

Use .fuse() to fuse bias+GeLU in the MLP module. Because a TorchScript module cannot be hooked and does not work properly with DeepSpeed ZeRO-3, this feature is disabled by default.
Use subgraph matching to optimize only a small part of the Albert model instead of replacing a whole layer.
Fix the Albert attention mask shape.
Remove unused functions from the schedule.
Fix an argument issue in .replace() and support bias+layernorm fusion.
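
For readers unfamiliar with bias+GeLU fusion, here is a minimal plain-Python sketch of the idea. It does not use Slapo's .fuse() primitive or TorchScript, and the function names (gelu, bias_then_gelu, fused_bias_gelu) are hypothetical, chosen only for illustration. The point is that the fused form computes the same values in a single pass, without materializing the intermediate "x + bias" result.

```python
import math


def gelu(x: float) -> float:
    """Exact GeLU: x * Phi(x), where Phi is the standard normal CDF."""
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


def bias_then_gelu(xs, bias):
    """Unfused reference: two passes with an intermediate list."""
    biased = [x + b for x, b in zip(xs, bias)]  # pass 1: bias add
    return [gelu(v) for v in biased]            # pass 2: activation


def fused_bias_gelu(xs, bias):
    """Fused version: one pass, no intermediate buffer (illustrative only)."""
    return [gelu(x + b) for x, b in zip(xs, bias)]


xs = [-1.0, 0.0, 0.5, 2.0]
bias = [0.1, -0.2, 0.3, 0.0]

unfused = bias_then_gelu(xs, bias)
fused = fused_bias_gelu(xs, bias)

# Both variants must agree element-wise.
assert all(math.isclose(a, b) for a, b in zip(unfused, fused))
```

In the actual examples the fusion is performed on a traced subgraph rather than on Python lists, so the benefit comes from launching one kernel instead of two; the sketch only shows the numerical equivalence.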

Checklist

PR's title starts with a category (e.g. [Bugfix], [Model], [Tutorial], etc.)
Changes are complete (i.e. I finished coding on this PR)
All changes have test coverage
Code is well-documented

chhzh123 · 2023-02-07T03:34:00Z

I left the GPT example unchanged to avoid conflicts. You can merge #41 first, and I will update the other models accordingly. @comaniac

comaniac

Otherwise LGTM

examples/albert/model.py

examples/opt/schedule.py

tests/test_fusion.py

comaniac

LGTM

examples/albert/model.py

chhzh123 · 2023-02-08T01:18:35Z

Please do not merge yet. I'll also remove the epoi dependency.

chhzh123 · 2023-02-08T01:19:00Z

Or do you think it is better to create a separate PR? @comaniac

comaniac · 2023-02-08T01:34:37Z

I'm fine with either, so do whichever is more convenient for you.

chhzh123 · 2023-02-10T16:47:05Z

I've changed all uses of epoi.attention to slapo.op.attention (except for the T5 one). This PR is ready for review and merge. @comaniac

slapo/op/attention.py

comaniac · 2023-02-10T19:55:02Z

Thanks @chhzh123

chhzh123 requested a review from comaniac February 7, 2023 03:31

comaniac reviewed Feb 7, 2023

examples/albert/model.py

examples/opt/schedule.py

tests/test_fusion.py

chhzh123 force-pushed the mlp_fusion branch from ef0e22b to 375b805 Compare February 7, 2023 23:46

comaniac approved these changes Feb 8, 2023

examples/albert/model.py

chhzh123 added 22 commits February 10, 2023 04:56

Support fuse bias layernorm

2564827

Add fuse_bias_gelu to bert

59177d1

Fix bias_ln

9ed1f02

Add separate op_fusion

34f2b24

Fix pylint

7bb9562

Add fuse_bias_gelu to albert & Fix shape

87d03b4

Add flash_attn to list_envs

9a2b23d

Add gelu act to albert

71edf5e

Add bias_gelu_fusion to roberta

d7e2827

Add separate_fusion to opt

8b98681

Update README

5138b75

Uncomment fuse_bias_gelu in albert

b21727b

Add flag to albert

63fb4f8

Fix albert

4318a89

Disable fuse_bias_gelu

2381d33

Refactor opt schedule

4c4fe5d

Remove timing test

9239270

Fix format

29e26d0

Fix flag

6f3ae92

Reset default value of disable_fuse_bias_gelu

770019f

Delete demo

1c8e277

Fix attention name and signature

94bd8bf

chhzh123 added 10 commits February 10, 2023 04:56

Use slapo.op to schedule albert

407814b

Fix GPT schedule

c247d6f

Fix BERT

bec0c7d

Fix bert & roberta

0b7b202

Fix OPT

9d83269

Update OPT schedule

7b0ff5c

Fix albert

6a54c62

Fix bert

010361f

Fix gpt opt

aac2444

Fix roberta

85be556

chhzh123 force-pushed the mlp_fusion branch from 7e94ba9 to 85be556 Compare February 10, 2023 05:58

chhzh123 added 3 commits February 10, 2023 06:18

Fix format

7bfd769

Add mlp to opt

0437c09

Update op test

eacf24c

chhzh123 requested a review from comaniac February 10, 2023 16:47

comaniac reviewed Feb 10, 2023

slapo/op/attention.py

Fix gpt2

86a8b8a

comaniac merged commit 6c2e235 into awslabs:main Feb 10, 2023
