
[Auto Parallel] Integrate all modules #35483

Merged · 134 commits merged into PaddlePaddle:develop from auto_parallel_integration on Sep 8, 2021

Conversation

@aoyulong (Contributor) commented on Sep 6, 2021

PR types

Others

PR changes

Others

Describe

Integrate all parts of auto parallel and improve the code (a minimal sketch of the integrated flow follows this list):

  • Integrate all parts by AutoParallelizer
  • Add unit test for AutoParallelizer
  • Improve auto completion module for pipeline parallel
  • Add support for matmul_v2 in dist_matmul
  • Correct the typo "stratergy" to "strategy"
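
For context, the flow this PR wires together runs auto completion, then partitioning, then a reshard pass. The sketch below is a plain-Python illustration of that sequencing only; this `AutoParallelizer` is a stand-in with hypothetical callbacks, not Paddle's actual class or API.

```python
# Illustrative only: mimics the completion -> partition -> reshard pipeline
# integrated by this PR; none of these signatures are Paddle's real API.

class AutoParallelizer:
    """Drives the auto-parallel passes over a serial program (hypothetical)."""

    def __init__(self, complete_fn, partition_fn, reshard_fn):
        self._complete = complete_fn    # fills distributed attrs (process mesh, dims_mapping)
        self._partition = partition_fn  # splits the serial program for one rank
        self._reshard = reshard_fn      # inserts communication where shardings mismatch

    def parallelize(self, serial_program, rank):
        annotated = self._complete(serial_program)                  # auto completion
        main_prog, startup_prog = self._partition(annotated, rank)  # partitioner
        self._reshard(main_prog, rank)                              # reshard pass
        return main_prog, startup_prog

# Toy usage with stub passes:
ap = AutoParallelizer(lambda p: p, lambda p, r: (p, p), lambda p, r: None)
main_prog, startup_prog = ap.parallelize("serial_program", rank=0)
```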

paddle-bot-old bot commented Sep 6, 2021

Thanks for your contribution!
Please wait for the CI result first. See the Paddle CI Manual for details.

@fuyinno4 (Contributor) left a comment


please rename Impl0

@fuyinno4 merged commit 1215535 into PaddlePaddle:develop on Sep 8, 2021
AnnaTrainingG pushed a commit to AnnaTrainingG/Paddle that referenced this pull request on Sep 29, 2021:
* add auto_parallel dir

* mv to paddle.distributed

* add shard_xx api

* add distributed attrs for var

* add ut, test=develop

* add dist

* update

* update

* update

* update

* update

* update, test=develop

* update, test=develop

* update, test=develop

* update, test=develop

* update, test=develop

* update, test=develop

* update, test=develop

* update

* update

* update

* update

* update

* update, test=develop

* update, test=develop

* update

* update

* delete unused proto

* restore op_desc

* restore type_defs

* update var_desc

* remove dims_mapping for proto_pybind

* update interface.py

* update framework.py

* update

* update

* add auto_parallel dir

* mv to paddle.distributed

* add shard_xx api

* add distributed attrs for var

* add ut, test=develop

* [WIP] Add the auto completion feature and related codes

* [WIP] Improve the auto completion and related codes

* [WIP] Make the auto completion support data-parallel

* [WIP] Make the completion support mp and dp+mp

* [WIP] Refactor auto completion unit test for MLP

* [WIP] Refactor the implementation of DistributedOperatorImpl

* [WIP] Improve dims_mapping update rule and fix a bug

* [WIP] Support auto completion for one transformer decoder layer

* [WIP] Add a minor change

* [WIP] Fix a bug within the unit test

* Shard XShape tensor, add embedding completion and refactor code

* Add the distributed_operators dir to setup.py.in

* Improve the completion process and add the unittest for gpt

* fix process_mesh ut

* fix process_mesh ut

* update

* update, test=develop

* Add support for automatically completing distributed attrs of special ops

* update

* update

* update

* fix doc sample codes, test=develop

* improve coverage, test=develop

* add static_mode check, test=develop

* Model the cluster for cost model and physical mapping

* update, test=develop

* add set_placement, test=develop

* Add the check to make sure the candidate tensors' size is greater than zero

* update doc, test=develop

* update doc, test=develop

* update doc, test=develop

* update doc, test=develop

* update, test=develop

* Auto mark dist attrs annotated by user

* update ndarray to nested list, test=develop

* update, test=develop

* Add auto-completion module for auto-parallel (based on PR#33804)

* Remove unnecessary files

* Remove unrelated files for the auto completion pr

* Update the unit test to improve the coverage

* Modify codes based on reviews

* Minor changes for CI

* Improve some codes based on new comments

* Fix bugs caused by shallow copy in attributes.py
* Improve amend_distributed_attr_for_program in context.py
* Other changes for weihang's comments

* support shard reader

* support shard reader

* add parallel mode

* update process mesh

* add method to compute comm_group (see the sketch after this commit log)

* implement dist_embedding forward func

* implement dist matmul forward func

* implement dist reshape forward func

* add transpiler framework

* add transpiler forward

* implement transpiler forward

* implement transpiler backward & update

* add process

* add unittest

* chmod

* chmod

* chmod

* update unittest

* add unittest for gpt

* remove unused print

* rename transpiler --> partitioner

* rename transpiler --> partitioner

* chmod

* chmod

* bug fixed

* remove amp function

* update case for dp mode

* update case for dp mode

* [Auto Parallel] Integrate all parts with the newest code

* Integrate all parts of auto parallel and improve codes

* Integrate all parts by AutoParallelizer
* Add unit test for AutoParallelizer
* Improve auto completion module for pipeline parallel
* Add support for matmul_v2 in dist_matmul
* Correct the typo "stratergy" to "strategy"

* Modify distributed_strategy.proto to conform to the main stream

* Restore parts of distributed_strategy to conform to the develop branch

Co-authored-by: sandyhouse <lilong12@baidu.com>
Co-authored-by: JZ-LIANG <jianzhongliang10@gmail.com>
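
Several commits above revolve around the process mesh, dims_mapping, and comm_group machinery. As a rough, self-contained illustration (plain Python with hypothetical helper names, not Paddle's API; it follows the auto-parallel convention that dims_mapping[i] gives the process-mesh dimension sharding tensor dimension i, with -1 meaning replicated):

```python
import itertools

def local_shard_shape(global_shape, dims_mapping, mesh_shape):
    """Per-rank shape of a tensor sharded over a process mesh.

    dims_mapping[i] is the mesh dim that shards tensor dim i (-1 = replicated).
    """
    shape = []
    for size, mesh_dim in zip(global_shape, dims_mapping):
        if mesh_dim == -1:
            shape.append(size)                          # replicated: keep full size
        else:
            assert size % mesh_shape[mesh_dim] == 0     # assume even sharding
            shape.append(size // mesh_shape[mesh_dim])
    return shape

def comm_group(mesh_shape, ranks, my_rank, mesh_dim):
    """Ranks differing from my_rank only along mesh_dim form one comm group."""
    coords = list(itertools.product(*[range(n) for n in mesh_shape]))
    rank_of = dict(zip(coords, ranks))                  # row-major coord -> rank
    my_coord = coords[ranks.index(my_rank)]
    group = []
    for i in range(mesh_shape[mesh_dim]):
        c = list(my_coord)
        c[mesh_dim] = i
        group.append(rank_of[tuple(c)])
    return group

# 2x4 mesh over ranks 0..7; shard dim 1 of an [8, 16] tensor along mesh dim 1:
mesh, ranks = (2, 4), list(range(8))
print(local_shard_shape([8, 16], [-1, 1], mesh))       # -> [8, 4]
print(comm_group(mesh, ranks, my_rank=6, mesh_dim=1))  # -> [4, 5, 6, 7]
```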
@aoyulong deleted the auto_parallel_integration branch on December 10, 2021