Add NPU FusedAdam support #4343
Conversation
* origin/master: (48 commits)
  Fix autotune to support Triton 2.1 (deepspeedai#4340)
  Fix skipped inference tests (deepspeedai#4336)
  Suppress noise (deepspeedai#4310)
  Fix a bug in the implementation of dequantization for inference (deepspeedai#3433)
  DS-Chat BLOOM: Fix Attention mask (deepspeedai#4338)
  clear redundant timers (deepspeedai#4308)
  Add release version checking (deepspeedai#4328)
  Fix Zero3 contiguous grads, reduce scatter false accuracy issue (deepspeedai#4321)
  Clean up modeling code (deepspeedai#4320)
  Handle empty parameter groups (deepspeedai#4277)
  Update README.md (deepspeedai#4316)
  README update (deepspeedai#4303)
  Update release and bump patch versioning flow (deepspeedai#4286)
  added a bert-model check for triton (deepspeedai#4266)
  ZeRO-Inference v2 release
  bump to 0.10.4
  Update index.md (deepspeedai#4297)
  fix user args parsing of string with spaces on runner (deepspeedai#4265)
  ZeRO-Inference refresh (deepspeedai#4197)
  AMD Kernel Compatibility Fixes (deepspeedai#3180)
  ...
@tjruwase @jeffra @RezaYazdaniAminabadi @cmikeh2 Sorry to bother you, but could you review this PR?
* origin/master:
  Allow multiple inference engines in single script (deepspeedai#4384)
  adds triton flash attention2 kernel (deepspeedai#4337)
  Fix llama meta tensor loading in AutoTP and kernel injected inference (deepspeedai#3608)
  Fix min torch version (deepspeedai#4375)
  Fix multinode runner to properly append to PDSH_SSH_ARGS_APPEND (deepspeedai#4373)
  add the missing method (deepspeedai#4363)
  Openfold fix (deepspeedai#4368)
  deepspeed4science japanese blog (deepspeedai#4369)
  deepspeed4science chinese blog (deepspeedai#4366)
  Enable workflow dispatch on Torch 1.10 CI tests (deepspeedai#4361)
  Update conda env to have max pydantic version (deepspeedai#4362)
  add deepspeed4science blog link (deepspeedai#4364)
  added check to avoid undefined behavior when the input_id length is greater than max_tokens (deepspeedai#4349)
  Add the policy to run llama model from the official repo (deepspeedai#4313)
  fix deepspeed4science links (deepspeedai#4358)
  DeepSpeed4Science (deepspeedai#4357)
  Support InternLM (deepspeedai#4137)
  Pass base_dir to model files can be loaded for auto-tp/meta-tensor. (deepspeedai#4348)
@tjruwase Good day. This PR is approved and ready to be merged. Could you retrigger this workflow and merge it? Thanks :-)
Sorry for the delay. However, there seems to be a formatting issue; please take a look.
ji-huazhong left a comment:
Resolve format checking errors
Co-authored-by: Hz, Ji <hzji210@gmail.com>
ji-huazhong left a comment:
Fixing these two blank lines should resolve the format-check error.
@CurryRice233, it is best to use this guide for formatting issues: https://github.com/microsoft/DeepSpeed/blob/master/CONTRIBUTING.md#prerequisites
Co-authored-by: Hz, Ji <hzji210@gmail.com>
Thank you, new skill acquired 😉. By the way, could you retrigger this workflow again?
@tjruwase Hi, could you retrigger this workflow again and merge it? Thanks 😀
* add npu support dtypes
* add npu fused_adam support
* add license
* Update accelerator/npu_accelerator.py (Co-authored-by: Hz, Ji <hzji210@gmail.com>)
* Update op_builder/npu/fused_adam.py (seven review-suggestion commits, each Co-authored-by: Hz, Ji <hzji210@gmail.com>)
* Update accelerator/npu_accelerator.py (two review-suggestion commits, Co-authored-by: Hz, Ji <hzji210@gmail.com>)

---------

Co-authored-by: jializheng <jializheng@huawei.com>
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Hz, Ji <hzji210@gmail.com>
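For readers following along: the change routes DeepSpeed's FusedAdam through an NPU-specific op builder that loads a Python shim instead of JIT-compiling a CUDA extension. Below is a minimal sketch of that pattern, not the PR's exact code; the class and method names follow DeepSpeed's builder conventions, the multi_tensor_adam argument layout mirrors the CUDA op, and the plain-PyTorch update stands in for the fused torch_npu kernel the real shim would dispatch to.

```python
# Hedged sketch of an NPU fused-Adam shim; names and argument layout are
# assumptions modeled on DeepSpeed's builder conventions, not the PR's code.


class NPUFusedAdam:
    """Python stand-in for the compiled multi_tensor_adam extension."""

    @staticmethod
    def multi_tensor_adam(chunk_size, noop_flag, tensor_lists, lr, beta1, beta2,
                          eps, step, adam_w_mode, bias_correction, weight_decay):
        # tensor_lists = [grads, params, exp_avgs, exp_avg_sqs], matching the
        # argument order of the CUDA multi_tensor_adam op.
        grads, params, exp_avgs, exp_avg_sqs = tensor_lists
        bc1 = 1 - beta1**step if bias_correction else 1.0
        bc2 = 1 - beta2**step if bias_correction else 1.0
        for g, p, m, v in zip(grads, params, exp_avgs, exp_avg_sqs):
            # Plain-PyTorch Adam/AdamW math; the real shim would call a fused
            # torch_npu kernel here instead of elementwise tensor ops.
            if adam_w_mode:
                p.data.mul_(1 - lr * weight_decay)      # decoupled weight decay
            elif weight_decay != 0:
                g = g.add(p.data, alpha=weight_decay)   # classic L2 regularization
            m.mul_(beta1).add_(g, alpha=1 - beta1)          # first moment
            v.mul_(beta2).addcmul_(g, g, value=1 - beta2)   # second moment
            denom = (v / bc2).sqrt_().add_(eps)
            p.data.addcdiv_(m / bc1, denom, value=-lr)      # parameter update


class FusedAdamBuilder:
    """Minimal builder: nothing to compile, load() hands back the shim."""
    BUILD_VAR = "DS_BUILD_FUSED_ADAM"
    NAME = "fused_adam"

    def sources(self):
        return []  # no C++/CUDA sources to build on NPU

    def load(self):
        return NPUFusedAdam
```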
Add NPU FusedAdam support.
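Once merged, the op should be reachable through DeepSpeed's accelerator abstraction. A hedged usage sketch follows; the 'FusedAdamBuilder' name string and the call layout are assumptions, and on a CUDA machine the same load() would JIT-compile the CUDA extension instead.

```python
import torch

from deepspeed.accelerator import get_accelerator

# Ask the active accelerator (CUDA, NPU, ...) for its fused-Adam builder and
# load the op; on NPU this PR makes that a Python shim rather than a compiled
# CUDA extension.
fused_adam = get_accelerator().create_op_builder('FusedAdamBuilder').load()

p = torch.randn(1024)   # parameter
g = torch.randn(1024)   # gradient
m = torch.zeros(1024)   # exp_avg state
v = torch.zeros(1024)   # exp_avg_sq state

# chunk_size and noop_flag are pass-throughs for the shim; the tensor-list
# order is [grads, params, exp_avgs, exp_avg_sqs] as assumed in the sketch.
fused_adam.multi_tensor_adam(2048, None, [[g], [p], [m], [v]],
                             1e-3, 0.9, 0.999, 1e-8, 1, True, True, 0.01)
```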