support baichuan model #4721
Merged
mrwyattii merged 11 commits into deepspeedai:master on Dec 18, 2023
Conversation
Contributor
baodii
commented
Nov 23, 2023
- fix Baichuan metadata error
- add BaichuanLayer and DecoderLayer to glmtype when preparing the TP fused qkvw
- add get_alibi_mask function for Baichuan to enable TP
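As context for the fused-qkvw change: a fused QKV projection stacks the Q, K, and V weights into one matrix, so tensor parallelism must take each rank's slice of all three sections rather than a contiguous block. The sketch below illustrates that idea with plain lists standing in for tensors; the shapes and split scheme are simplified and do not reproduce DeepSpeed's exact glmtype layout.

```python
# Hedged sketch of tensor-parallel sharding for a fused QKV weight.
# Rows 0..hidden-1 are Q, the next hidden rows are K, the last are V;
# each rank takes its slice of every section. Illustrative only.

def shard_fused_qkv(fused_rows, hidden, rank, world_size):
    """Return this rank's rows: the rank-th slice of Q, K and V each."""
    per_rank = hidden // world_size
    shard = []
    for section in range(3):  # 0 = Q, 1 = K, 2 = V
        start = section * hidden + rank * per_rank
        shard.extend(fused_rows[start:start + per_rank])
    return shard

hidden = 4
fused = list(range(3 * hidden))  # row indices 0..11 standing in for weight rows
print(shard_fused_qkv(fused, hidden, rank=1, world_size=2))
# rank 1 of 2 gets rows [2, 3] of Q, [6, 7] of K, [10, 11] of V
```

A naive contiguous split of the fused matrix would instead give rank 0 all of Q and half of K, which is why fused layers such as BaichuanLayer need the dedicated preparation path.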
Contributor
Author
Collaborator
Hi @RezaYazdaniAminabadi, this PR contains fixes for AutoTP that allow the Baichuan model to run with DeepSpeed AutoTP. Could you review this PR and share any comments? Thanks!
RezaYazdaniAminabadi
approved these changes
Nov 28, 2023
Contributor
RezaYazdaniAminabadi
left a comment
all LGTM, thanks :)
loadams
reviewed
Nov 28, 2023
loadams
reviewed
Dec 8, 2023
if hasattr(self.module.transformer, 'build_mpt_alibi_tensor'):
    self.module.transformer.build_mpt_alibi_tensor_orig = self.module.transformer.build_mpt_alibi_tensor
    self.module.transformer.__class__.build_mpt_alibi_tensor = build_mpt_alibi_tensor
if hasattr(self.module, 'model'):
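The snippet above keeps a reference to the model's original alibi builder and then patches the replacement onto the class, so every bound call inside the model picks up the TP-aware version. A minimal, self-contained sketch of that patching pattern, using hypothetical classes and a simplified slope formula rather than DeepSpeed's actual implementation:

```python
# Hedged sketch of the save-original-then-patch-the-class pattern.
# `Transformer` and the slope math are stand-ins, not DeepSpeed code.

def build_alibi_tensor_sharded(self, num_heads, rank=0, world_size=1):
    """Replacement that builds only this rank's shard of head slopes."""
    heads_per_rank = num_heads // world_size
    start = rank * heads_per_rank
    return [2 ** -(h + 1) for h in range(start, start + heads_per_rank)]

class Transformer:  # stand-in for the HF module being patched
    def build_alibi_tensor(self, num_heads):
        return [2 ** -(h + 1) for h in range(num_heads)]

module = Transformer()
if hasattr(module, 'build_alibi_tensor'):
    # Keep the original as an instance attribute, then patch at the
    # class level so calls from anywhere in the model see the new one.
    module.build_alibi_tensor_orig = module.build_alibi_tensor
    Transformer.build_alibi_tensor = build_alibi_tensor_sharded

print(module.build_alibi_tensor(8, rank=1, world_size=2))
# [0.03125, 0.015625, 0.0078125, 0.00390625]  (heads 4..7 of 8)
```

Patching the class (rather than the instance) matters because other code paths may call the method through a different reference to the same module class.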
Collaborator
@baodii - would it be possible to add a unit test for this model support?
Contributor
Author
Collaborator
I think we should add AutoTP tests in a separate workflow in a separate PR, for several reasons:
- Current AutoTP coverage in the inference unit tests is not adequate (marian and codegen only); many popular models are not tested.
- Popular models have large disk/memory requirements; putting them in a separate workflow lets us isolate the hardware requirements.
- It is suggested to run these AutoTP tests with the deepspeed launcher rather than the pytest distributed launcher; this avoids additional complexity and stays as close to the user environment as possible.

I think we can open an issue for this task and explore it separately.
loadams
approved these changes
Dec 8, 2023
mrwyattii
added a commit
that referenced
this pull request
Jan 10, 2024
This [PR](#4721) added the "DecoderLayer": glmtype mapping, which causes the Falcon model to choose the "glmtype" fused_qkv_type. Falcon models (including FalconDecoderLayer) need to choose 'bloomtype' explicitly. Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>
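The follow-up fix above exists because dispatching on a layer-class name by substring means a generic "DecoderLayer" entry also matches FalconDecoderLayer. A hedged sketch of that failure mode (the dict, key names, and matching rule here are illustrative, not DeepSpeed's actual tables):

```python
# Why a generic "DecoderLayer" -> glmtype mapping misfires: substring
# dispatch matches FalconDecoderLayer too, so an explicit Falcon entry
# must be checked before the generic fallback. Names are illustrative.

FUSED_QKV_TYPES = {
    "FalconDecoderLayer": "bloomtype",  # explicit entry, checked first
    "BaichuanLayer": "glmtype",
    "DecoderLayer": "glmtype",          # generic fallback added by #4721
}

def fused_qkv_type(layer_class_name):
    # Substring match, checked in insertion order.
    for key, qkv_type in FUSED_QKV_TYPES.items():
        if key in layer_class_name:
            return qkv_type
    raise ValueError(f"unknown layer: {layer_class_name}")

print(fused_qkv_type("FalconDecoderLayer"))  # bloomtype
print(fused_qkv_type("BaichuanLayer"))       # glmtype
```

Without the explicit first entry, "FalconDecoderLayer" would fall through to the generic "DecoderLayer" key and incorrectly get glmtype.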
mauryaavinash95
pushed a commit
to mauryaavinash95/DeepSpeed
that referenced
this pull request
Feb 17, 2024
* fix Baichuan metadata error
* add BaichuanLayer and DecoderLayer to glmtype when preparing the TP fused qkvw
* add get_alibi_mask function for Baichuan to enable TP

Co-authored-by: Lai, Yejing <yejing.lai@intel.com>
Co-authored-by: Reza Yazdani <44502768+RezaYazdaniAminabadi@users.noreply.github.com>
Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>
mauryaavinash95
pushed a commit
to mauryaavinash95/DeepSpeed
that referenced
this pull request
Feb 17, 2024
This [PR](deepspeedai#4721) added the "DecoderLayer": glmtype mapping, which causes the Falcon model to choose the "glmtype" fused_qkv_type. Falcon models (including FalconDecoderLayer) need to choose 'bloomtype' explicitly. Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>