fix falcon-40b accuracy issue by Yejing-Lai · Pull Request #4895 · deepspeedai/DeepSpeed

Yejing-Lai · 2024-01-04T07:49:26Z

An earlier PR (#4721) added the "DecoderLayer": glmtype mapping. As a result, the Falcon model selects the "glmtype" fused_qkv_type. The Falcon model (including FalconDecoderLayer) needs to select "bloomtype" explicitly.
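To illustrate the failure mode described above, here is a minimal sketch. It assumes the fused QKV type is chosen by checking whether a table key occurs inside the module name, which is an assumption for illustration and not a copy of the DeepSpeed code. The tables and the `pick_fused_qkv_type` helper are hypothetical; only the key names and type strings come from this PR.

```python
# Hypothetical lookup tables; keys and type strings mirror those quoted in this PR.
TABLE_BEFORE = {
    "BloomBlock": "bloomtype",
    "DecoderLayer": "glmtype",  # generic key, also matches "FalconDecoderLayer"
}
TABLE_AFTER = {
    "BloomBlock": "bloomtype",
    "FalconDecoderLayer": "bloomtype",  # explicit entry, checked before the generic key
    "DecoderLayer": "glmtype",
}


def pick_fused_qkv_type(module_name, table):
    """Return the type of the first table key contained in module_name, else None."""
    for key, fused_type in table.items():  # dicts preserve insertion order
        if key in module_name:
            return fused_type
    return None


before = pick_fused_qkv_type("FalconDecoderLayer", TABLE_BEFORE)
after = pick_fused_qkv_type("FalconDecoderLayer", TABLE_AFTER)
print(before, after)
```

In this sketch the generic "DecoderLayer" key captures the Falcon layer name unless a more specific entry is matched first, which is why an explicit Falcon entry is needed.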

delock · 2024-01-05T01:03:01Z

Hi @Yejing-Lai, to better understand the accuracy issue this PR intends to fix, can you post a reproducer script and add the output before and after the fix? Thanks!

Yejing-Lai · 2024-01-09T05:32:16Z

Hi @mrwyattii. From the failure log it seems like a device space issue. Can you check this issue? Thanks!

This [PR](deepspeedai#4721) added the "DecoderLayer": glmtype mapping. It causes the Falcon model to select the "glmtype" fused_qkv_type. The Falcon model (including FalconDecoderLayer) needs to select "bloomtype" explicitly.

Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>

fix falcon-40b accuracy issue

9b77d8c

Yejing-Lai requested review from arashb, awan-10 and mrwyattii as code owners January 4, 2024 07:49

This was referenced Jan 5, 2024

(Do not merge) (CPU) aggregation of few recent fixes/optimizations #3920

Closed

[BUG] Falcon-40 have accuracy issue when using autotp #4903

Closed

Merge branch 'master' into lyj/falcon_accuracy

0c035dd

mrwyattii enabled auto-merge January 9, 2024 22:32

mrwyattii disabled auto-merge January 10, 2024 00:45

Merge branch 'master' into lyj/falcon_accuracy

248c41d

mrwyattii merged commit 16c265c into deepspeedai:master Jan 10, 2024

mrwyattii merged 3 commits into deepspeedai:master from Yejing-Lai:lyj/falcon_accuracy
