
fix falcon-40b accuracy issue #4895

Merged
mrwyattii merged 3 commits into deepspeedai:master from Yejing-Lai:lyj/falcon_accuracy
Jan 10, 2024

Conversation

@Yejing-Lai
Contributor

PR #4721 added the "DecoderLayer":glmtype entry. As a result, the Falcon model selects the "glmtype" fused_qkv_type. The Falcon model (including Falcondecoderlayer) needs to select 'bloomtype' explicitly.
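To illustrate why a generic "DecoderLayer" entry can capture Falcon, here is a minimal sketch. It assumes the fused QKV type is picked by substring-matching table keys against the layer's class name. The table contents, the function names, and the longest-match rule are hypothetical illustrations, not DeepSpeed's actual code.

```python
# Hypothetical, simplified table: NOT DeepSpeed's real mapping.
# Keys are substrings of a layer class name; values are fused_qkv_type labels.
FUSED_QKV_TYPES = {
    "BloomBlock": "bloomtype",
    "DecoderLayer": "glmtype",             # generic entry
    "FalconDecoderLayer": "bloomtype",     # explicit Falcon entry
}


def first_match(layer_class_name, table):
    """Return the type of the first key found in the name (insertion order)."""
    for key, fused_type in table.items():
        if key in layer_class_name:
            return fused_type
    return None


def longest_match(layer_class_name, table):
    """Return the type of the longest key found in the name."""
    matches = [key for key in table if key in layer_class_name]
    if not matches:
        return None
    return table[max(matches, key=len)]


# "DecoderLayer" is a substring of "FalconDecoderLayer", so a first-match
# lookup hits the generic entry before the explicit Falcon one.
print(first_match("FalconDecoderLayer", FUSED_QKV_TYPES))    # glmtype
print(longest_match("FalconDecoderLayer", FUSED_QKV_TYPES))  # bloomtype
```

Under these assumptions, adding an explicit Falcon entry is not enough on its own: the lookup also has to prefer the most specific key, which the longest-match rule does here.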

@delock
Collaborator

delock commented Jan 5, 2024

Hi @Yejing-Lai, to better understand the accuracy issue this PR intends to fix, can you post a reproducer script along with the output before and after the fix? Thanks!

@Yejing-Lai
Contributor Author

Hi @mrwyattii. From the failure log it seems like a device space issue. Can you check this issue? Thanks!

@mrwyattii mrwyattii enabled auto-merge January 9, 2024 22:32
@mrwyattii mrwyattii disabled auto-merge January 10, 2024 00:45
@mrwyattii mrwyattii merged commit 16c265c into deepspeedai:master Jan 10, 2024
mauryaavinash95 pushed a commit to mauryaavinash95/DeepSpeed that referenced this pull request Feb 17, 2024
This [PR](deepspeedai#4721) added the
"DecoderLayer":glmtype. It will cause the Falcon model to choose
"glmtype" fused_qkv_type. Falcon model (including Falcondecoderlayer)
needs to choose 'bloomtype' explicitly.

Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>