
fix fused_qkv model accuracy issue #5217

Merged: 2 commits into microsoft:master on Mar 6, 2024

Conversation

Yejing-Lai (Contributor)

A fused_qkv model cannot correctly choose its fused_qkv type; the module_name_matches logic needs to be updated.

@Yejing-Lai (Contributor, Author)

Hi @mrwyattii @delock. Please kindly review, thanks!

@delock (Collaborator) commented Mar 1, 2024

@Yejing-Lai, what specific values of module_str and k caused the issue?

@Yejing-Lai (Contributor, Author)

> @Yejing-Lai, what specific values of module_str and k caused the issue?

It is a logic error: k should be found inside module_str, but the current module_name_matches check tests the containment in the wrong direction, so it never picks up the correct fused_type and every model ends up with the bloom type.
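A minimal sketch of the check in question (the dict entries and helper name here are illustrative, not the actual DeepSpeed source):

```python
# Simplified illustration of the module_name_matches logic; the dict entries
# and the helper name are examples only, not the real table in DeepSpeed.
fused_type_dict = {
    "BloomBlock": "bloomtype",
    "CodeGenBlock": "codegentype",
}

def find_fused_type_keys(module_str):
    # Broken direction: the full printed model string can never be a substring
    # of a short key, so the match list stays empty and the code falls through
    # to a default (per the comment above, the bloom type).
    # matches = [k for k in fused_type_dict if module_str in k]

    # Fixed direction: a key such as "CodeGenBlock" is a substring of the
    # printed model string, so the correct fused_qkv type can be selected.
    matches = [k for k in fused_type_dict if k in module_str]
    return matches
```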

@delock (Collaborator) commented Mar 1, 2024

For "if k in module_str", does that mean k is a substring of module_str, or that k is an element of module_str treated as a list?

@Yejing-Lai (Contributor, Author)

> For "if k in module_str", does that mean k is a substring of module_str, or that k is an element of module_str treated as a list?

Yes, k is a substring of module_str.

@Yejing-Lai (Contributor, Author)

For example:

```
k = "CodeGenBlock"
module_str = "
(module): CodeGenForCausalLM(
  (transformer): CodeGenModel(
    (wte): Embedding(51200, 2560)
    (drop): Dropout(p=0.0, inplace=False)
    (h): ModuleList(
      (0-31): 32 x CodeGenBlock(
        (ln_1): LayerNorm((2560,), eps=1e-05, elementwise_affine=True)
        (attn): CodeGenAttention(
          (attn_dropout): Dropout(p=0.0, inplace=False)
          (resid_dropout): Dropout(p=0.0, inplace=False)
          (qkv_proj): LinearLayer()
          (out_proj): LinearAllreduce()
        )
        (mlp): CodeGenMLP(
          (fc_in): LinearLayer()
          (fc_out): LinearAllreduce()
          (act): NewGELUActivation()
          (dropout): Dropout(p=0.0, inplace=False)
        )
      )
    )
    (ln_f): LayerNorm((2560,), eps=1e-05, elementwise_affine=True)
  )
  (lm_head): LmHeadLinearAllreduce()
)
"
```
We need to test whether k is in module_str, not whether module_str is in k (a short illustration follows below). Thanks.
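To make the direction concrete, here is the same check applied to an abbreviated version of the printed model above (illustration only):

```python
# module_str stands in for the printed CodeGenForCausalLM tree above,
# abbreviated here so the snippet runs on its own.
module_str = "(module): CodeGenForCausalLM( ... (0-31): 32 x CodeGenBlock( ... )"
k = "CodeGenBlock"

print(k in module_str)   # True  -> correct direction, the key is found
print(module_str in k)   # False -> wrong direction, no key ever matches
```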

@delock (Collaborator) commented Mar 5, 2024

Hi @mrwyattii, can you help review this PR? It fixes an accuracy issue for various models with fused qkv, e.g. Baichuan, CodeGen, BLOOM, and MPT.

@loadams added this pull request to the merge queue Mar 5, 2024
Merged via the queue into microsoft:master with commit bc0d246 Mar 6, 2024
12 checks passed
ShellyNR pushed a commit to ShellyNR/DeepSpeed that referenced this pull request Mar 11, 2024
rraminen pushed a commit to ROCm/DeepSpeed that referenced this pull request May 9, 2024