
Conversation

@mosheisland
Contributor

When using only_optimize_lora, we still need to train the v_head parameter.

Change-Id: I252c3ee69819997bf336482c6779b070f2e76df8
Signed-off-by: Moshe Island <misland@habana.ai>
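
For context, here is a minimal sketch of the kind of fix this change describes, assuming DeepSpeed-Chat's LoRA parameter naming (`lora_right_weight`/`lora_left_weight`); the `force_optimize_params` argument and the substring match on `v_head` are illustrative assumptions, not necessarily the merged implementation:

```python
import torch.nn as nn

def only_optimize_lora_parameters(model: nn.Module,
                                  force_optimize_params=("v_head",)):
    """Freeze everything except LoRA weights and explicitly whitelisted
    parameters (e.g. the critic/reward model's v_head, which has no LoRA
    adapter and must still be trained).

    Note: force_optimize_params is a hypothetical argument used here
    for illustration.
    """
    for name, param in model.named_parameters():
        if "lora_right_weight" in name or "lora_left_weight" in name:
            param.requires_grad = True   # LoRA adapters stay trainable
        elif any(forced in name for forced in force_optimize_params):
            param.requires_grad = True   # e.g. v_head, trained from scratch
        else:
            param.requires_grad = False  # frozen base-model weight
    return model
```

Without such a whitelist, freezing everything but the LoRA weights would leave the randomly initialized v_head untrained, which would break critic/reward training when only_optimize_lora is enabled.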
@tjruwase merged commit 5161c0f into deepspeedai:master on Oct 16, 2023
@mosheisland deleted the 8_train_v_head_lora branch on October 17, 2023 06:40
hwchen2017 pushed a commit that referenced this pull request Jun 8, 2025
When using only_optimize_lora, we still need to train the v_head parameter.

Change-Id: I252c3ee69819997bf336482c6779b070f2e76df8

Signed-off-by: Moshe Island <misland@habana.ai>
Co-authored-by: Moshe Island <misland@habana.ai>
Co-authored-by: Lev Kurilenko <113481193+lekurile@users.noreply.github.com>