Update Llama check to use module instead of model #734

lekurile · 2023-09-14T18:24:36Z

This PR updates the Llama check in the DS-Chat Step 3 PPO trainer to use the actor module object instead of model when accessing the configuration. This is necessary since not all model types will work when using model, particularly for the BLOOM model family.

This PR updates the Llama check in the DS-Chat Step 3 PPO trainer to use the actor module object instead of model when accessing the configuration. This is necessary since not all model types will work when using model, particularly for the BLOOM model family.

Update Llama check to use module instead of model

ba1280d

lekurile requested review from RezaYazdaniAminabadi, ShadenSmith, arashb, awan-10, conglongli, duli2012, eltonzheng, jeffra, minjiaz, mrwyattii, samyam, tjruwase, xiaoxiawu-microsoft and yaozhewei as code owners September 14, 2023 18:24

awan-10 approved these changes Sep 14, 2023

View reviewed changes

Update print_throughput_step3 function as well

c4b8a5c

lekurile merged commit bae2afb into master Sep 14, 2023

lekurile mentioned this pull request Sep 14, 2023

[BUG] use bloomz + hybrid_engine, but AttributeError: 'DS_BloomContainer' object has no attribute 'set_params_wo_copy' deepspeedai/DeepSpeed#3518

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update Llama check to use module instead of model #734

Update Llama check to use module instead of model #734

Uh oh!

lekurile commented Sep 14, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Update Llama check to use module instead of model #734

Update Llama check to use module instead of model #734

Uh oh!

Conversation

lekurile commented Sep 14, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants