Skip to content

Conversation

@ys950902
Copy link
Contributor

Hi, this pr is to enable bf16 on deepspeedChat for RLHF training, the config function(get_train_ds_config/get_eval_ds_config) is fixed to FP16 before, add flag "--bf16" to enable bf16 as a option to use, if not add it, fp16 is by default like before.

@ys950902
Copy link
Contributor Author

Hi, for this pr, I have verified on my side with the script you give for all tree training steps on deepspeed-chat, it can enable bfloat16 with flag "--bf16" added, if not add this flag is by default fp16, if you have some suggestions or update, please comment here, many thanks.

@ys950902
Copy link
Contributor Author

Hi @tjruwase, thanks for your feedback, I have followed your suggestions to update this issue, so that can expand easily when more data_type needed in the future, could you please also take a look on it. Thanks a lot!

@tjruwase
Copy link
Contributor

@ys950902, thanks so much for this great PR.

@tjruwase tjruwase merged commit 9b3d898 into deepspeedai:master Sep 22, 2023
hwchen2017 pushed a commit that referenced this pull request Jun 8, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants