Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Add support for overlapped gradient and parameter synchronization for…
… GPT SFT model (#10041) * Add support for overlapped gradient and parameter synchronization for GPT SFT model Signed-off-by: Michal Futrega <mfutrega@nvidia.com> * Add finalize_model_grads * Apply isort and black reformatting Signed-off-by: michal2409 <michal2409@users.noreply.github.com> --------- Signed-off-by: Michal Futrega <mfutrega@nvidia.com> Signed-off-by: michal2409 <michal2409@users.noreply.github.com> Co-authored-by: michal2409 <michal2409@users.noreply.github.com>
- Loading branch information