Skip to content

Add support for overlapped gradient and parameter synchronization for GPT SFT model#10041

Merged
cuichenx merged 3 commits intomainfrom mfutrega/mcore_dist_optAug 6, 2024

Commits

Commits on Aug 5, 2024