Extend sequence length padding for GPT SFT to account for context parallel #8869

Merged
ericharper merged 3 commits into NVIDIA:main from vysarge:vsarge/cp_size_padding on May 6, 2024

Commits

Commits on Apr 18, 2024

Commits on Apr 20, 2024