-
Notifications
You must be signed in to change notification settings - Fork 2.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Pre-generate cu_seqlens argmin and max_seqlen to remove host-to-device sync #8108
Conversation
jenkins |
cb68ea6
to
b44a596
Compare
jenkins |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM thanks!
jenkins |
@ericharper |
Just waiting on internal CI tests. They look good so far. We'll most likely merge first thing tomorrow. |
jenkins |
bd87cb1
to
bec9217
Compare
jenkins |
…e data copies Signed-off-by: Sangkug Lym <slym@nvidia.com> [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci
bec9217
to
1c09d92
Compare
jenkins |
jenkins |
jenkins |
jenkins |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. Thanks!
Ran nemo-ci as well and all tests passed.
…e data copies (NVIDIA#8108) Signed-off-by: Sangkug Lym <slym@nvidia.com> [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Co-authored-by: Eric Harper <complex451@gmail.com>
…e data copies (NVIDIA#8108) Signed-off-by: Sangkug Lym <slym@nvidia.com> [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Co-authored-by: Eric Harper <complex451@gmail.com>
…e data copies (#8108) Signed-off-by: Sangkug Lym <slym@nvidia.com> [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Co-authored-by: Eric Harper <complex451@gmail.com> Signed-off-by: stevehuang52 <heh@nvidia.com>
…e data copies (NVIDIA#8108) Signed-off-by: Sangkug Lym <slym@nvidia.com> [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Co-authored-by: Eric Harper <complex451@gmail.com> Signed-off-by: Sasha Meister <ameister@nvidia.com>
…e data copies (#8108) Signed-off-by: Sangkug Lym <slym@nvidia.com> [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Co-authored-by: Eric Harper <complex451@gmail.com> Signed-off-by: Pablo Garay <pagaray@nvidia.com>
…e data copies (NVIDIA#8108) Signed-off-by: Sangkug Lym <slym@nvidia.com> [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Co-authored-by: Eric Harper <complex451@gmail.com>
…e data copies
What does this PR do ?
At SFT, pre-generate
cu_seqlens_argmin
andmax_seqlen
to skip CPU-side computation on the main process.This removes host-to-device sync, which is performance-critical when each kernel execution is short.
Changelog
Usage
# Add a code snippet demonstrating how to use this
Jenkins CI
To run Jenkins, a NeMo User with write access must comment
jenkins
on the PR.Before your PR is "Ready for review"
Pre checks:
PR Type:
If you haven't finished some of the above items you can still open "Draft" PR.
Who can review?
Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.
Additional Information