-
Notifications
You must be signed in to change notification settings - Fork 2.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
New mcore transformer block spec #9035
New mcore transformer block spec #9035
Conversation
This PR is stale because it has been open for 14 days with no activity. Remove stale label or comment or update or this will be closed in 7 days. |
This PR was closed because it has been inactive for 7 days since being marked as stale. |
nemo/collections/nlp/models/language_modeling/megatron/gpt_full_te_layer_autocast_spec.py
Dismissed
Show dismissed
Hide dismissed
This PR is stale because it has been open for 14 days with no activity. Remove stale label or comment or update or this will be closed in 7 days. |
This PR was closed because it has been inactive for 7 days since being marked as stale. |
* update package info (#8793) Signed-off-by: eharper <eharper@nvidia.com> * update mcore (#8917) Signed-off-by: Jan Baczek <jbaczek@nvidia.com> * Use new mcore transformer block config handling Signed-off-by: Jan Baczek <jbaczek@nvidia.com> * API fixes Signed-off-by: Jan Baczek <jbaczek@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert chages to CI and Dockerfile Signed-off-by: Jan Baczek <jbaczek@nvidia.com> --------- Signed-off-by: eharper <eharper@nvidia.com> Signed-off-by: Jan Baczek <jbaczek@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: Pablo Garay <palenq@gmail.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Jan Baczek <jbaczek@nvidia.com>
0a2c85c
to
a7ae71d
Compare
Signed-off-by: Jan Baczek <jbaczek@nvidia.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. Thank you!
* New mcore transformer block spec (NVIDIA#8925) * update package info (NVIDIA#8793) Signed-off-by: eharper <eharper@nvidia.com> * update mcore (NVIDIA#8917) Signed-off-by: Jan Baczek <jbaczek@nvidia.com> * Use new mcore transformer block config handling Signed-off-by: Jan Baczek <jbaczek@nvidia.com> * API fixes Signed-off-by: Jan Baczek <jbaczek@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert chages to CI and Dockerfile Signed-off-by: Jan Baczek <jbaczek@nvidia.com> --------- Signed-off-by: eharper <eharper@nvidia.com> Signed-off-by: Jan Baczek <jbaczek@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: Pablo Garay <palenq@gmail.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Jan Baczek <jbaczek@nvidia.com> * Adjust function calls after branch rebase Signed-off-by: Jan Baczek <jbaczek@nvidia.com> --------- Signed-off-by: eharper <eharper@nvidia.com> Signed-off-by: Jan Baczek <jbaczek@nvidia.com> Co-authored-by: jbaczek <45043825+jbaczek@users.noreply.github.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: Pablo Garay <palenq@gmail.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Jan Baczek <jbaczek@nvidia.com> Signed-off-by: Boxiang Wang <boxiangw@nvidia.com>
* New mcore transformer block spec (NVIDIA#8925) * update package info (NVIDIA#8793) Signed-off-by: eharper <eharper@nvidia.com> * update mcore (NVIDIA#8917) Signed-off-by: Jan Baczek <jbaczek@nvidia.com> * Use new mcore transformer block config handling Signed-off-by: Jan Baczek <jbaczek@nvidia.com> * API fixes Signed-off-by: Jan Baczek <jbaczek@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert chages to CI and Dockerfile Signed-off-by: Jan Baczek <jbaczek@nvidia.com> --------- Signed-off-by: eharper <eharper@nvidia.com> Signed-off-by: Jan Baczek <jbaczek@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: Pablo Garay <palenq@gmail.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Jan Baczek <jbaczek@nvidia.com> * Adjust function calls after branch rebase Signed-off-by: Jan Baczek <jbaczek@nvidia.com> --------- Signed-off-by: eharper <eharper@nvidia.com> Signed-off-by: Jan Baczek <jbaczek@nvidia.com> Co-authored-by: jbaczek <45043825+jbaczek@users.noreply.github.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: Pablo Garay <palenq@gmail.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Jan Baczek <jbaczek@nvidia.com> Signed-off-by: Vivian Chen <xuanzic@example.com>
…f transformer_config.num_moe_experts NVIDIA/NeMo#9035
…f transformer_config.num_moe_experts NVIDIA/NeMo#9035
…f transformer_config.num_moe_experts NVIDIA/NeMo#9035
…f transformer_config.num_moe_experts NVIDIA/NeMo#9035
…f transformer_config.num_moe_experts NVIDIA/NeMo#9035
* New mcore transformer block spec (#8925) * update package info (#8793) Signed-off-by: eharper <eharper@nvidia.com> * update mcore (#8917) Signed-off-by: Jan Baczek <jbaczek@nvidia.com> * Use new mcore transformer block config handling Signed-off-by: Jan Baczek <jbaczek@nvidia.com> * API fixes Signed-off-by: Jan Baczek <jbaczek@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert chages to CI and Dockerfile Signed-off-by: Jan Baczek <jbaczek@nvidia.com> --------- Signed-off-by: eharper <eharper@nvidia.com> Signed-off-by: Jan Baczek <jbaczek@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: Pablo Garay <palenq@gmail.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Jan Baczek <jbaczek@nvidia.com> * Adjust function calls after branch rebase Signed-off-by: Jan Baczek <jbaczek@nvidia.com> --------- Signed-off-by: eharper <eharper@nvidia.com> Signed-off-by: Jan Baczek <jbaczek@nvidia.com> Co-authored-by: jbaczek <45043825+jbaczek@users.noreply.github.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: Pablo Garay <palenq@gmail.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Jan Baczek <jbaczek@nvidia.com>
…f transformer_config.num_moe_experts NVIDIA/NeMo#9035
…f transformer_config.num_moe_experts NVIDIA/NeMo#9035 Signed-off-by: Terry Kong <terryk@nvidia.com>
…f transformer_config.num_moe_experts NVIDIA/NeMo#9035 Signed-off-by: Terry Kong <terryk@nvidia.com>
* New mcore transformer block spec (NVIDIA#8925) * update package info (NVIDIA#8793) Signed-off-by: eharper <eharper@nvidia.com> * update mcore (NVIDIA#8917) Signed-off-by: Jan Baczek <jbaczek@nvidia.com> * Use new mcore transformer block config handling Signed-off-by: Jan Baczek <jbaczek@nvidia.com> * API fixes Signed-off-by: Jan Baczek <jbaczek@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert chages to CI and Dockerfile Signed-off-by: Jan Baczek <jbaczek@nvidia.com> --------- Signed-off-by: eharper <eharper@nvidia.com> Signed-off-by: Jan Baczek <jbaczek@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: Pablo Garay <palenq@gmail.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Jan Baczek <jbaczek@nvidia.com> * Adjust function calls after branch rebase Signed-off-by: Jan Baczek <jbaczek@nvidia.com> --------- Signed-off-by: eharper <eharper@nvidia.com> Signed-off-by: Jan Baczek <jbaczek@nvidia.com> Co-authored-by: jbaczek <45043825+jbaczek@users.noreply.github.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: Pablo Garay <palenq@gmail.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Jan Baczek <jbaczek@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com>
* New mcore transformer block spec (NVIDIA#8925) * update package info (NVIDIA#8793) Signed-off-by: eharper <eharper@nvidia.com> * update mcore (NVIDIA#8917) Signed-off-by: Jan Baczek <jbaczek@nvidia.com> * Use new mcore transformer block config handling Signed-off-by: Jan Baczek <jbaczek@nvidia.com> * API fixes Signed-off-by: Jan Baczek <jbaczek@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert chages to CI and Dockerfile Signed-off-by: Jan Baczek <jbaczek@nvidia.com> --------- Signed-off-by: eharper <eharper@nvidia.com> Signed-off-by: Jan Baczek <jbaczek@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: Pablo Garay <palenq@gmail.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Jan Baczek <jbaczek@nvidia.com> * Adjust function calls after branch rebase Signed-off-by: Jan Baczek <jbaczek@nvidia.com> --------- Signed-off-by: eharper <eharper@nvidia.com> Signed-off-by: Jan Baczek <jbaczek@nvidia.com> Co-authored-by: jbaczek <45043825+jbaczek@users.noreply.github.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: Pablo Garay <palenq@gmail.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Jan Baczek <jbaczek@nvidia.com>
What does this PR do ?
This PR leverages new mcore gpt spec handling. (not yet merged to mcore)
Collection: [Note which collection this PR will affect]
Changelog
Usage
# Add a code snippet demonstrating how to use this
Jenkins CI
To run Jenkins, a NeMo User with write access must comment
jenkins
on the PR.Before your PR is "Ready for review"
Pre checks:
PR Type:
If you haven't finished some of the above items you can still open "Draft" PR.
Who can review?
Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.
Additional Information