Remove deprecated arguments from TE's TransformerLayer #7917
Signed-off-by: Jan Baczek <jbaczek@nvidia.com>
The PR changes look fine, but we might want to note that this only changes things for NeMo local GPT models, not the m-core based ones, which are the ones we mostly use now.
I think the main issue is that users are used to configuring this from the config. Now TE expects an environment variable to be set. Mcore makes sure that the env var and the config are consistent. Can NeMo set the env var automatically so that the user doesn't have to?
Yes, that's right. It would probably be better if we automatically set the TE env variable when someone passes it in the config.
Ideally, I would prefer that this env var be set in m-core, since m-core also relies on the argument-passing format and is where we actually interface with TE and set up TE/local layers. But if we feel that needs broader discussion, it might make sense to add this in NeMo for now so that it unblocks the issue?
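For illustration, the behaviour discussed above (derive the TE env var from the config, and refuse to continue if the two disagree) could look roughly like the sketch below. This is not the code in this PR. The env var name `NVTE_APPLY_QK_LAYER_SCALING` and the config key `apply_query_key_layer_scaling` are assumptions, since the thread does not name them, and `sync_te_env_from_config` is a hypothetical helper.

```python
import os

# Assumed names: neither is stated in this thread.
ASSUMED_ENV_VAR = "NVTE_APPLY_QK_LAYER_SCALING"
ASSUMED_CONFIG_KEY = "apply_query_key_layer_scaling"


def sync_te_env_from_config(cfg, env=os.environ):
    """Hypothetical helper: mirror a boolean config flag into an env var.

    If the env var is already set to a conflicting value, raise instead of
    silently overriding the user's choice.
    """
    wanted = "1" if cfg.get(ASSUMED_CONFIG_KEY, False) else "0"
    current = env.get(ASSUMED_ENV_VAR)
    if current is not None and current != wanted:
        raise ValueError(
            f"{ASSUMED_ENV_VAR}={current} conflicts with "
            f"{ASSUMED_CONFIG_KEY}={cfg.get(ASSUMED_CONFIG_KEY)}"
        )
    env[ASSUMED_ENV_VAR] = wanted
    return wanted
```

Such a helper would have to run before the TE layers are constructed. Passing `env` explicitly keeps it testable without touching the real process environment.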
@ericharper @aklife97 can you approve this for now in NeMo so it unblocks?
LGTM. Thanks!
jenkins
Signed-off-by: Jan Baczek <jbaczek@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Signed-off-by: Piotr Żelasko <petezor@gmail.com>
Signed-off-by: Jan Baczek <jbaczek@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Signed-off-by: Sasha Meister <ameister@nvidia.com>
Signed-off-by: Jan Baczek <jbaczek@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com>
What does this PR do?
Transformer Engine has not used these arguments since v0.7, and v1.0 removed them completely.
Collection: NLP
Who can review?
@ericharper