Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[megatron convert] PYTHONPATH requirements #14956

Merged
merged 2 commits into from
Jan 5, 2022

Conversation

stas00
Copy link
Contributor

@stas00 stas00 commented Dec 28, 2021

This PR documents how to tell the megatron conversion scripts to find the Megatron-LM repo, which is needed for recent checkpoints as it was reported at #14939

Since one can't install Megatron-LM as a package we can't make the script require it, so documenting it here and will also document in the model cards.

Fixes: #14939

@LysandreJik


@jdemouth already updated these:

I need to figure out how to get perms to do so.

Copy link
Contributor

@jdemouth jdemouth left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It looks good to me.

@stas00 stas00 requested a review from LysandreJik January 4, 2022 17:57
Copy link
Member

@LysandreJik LysandreJik left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the addition, @stas00!

@LysandreJik LysandreJik merged commit 927f654 into huggingface:master Jan 5, 2022
@stas00 stas00 deleted the meg-convert-pythonpath branch January 5, 2022 17:53
stevhliu pushed a commit to stevhliu/transformers that referenced this pull request Jan 6, 2022
* [megatron convert] PYTHONPATH requirements

* more info
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Cannot Convert Megatron GPT checkpoint
3 participants