Load sharded pt model to flax (from the hub) #18026


ArthurZucker
Collaborator

What does this PR do?

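Implements automatic conversion from PyTorch to Flax when the model's PyTorch checkpoint is sharded and the model does not already have Flax weights. This lets sharded PyTorch checkpoints from the Hub be loaded in Flax without a separate manual conversion step.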
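
To illustrate what loading a sharded checkpoint involves, here is a minimal sketch, not the code from this PR. It assumes the usual sharded layout in which an index file holds a `weight_map` from parameter names to shard files. The helper names (`group_by_shard`, `unflatten`, `load_sharded`) and the in-memory `fake_shards` are hypothetical, and the sketch leaves out the PyTorch-to-Flax renaming and transposition that a real conversion would also need.

```python
from collections import defaultdict
from typing import Any, Callable, Dict, List


def group_by_shard(index: Dict[str, Any]) -> Dict[str, List[str]]:
    """Map each shard file name to the parameter names stored in it."""
    shards: Dict[str, List[str]] = defaultdict(list)
    for param_name, shard_file in index["weight_map"].items():
        shards[shard_file].append(param_name)
    return dict(shards)


def unflatten(flat: Dict[str, Any]) -> Dict[str, Any]:
    """Turn dotted keys ("a.b.c") into the nested dict layout Flax params use."""
    nested: Dict[str, Any] = {}
    for key, value in flat.items():
        *parents, leaf = key.split(".")
        node = nested
        for part in parents:
            node = node.setdefault(part, {})
        node[leaf] = value
    return nested


def load_sharded(
    index: Dict[str, Any],
    load_shard: Callable[[str], Dict[str, Any]],
) -> Dict[str, Any]:
    """Load every shard once and merge the weights into one nested dict."""
    flat: Dict[str, Any] = {}
    for shard_file, names in group_by_shard(index).items():
        state = load_shard(shard_file)  # hypothetical loader, one call per shard
        for name in names:
            flat[name] = state[name]
    return unflatten(flat)


# Hypothetical stand-ins for an index file and two shard files.
index = {
    "metadata": {"total_size": 3},
    "weight_map": {
        "encoder.dense.weight": "shard-1.bin",
        "encoder.dense.bias": "shard-1.bin",
        "pooler.weight": "shard-2.bin",
    },
}
fake_shards = {
    "shard-1.bin": {"encoder.dense.weight": [[1.0]], "encoder.dense.bias": [0.0]},
    "shard-2.bin": {"pooler.weight": [[2.0]]},
}

params = load_sharded(index, fake_shards.__getitem__)
print(params)
```

From the user's side, the expected entry point is presumably the existing `from_pt=True` argument of `from_pretrained` on a Flax model class, with the shard handling happening behind it.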

Who can review?

A few tests still need to be implemented. For now, it works well with hf-internal-testing/tiny-random-bert-sharded.

HuggingFaceDocBuilderDev commented Jul 5, 2022

The documentation is not available anymore as the PR was closed or merged.

@ArthurZucker ArthurZucker self-assigned this Jul 5, 2022
@ArthurZucker
Collaborator Author

Follows #17537

@ArthurZucker ArthurZucker added Core: Modeling Internals of the library; Models. Flax labels Jul 5, 2022
@ArthurZucker ArthurZucker marked this pull request as ready for review August 2, 2022 08:36
@ArthurZucker ArthurZucker requested a review from sgugger August 2, 2022 08:52
@ArthurZucker ArthurZucker deleted the convert_sharded_pt_to_sharded_flax branch August 2, 2022 08:53
@ArthurZucker ArthurZucker removed the request for review from sgugger August 2, 2022 09:28

2 participants