Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement GPT2 Backbone #7894

Merged
merged 52 commits into from
Aug 18, 2023
Merged

Conversation

pforderique
Copy link
Contributor

Implements the GPT2Backbone layer based off KerasNLP.

Dependencies: #7890, #7887

Note: presets is unfinished and is pending conversation about how to properly load in weights, vocab, merges, architecture info, etc.

Copy link
Member

@mattsoulanille mattsoulanille left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM with some longer term questions for us to think about in model loading.

@pforderique pforderique enabled auto-merge (squash) August 18, 2023 00:25
@pforderique pforderique merged commit 0cd53ba into tensorflow:master Aug 18, 2023
@pforderique pforderique deleted the gpt2backbone-impl branch August 18, 2023 01:20
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants