-
Notifications
You must be signed in to change notification settings - Fork 27.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[TFBart] Split TF-Bart #9497
[TFBart] Split TF-Bart #9497
Conversation
a82cf67
to
1b2bc4f
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Awesome work!! Just left few smalll comments. I think we should first find a proper fix #9478 and then merging this one. Switching on/off some tests everytime we touch a model is really not a long term solution, I think a proper template as to be stated first and then afterwards we do the models.
IMO this PR should be merged and the s2s fix should be applied afterward as said offline. This PR is blocking a new release currently |
Ok, nevermind, I didn't know you wanted to have it in the next release. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM! Left a few nits, but great work all around! Thank you for taking care of this.
src/transformers/models/blenderbot_small/modeling_tf_blenderbot_small.py
Outdated
Show resolved
Hide resolved
src/transformers/models/blenderbot_small/modeling_tf_blenderbot_small.py
Outdated
Show resolved
Hide resolved
@slow | ||
class TFBartModelIntegrationTest(unittest.TestCase): | ||
def test_inference_no_head(self): | ||
model = TFBartModel.from_pretrained("facebook/bart-large", from_pt=True) | ||
model = TFBartForConditionalGeneration.from_pretrained("facebook/bart-large").model |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why not use TFBartModel
?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
doesn't work, the weights don't load correctly...this didn't work previously either. Could be dealt with in a future PR
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks a lot for doing this! There are just a few repeated mistakes, that probably come from the templates.
src/transformers/models/blenderbot_small/modeling_tf_blenderbot_small.py
Outdated
Show resolved
Hide resolved
src/transformers/models/blenderbot_small/modeling_tf_blenderbot_small.py
Outdated
Show resolved
Hide resolved
src/transformers/models/blenderbot_small/modeling_tf_blenderbot_small.py
Outdated
Show resolved
Hide resolved
* make templates ready * make add_new_model_command_ready * finish tf bart * prepare tf mbart * finish tf bart * add tf mbart * add marian * prep pegasus * add tf pegasus * push blenderbot tf * add blenderbot * add blenderbot small * clean-up * make fix copy * define blend bot tok * fix * up * make style * add to docs * add copy statements * overwrite changes * improve * fix docs * finish * fix last slow test * fix missing git conflict line * fix blenderbot * up * fix blenderbot small * load changes * finish copied from * upload fix
What does this PR do?
TF mirror of: #9343
After PR is merged TODO:
facebook/blenderbot_small-90M
tokenizer - cannot download files from hub. Weird issue