
Adding MT5 support #629

Merged
merged 11 commits into adapter-hub:main on Jan 28, 2024

Conversation

sotwi (Contributor) commented Jan 5, 2024

Pull request to address #568.

I followed the updated guide for adding adapters to a model and did a very quick port. I took the same approach as the mBART implementation (it reused the BART mixins; I reused the T5 mixins), so the changes were minimal.

I hope it works.

calpt linked an issue on Jan 5, 2024 that may be closed by this pull request
sotwi (Contributor, Author) commented Jan 5, 2024

There appears to be an issue when loading the public mt5 weights into the AdapterModel.
I am not sure what causes it, but I suspect it is because those weights already include an lm_head.weight entry in their state dict. That seems to break the initialization of the flexible heads.

sotwi (Contributor, Author) commented Jan 8, 2024

When I try to load public mt5 weights (say mt5-small) with either AutoAdapterModel or MT5AdapterModel I get the following error:

Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/home/wsotomar/anaconda3/envs/dev_adapters/lib/python3.8/site-packages/transformers/models/auto/auto_factory.py", line 566, in from_pretrained
    return model_class.from_pretrained(
  File "/home/wsotomar/anaconda3/envs/dev_adapters/lib/python3.8/site-packages/transformers/modeling_utils.py", line 3480, in from_pretrained
    ) = cls._load_pretrained_model(
  File "/home/wsotomar/Code/adapters/src/adapters/heads/base.py", line 969, in _load_pretrained_model
    return super()._load_pretrained_model(
  File "/home/wsotomar/anaconda3/envs/dev_adapters/lib/python3.8/site-packages/transformers/modeling_utils.py", line 3752, in _load_pretrained_model
    raise ValueError(
ValueError: The state dictionary of the model you are trying to load is corrupted. Are you sure it was properly saved?

I then tried loading the weights with the transformers MT5Model class, which drops the lm_head layer that comes prepackaged with the weights. I saved that model locally and then loaded the saved copy with both AutoAdapterModel and MT5AdapterModel without any issues.

I really think that the inclusion of the lm_head in the original weights interferes with the loading process of ModelWithFlexibleHeadsAdaptersMixin, and that is why it fails. But I can't quite pin down the exact cause, nor how to fix it.
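To make the suspected failure mode concrete, here is a minimal, purely illustrative sketch of the key mismatch being described. This is not the actual adapters/transformers loading code, and all key names besides lm_head.weight are hypothetical; it only shows why a checkpoint that ships an lm_head entry can trip a strict state-dict check in a model that registers its heads under different names, and why re-saving through MT5Model (which drops that entry) works around it:

```python
# Hypothetical sketch of the suspected state-dict mismatch.
# NOT the real adapters/transformers logic; key names are illustrative.

def diff_state_dicts(checkpoint_keys, expected_keys):
    """Return (unexpected, missing) keys relative to what the model declares."""
    unexpected = sorted(set(checkpoint_keys) - set(expected_keys))
    missing = sorted(set(expected_keys) - set(checkpoint_keys))
    return unexpected, missing

# The public mt5 checkpoint ships a prepackaged lm_head entry ...
checkpoint_keys = [
    "shared.weight",
    "encoder.block.0.layer.0.SelfAttention.q.weight",
    "lm_head.weight",
]
# ... while a flexible-heads model registers its prediction heads under
# different names, so "lm_head.weight" is not among its expected keys.
expected_keys = [
    "shared.weight",
    "encoder.block.0.layer.0.SelfAttention.q.weight",
]

unexpected, missing = diff_state_dicts(checkpoint_keys, expected_keys)
print(unexpected)  # ['lm_head.weight']

# The round trip through MT5Model effectively strips that entry,
# which is why the re-saved checkpoint loads cleanly afterwards.
cleaned = [k for k in checkpoint_keys if k not in unexpected]
print(cleaned == expected_keys)  # True
```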

If anyone wants to have a look at it that would be great.

calpt (Member) commented Jan 17, 2024

@sotwi thanks so much for your work on this so far! Will look into the issue you mentioned shortly.

calpt (Member) commented Jan 26, 2024

Thanks again for working on this. I've looked into the issue and am working on fixing it separately in #640 (covering both the failing tests and the lm_head error). Once the fix there is ready, this PR is good to merge from my side!

calpt merged commit 5f91178 into adapter-hub:main on Jan 28, 2024 (3 checks passed)
sotwi (Contributor, Author) commented Jan 29, 2024

Thank you for your help @calpt! I am glad it is working now!

Successfully merging this pull request may close these issues.

Add mt5 support