Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error Instantiating T5-11B from conributed models #5423

Closed
lordtt13 opened this issue Jul 1, 2020 · 3 comments
Closed

Error Instantiating T5-11B from conributed models #5423

lordtt13 opened this issue Jul 1, 2020 · 3 comments

Comments

@lordtt13
Copy link
Contributor

lordtt13 commented Jul 1, 2020

🐛 Bug

Information

Model I am using : T5-11B

Language I am using the model on: English

The problem arises when using:
when I try downloading the T5-11B model

The tasks I am working on is:
Evaluating ROGUE score on CNN dataset

To reproduce

Steps to reproduce the behavior:
Just try instantiating the T5-11B model using the AutoModel Class

Error Message:
OSError: Can't load weights for 't5-11b'. Make sure that:

  • 't5-11b' is a correct model identifier listed on 'https://huggingface.co/models'

  • or 't5-11b' is the correct path to a directory containing a file named one of pytorch_model.bin, tf_model.h5, model.ckpt.

Expected behavior

Would instatntiate the

Environment info

  • transformers version: 2.11.0
  • Platform: Linux-5.3.0-61-generic-x86_64-with-glibc2.10
  • Python version: 3.8.3
  • PyTorch version (GPU?): 1.5.0 (True)
  • Tensorflow version (GPU?): 2.2.0 (False)
  • Using GPU in script?: Yes
  • Using distributed or parallel set-up in script?: No
@shamanez
Copy link
Contributor

same result. I can't download

@julien-c
Copy link
Member

Please see #5986 (comment)

@lordtt13
Copy link
Contributor Author

Works when I use:

import transformers

t5 = transformers.AutoModel.from_pretrained('t5-11b', use_cdn = False)

Thank You!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants