-
Notifications
You must be signed in to change notification settings - Fork 27.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Download model too slow, is there any way #1934
Comments
If your model download is too slow and fails, you can manually download it from our S3 using your browser, wget or cURL as an alternative method. You can then point to a directory that has both the model weights (xxx-pytorch_model.bin) and the configuration file (xxx-config.json) instead of the checkpoint name as the argument for |
OSError: Couldn't reach server at 'https://s3.amazonaws.com/models.huggingface.co/bert/bert-base-cased-config.json' to download pretrained model configuration file. |
Can you open this https://s3.amazonaws.com/models.huggingface.co/bert/bert-base-cased-config.json in your browser ? |
It can be opened in a browser |
in run_lm_finetuning.py, |
Tipically, when you say masked *, you want to use boolean values (0 for absence and 1 for presence). In this particular case (rows n.144-151), you are sampling some tokens in the in each sequence for masked language modeling. For this reason, the probability_matrix variable is being set to boolean values. In fact, the first argument of the masked_fill() method is a boolean Torch tensor (i.e. the boolean vector). You can read more info in the PyTorch docs here. For what concern your issue, post the code for reproducibility and version of TensorFlow, PyTorch, Transformers.
|
@bigzhouj this is probably due to a Pytorch version error. I believe |
in run_lm_finetuning.py
transformers.file_utils - https://s3.amazonaws.com/models.huggingface.co/bert/bert-base-cased-pytorch_model.bin not found in cache or force_download set to True, downloading to ....
The text was updated successfully, but these errors were encountered: