Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fixed UnicodeDecodeError: 'ascii' codec can't decode byte 0xc2 #42

Merged
merged 2 commits into from
Nov 20, 2018

Conversation

weiyumou
Copy link
Contributor

I encountered UnicodeDecodeError: 'ascii' codec can't decode byte 0xc2 in position 3793: ordinal not in range(128) when running the starter example shown under the Usage section. It turned out to be related to the load_vocab function in tokenization.py. Forcing open to use encoding utf8 solved this issue on my machine.

@thomwolf thomwolf merged commit fd32ebe into huggingface:master Nov 20, 2018
@thomwolf
Copy link
Member

Thanks!

qwang70 pushed a commit to DRL36/pytorch-pretrained-BERT that referenced this pull request Mar 2, 2019
Fixed UnicodeDecodeError: 'ascii' codec can't decode byte 0xc2
stevezheng23 added a commit to stevezheng23/transformers that referenced this pull request Mar 24, 2020
fix loss computing issue for roberta-mt-qa
jameshennessytempus pushed a commit to jameshennessytempus/transformers that referenced this pull request Jun 1, 2023
jonb377 pushed a commit to jonb377/hf-transformers that referenced this pull request Nov 3, 2023
Summary:
This pull request convert the user guide into a mark down and upload
it to GH. The user guide is authored by Jon and me.
ZYC-ModelCloud pushed a commit to ZYC-ModelCloud/transformers that referenced this pull request Nov 14, 2024
ZYC-ModelCloud pushed a commit to ZYC-ModelCloud/transformers that referenced this pull request Nov 14, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants