Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Characters not handled in the bookmark title #713

Closed
BarbzYHOOL opened this issue Feb 1, 2024 · 2 comments · Fixed by #715
Closed

Characters not handled in the bookmark title #713

BarbzYHOOL opened this issue Feb 1, 2024 · 2 comments · Fixed by #715
Labels

Comments

@BarbzYHOOL
Copy link

BarbzYHOOL commented Feb 1, 2024

Bug reports

image

Problem with some characters, I think it's only in the bookmark title

When I edit the bookmark it shows me "R�pertoire des articles relatifs �" but it should be "Répertoire des articles relatifs à" and it scrapped the name automatically

@LeXofLeviafan
Copy link
Collaborator

Well. Turns out that BeautifulSoup (the library used for parsing HTML) not only automatically converts the input to UTF-8 but also replaces the charset value in <meta> with it, thus invalidating any code that tries to read charset from <meta> 😅

LeXofLeviafan added a commit to LeXofLeviafan/buku that referenced this issue Feb 1, 2024
LeXofLeviafan added a commit to LeXofLeviafan/buku that referenced this issue Feb 2, 2024
@jarun jarun closed this as completed in #715 Feb 3, 2024
jarun added a commit that referenced this issue Feb 3, 2024
@BarbzYHOOL
Copy link
Author

thx

@github-actions github-actions bot locked and limited conversation to collaborators Mar 6, 2024
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants