Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Convert HTML pages into Unicode #8

Open
hpreusse opened this issue Apr 12, 2021 · 0 comments
Open

Convert HTML pages into Unicode #8

hpreusse opened this issue Apr 12, 2021 · 0 comments

Comments

@hpreusse
Copy link
Contributor

hpreusse commented Apr 12, 2021

Our nice quality checker lintian reports that some (generated) HTML files contain national encoding, many files for the french part of the docs. Indeed, "file" prints something like

hille@debian-amd64-sid:~/devel/zzz_empty/MWE$ file GoingfurtherOthertools.html
GoingfurtherOthertools.html: HTML document, Non-ISO extended-ASCII text

According to the tex4ht people, one needs to specify to create utf-8 at the command line or one should use make4ht instead. I tested both, I can confirm that both methods works; in addition the tex4ht.env needs to be removed.

Based on this I created a patch (the alternative option make4ht is contained but commented). If you need me to create a pull request for this patch, call back.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant