-
Notifications
You must be signed in to change notification settings - Fork 41
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
latin chars discarded #43
Comments
It seems a bit silly to always fall back to utf-8 when no character detection libraries are installed. |
That's a really good point @LovecraftianHorror, I didn't realise we had that info on the response. @jprabino could you change your PR #44 for testing this? Thanks to both! |
Got it. Thanks @LovecraftianHorror! |
@LovecraftianHorror can you give an example? Here's the result of that. `self.data 'status':'200 OK' 'data' inside self.data has the subtitles in base64, by the time we convert from base64 we should have the encoding already. |
Oh sorry I should have looked into the library more to see how downloading actually worked. The If you want another reference there's me getting the Hopefully, that gives some options. I should have looked into the code a bit more before commenting the first time. |
Thanks for the insights. 👍 |
The detect function can't always guess the right encoding, so the decompress methods falls back to 'utf-8' and ignoring the errors. Therefore the latin chars get discarded.
There should be an optional param to force the decoding to be used when decoding the data.
The text was updated successfully, but these errors were encountered: