Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Correctly show/store latin1 encodet documents. #57

Open
cphsolutionslab opened this issue Oct 9, 2013 · 1 comment
Open

Correctly show/store latin1 encodet documents. #57

cphsolutionslab opened this issue Oct 9, 2013 · 1 comment

Comments

@cphsolutionslab
Copy link

Is it possible to tell the DataStorer to accept and correctly encode latin1-sources/documents?

Currently it does not show latin1-encodet files correctly, only utf-8.

@cphsolutionslab
Copy link
Author

I've tried to "hack" the datastorer to accept latin1 by adding:
chunk = chunk.decode('latin1').encode('utf-8')
in the fetch_resource.py.

That acutally did the job for a while. But after some days (yes, I know this sounds weird) it fails again. It seems to be messytables (any_tables) in commands.py which removes international characters. When opening the file in commands.py my Danish characters are correctly shown. After running any_tables, the characters are removed in row_set.

Can someone help?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant