Fix UnicodeDecodeError for Chinese #75

Brightcells · 2018-03-14T10:51:11Z

ipdb> value
'\xe8\xa5\xbf\xe6\xb1\x89\xe6\x96\x87\xe5\xad\xa6\xe6\x88\x90\xe5\xb0\xb1\xe4\xb8\xad\xef\xbc\x8c\xe6\x9c\x80\xe4\xb8\xba\xe7\xaa\x81\xe5\x87\xba\xe7\x9a\x84\xe6\x98\xaf\xef\xbc\x9f'
ipdb> unicode(value)
*** UnicodeDecodeError: 'ascii' codec can't decode byte 0xe8 in position 0: ordinal not in range(128)
ipdb> cc.Convert2Unicode(value)

u'\u897f\u6c49\u6587\u5b66\u6210\u5c31\u4e2d\uff0c\u6700\u4e3a\u7a81\u51fa\u7684\u662f\uff1f'
ipdb> print cc.Convert2Unicode(value)

西汉文学成就中，最为突出的是？
ipdb>

…n position 0: ordinal not in range(128)

Parkayun · 2018-03-15T08:11:06Z

@Brightcells
Hi. Can I know why use CodeConvert instead of to write function?
I want this package has minimum dependency.
And test failed.

Brightcells · 2018-03-15T13:36:21Z

Chinese character may be utf8 or gbk.
I does not to test whether Chinese character in .bson file can be gbk or not.
Instead I use CodeConvert to realize convert to unicode.

Brightcells · 2018-03-15T14:06:30Z

Fixed travis tests

🐛 Fix Bug: UnicodeDecodeError: 'ascii' codec can't decode byte 0xe8 i…

02286fd

…n position 0: ordinal not in range(128)

Brightcells added 2 commits March 15, 2018 21:44

👷 Update .travis.yml

33d8e30

⬆️ Upgrading CodeConvert

0a41b21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix UnicodeDecodeError for Chinese #75

Fix UnicodeDecodeError for Chinese #75

Brightcells commented Mar 14, 2018

Parkayun commented Mar 15, 2018

Brightcells commented Mar 15, 2018

Brightcells commented Mar 15, 2018

Fix UnicodeDecodeError for Chinese #75

Are you sure you want to change the base?

Fix UnicodeDecodeError for Chinese #75

Conversation

Brightcells commented Mar 14, 2018

Parkayun commented Mar 15, 2018

Brightcells commented Mar 15, 2018

Brightcells commented Mar 15, 2018