Fix Unicode normalization on macOS #1063

Snack-X · 2016-12-01T07:48:28Z

This is a fix for #586.

Incorrect rendering before normalization (current behavior):

and correct rendering with normalization:

I only recently discovered this nice terminal and haven't looked at the codebase deeply.
So adding normalization here may not be appropriate, and there may be unexpected side effects.
Honestly, I'm not sure.

albinekb · 2017-02-26T01:17:03Z

This is a good read on Unicode normalization: http://unicode.org/faq/normalization.html

chabou · 2017-08-14T09:06:31Z

Sorry for the delay.

@Snack-X thank you so much for your PR. Can you attach a txt file with your ls output in order to reproduce it (and test it with xterm, our hterm replacement)?

Snack-X · 2017-08-16T05:36:50Z

This is an old issue, but the problem still remains. The original PR was made before the 1.0.0 release. Time has passed since then, and this PR won't be compatible with the current codebase.

Anyway, you can easily reproduce the same problem with reverse xxd.

First, here are the hexadecimal values of each string, as written and after NFD normalization.

> Buffer.from("にっぽん", "utf8")
<Buffer e3 81 ab e3 81 a3 e3 81 bd e3 82 93>
> Buffer.from("にっぽん".normalize("NFD"), "utf8")
<Buffer e3 81 ab e3 81 a3 e3 81 bb e3 82 9a e3 82 93>
> Buffer.from("ニッポン", "utf8")
<Buffer e3 83 8b e3 83 83 e3 83 9d e3 83 b3>
> Buffer.from("ニッポン".normalize("NFD"), "utf8")
<Buffer e3 83 8b e3 83 83 e3 83 9b e3 82 9a e3 83 b3>
> Buffer.from("대한민국", "utf8")
<Buffer eb 8c 80 ed 95 9c eb af bc ea b5 ad>
> Buffer.from("대한민국".normalize("NFD"), "utf8")
<Buffer e1 84 83 e1 85 a2 e1 84 92 e1 85 a1 e1 86 ab e1 84 86 e1 85 b5 e1 86 ab e1 84 80 e1 85 ae e1 86 a8>
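The byte dumps above can be cross-checked with a small Node.js script. This is only a sketch for illustration: the `describe` helper is a hypothetical name, not something from Hyper's codebase. It counts code points and UTF-8 bytes for the composed (NFC) and decomposed (NFD) form of each sample string, and checks that NFD converts back to NFC without loss.

```javascript
// Sample strings taken from the REPL session above.
const samples = ["にっぽん", "ニッポン", "대한민국"];

// Hypothetical helper: summarize the NFC and NFD forms of a string.
function describe(text) {
  const nfc = text.normalize("NFC"); // composed form
  const nfd = text.normalize("NFD"); // decomposed form
  return {
    nfcCodePoints: Array.from(nfc).length, // Array.from iterates by code point
    nfdCodePoints: Array.from(nfd).length,
    nfcBytes: Buffer.byteLength(nfc, "utf8"),
    nfdBytes: Buffer.byteLength(nfd, "utf8"),
    roundTrips: nfd.normalize("NFC") === nfc, // NFD -> NFC restores the original
  };
}

for (const sample of samples) {
  console.log(sample, describe(sample));
}
```

The byte counts should agree with the buffers above: 12 bytes composed versus 15 decomposed for the two Japanese strings, and 12 versus 33 for the Korean one.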

Using reverse xxd on those bytes, the problem is reproducible.

$ echo "e3 81 ab e3 81 a3 e3 81 bd e3 82 93 0a" | xxd -r -p
にっぽん
$ echo "e3 81 ab e3 81 a3 e3 81 bb e3 82 9a e3 82 93 0a" | xxd -r -p
にっぽん
$ echo "e3 83 8b e3 83 83 e3 83 9d e3 83 b3 0a" | xxd -r -p
ニッポン
$ echo "e3 83 8b e3 83 83 e3 83 9b e3 82 9a e3 83 b3 0a" | xxd -r -p
ニッポン
$ echo "eb 8c 80 ed 95 9c eb af bc ea b5 ad 0a" | xxd -r -p
대한민국
$ echo "e1 84 83 e1 85 a2 e1 84 92 e1 85 a1 e1 86 ab e1 84 86 e1 85 b5 e1 86 ab e1 84 80 e1 85 ae e1 86 a8 0a" | xxd -r -p
대한민국
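The idea behind the proposed fix can be sketched in Node.js as follows. This is not the actual patch from this PR, which is not shown in the thread. Both `fromHexDump` and `normalizeForDisplay` are hypothetical helpers, and the sketch assumes the whole string is available at once (a real terminal stream could split a combining sequence across chunks, which this does not handle).

```javascript
// Hypothetical helper: decode a spaced hex dump, similar to `xxd -r -p`.
function fromHexDump(hex) {
  return Buffer.from(hex.replace(/\s+/g, ""), "hex").toString("utf8");
}

// Hypothetical helper: compose the text (NFC) before it reaches the renderer.
function normalizeForDisplay(text) {
  return text.normalize("NFC");
}

// Byte sequences copied from the xxd commands above (without the trailing 0a).
const composed = fromHexDump("e3 81 ab e3 81 a3 e3 81 bd e3 82 93");
const decomposed = fromHexDump("e3 81 ab e3 81 a3 e3 81 bb e3 82 9a e3 82 93");

// The raw strings differ even though they represent the same text.
console.log(composed === decomposed); // false

// After NFC normalization both inputs become the same string.
console.log(normalizeForDisplay(composed) === normalizeForDisplay(decomposed)); // true
```

Normalizing to NFC means the renderer only ever sees precomposed characters, so it does not need to combine a base character with a following mark itself.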

The screenshots below compare three terminal applications: Terminal.app (bundled with macOS 10.12.6), iTerm 2 (latest beta), and Hyper (1.3.3.1754).

Setting aside the incorrect rendering (why did it get worse?), Terminal and iTerm handle the decomposed input perfectly, while Hyper does not.

If you have a problem with fonts, you can use D2Coding, a monospace font that supports Korean Hangul, Japanese Hiragana and Katakana, and CJK Ideographs.

Snack-X · 2017-08-16T05:38:26Z

It looks like the rendering issue is related to #1535, and the corresponding PR is #2000.

Snack-X · 2017-09-25T08:35:46Z

After almost a year, I can confirm this issue is fixed in the latest release, 2.0.3.

Everything is rendered as expected, with no normalization issues. That took a long time.

You can close this PR if you wish.

albinekb · 2017-09-25T09:57:23Z

We needed to change from hterm to xterm; that's why it took so long, @Snack-X.

Thanks for your PR though! ❤️

Fix Unicode normalization on macOS

0d2d400

albinekb closed this Sep 25, 2017
