util: improve unicode support #31319

BridgeAR · 2020-01-11T19:13:23Z

The array grouping function relies on the width of the characters.
It was not calculated correct so far, since it used the string
length instead.
This improves the unicode output by calculating the mono-spaced
font width (other fonts might differ).

I had to move some functions. Otherwise we'd have to load the utils functions by default and that did not seem necessary.

Checklist

make -j4 test (UNIX), or vcbuild test (Windows) passes
tests and/or benchmarks are included
documentation is changed or added
commit message follows commit guidelines

Trott · 2020-01-13T15:22:52Z

I'm not sure who to ping for a review. @srl295 maybe?

srl295

LGTM. Could optimize the existing code, but generally LGTM.

srl295 · 2020-01-13T17:15:55Z

lib/internal/util/inspect.js

+  const isFullWidthCodePoint = (code) => {
+    // Code points are partially derived from:
+    // http://www.unicode.org/Public/UNIDATA/EastAsianWidth.txt
+    return code >= 0x1100 && (


So this might be doable as a regex… it could be compiled as a regex, i don't think there's an East Asian Width property available in regex.

This could definitely be a regular expression. I guess it's slower that way but I did not check. I'll have a look soon.

ICU4C also has API to get the East Asian Width.

Yes, we use that in case Node.js is build with ICU but this is the fallback code.

The array grouping function relies on the width of the characters. It was not calculated correct so far, since it used the string length instead. This improves the unicode output by calculating the mono-spaced font width (other fonts might differ).

nodejs-github-bot · 2020-01-17T08:54:06Z

CI: https://ci.nodejs.org/job/node-test-pull-request/28463/

Trott · 2020-01-18T00:29:01Z

Relevant test failures in the no-intl host on CI?

nodejs-github-bot · 2020-01-20T17:28:20Z

CI: https://ci.nodejs.org/job/node-test-pull-request/28510/

The array grouping function relies on the width of the characters. It was not calculated correct so far, since it used the string length instead. This improves the unicode output by calculating the mono-spaced font width (other fonts might differ). PR-URL: #31319 Reviewed-By: James M Snell <jasnell@gmail.com> Reviewed-By: Steven R Loomis <srloomis@us.ibm.com> Reviewed-By: Rich Trott <rtrott@gmail.com> Reviewed-By: Minwoo Jung <nodecorelab@gmail.com>

BridgeAR · 2020-01-22T14:53:35Z

Landed in 8fb5fe2 🎉

The array grouping function relies on the width of the characters. It was not calculated correct so far, since it used the string length instead. This improves the unicode output by calculating the mono-spaced font width (other fonts might differ). PR-URL: #31319 Reviewed-By: James M Snell <jasnell@gmail.com> Reviewed-By: Steven R Loomis <srloomis@us.ibm.com> Reviewed-By: Rich Trott <rtrott@gmail.com> Reviewed-By: Minwoo Jung <nodecorelab@gmail.com>

codebytere · 2020-03-15T00:54:11Z

@BridgeAR if this should go back to v12.x it'll need a manual backport, but feel free to update the label if it shouldn't land!

The array grouping function relies on the width of the characters. It was not calculated correct so far, since it used the string length instead. This improves the unicode output by calculating the mono-spaced font width (other fonts might differ). PR-URL: nodejs#31319 Reviewed-By: James M Snell <jasnell@gmail.com> Reviewed-By: Steven R Loomis <srloomis@us.ibm.com> Reviewed-By: Rich Trott <rtrott@gmail.com> Reviewed-By: Minwoo Jung <nodecorelab@gmail.com>

The array grouping function relies on the width of the characters. It was not calculated correct so far, since it used the string length instead. This improves the unicode output by calculating the mono-spaced font width (other fonts might differ). PR-URL: #31319 Reviewed-By: James M Snell <jasnell@gmail.com> Reviewed-By: Steven R Loomis <srloomis@us.ibm.com> Reviewed-By: Rich Trott <rtrott@gmail.com> Reviewed-By: Minwoo Jung <nodecorelab@gmail.com>

nodejs-github-bot added readline Issues and PRs related to the built-in readline module. util Issues and PRs related to the built-in util module. labels Jan 11, 2020

BridgeAR force-pushed the 2020-01-11-util-better-unicode-support branch from 81735c6 to 02515e8 Compare January 11, 2020 22:36

This comment has been minimized.

Sign in to view

BridgeAR force-pushed the 2020-01-11-util-better-unicode-support branch from 02515e8 to 80fe23b Compare January 12, 2020 02:30

This comment has been minimized.

Sign in to view

BridgeAR force-pushed the 2020-01-11-util-better-unicode-support branch from da70f98 to fc7f090 Compare January 12, 2020 23:38

jasnell approved these changes Jan 13, 2020

View reviewed changes

srl295 approved these changes Jan 13, 2020

View reviewed changes

Trott approved these changes Jan 14, 2020

View reviewed changes

JungMinu approved these changes Jan 16, 2020

View reviewed changes

util: improve unicode support

1b9547f

The array grouping function relies on the width of the characters. It was not calculated correct so far, since it used the string length instead. This improves the unicode output by calculating the mono-spaced font width (other fonts might differ).

BridgeAR force-pushed the 2020-01-11-util-better-unicode-support branch from fc7f090 to 1b9547f Compare January 17, 2020 08:51

BridgeAR added the author ready PRs that have at least one approval, no pending requests for changes, and a CI started. label Jan 17, 2020

fixup

31282d7

Trott removed the author ready PRs that have at least one approval, no pending requests for changes, and a CI started. label Jan 18, 2020

fixup

44db342

BridgeAR added the author ready PRs that have at least one approval, no pending requests for changes, and a CI started. label Jan 20, 2020

BridgeAR closed this Jan 22, 2020

codebytere mentioned this pull request Feb 17, 2020

v13.9.0 proposal #31837

Merged

codebytere added the backport-requested-v12.x label Mar 15, 2020

targos removed backport-requested-v12.x author ready PRs that have at least one approval, no pending requests for changes, and a CI started. labels Apr 25, 2020

targos mentioned this pull request May 2, 2020

v12.17.0 release proposal #33197

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

util: improve unicode support #31319

util: improve unicode support #31319

BridgeAR commented Jan 11, 2020

This comment has been minimized.

This comment has been minimized.

Trott commented Jan 13, 2020

srl295 left a comment

srl295 Jan 13, 2020

BridgeAR Jan 14, 2020

srl295 Jan 14, 2020

BridgeAR Jan 16, 2020

nodejs-github-bot commented Jan 17, 2020

Trott commented Jan 18, 2020

nodejs-github-bot commented Jan 20, 2020

BridgeAR commented Jan 22, 2020

codebytere commented Mar 15, 2020

util: improve unicode support #31319

util: improve unicode support #31319

Conversation

BridgeAR commented Jan 11, 2020

Checklist

This comment has been minimized.

This comment has been minimized.

Trott commented Jan 13, 2020

srl295 left a comment

Choose a reason for hiding this comment

srl295 Jan 13, 2020

Choose a reason for hiding this comment

BridgeAR Jan 14, 2020

Choose a reason for hiding this comment

srl295 Jan 14, 2020

Choose a reason for hiding this comment

BridgeAR Jan 16, 2020

Choose a reason for hiding this comment

nodejs-github-bot commented Jan 17, 2020

Trott commented Jan 18, 2020

nodejs-github-bot commented Jan 20, 2020

BridgeAR commented Jan 22, 2020

codebytere commented Mar 15, 2020