Fix unicode escapes in jsx identifiers and extended unicode characters in jsdoc #32716

weswigham · 2019-08-05T19:28:42Z

Which are the two other places we used isIdentifierStart in the scanner.

ajafff · 2019-08-05T19:44:35Z

src/compiler/scanner.ts

-                while (isIdentifierPart(text.charCodeAt(pos), ScriptTarget.Latest) && pos < end) {
-                    pos++;
+                let nextChar: number;
+                while (isIdentifierPart(nextChar = codePointAt(text, pos), ScriptTarget.Latest) && pos < end) {


Is it intentional not to allow unicode escape sequences in JSDoc identifiers?

I believe so, but I dunno. Given that it's a comment, it just means that we'll interpret the \u0061 verbatim. @sandersn would be the authority here.

sandersn

I am confused after reading @ajafff's question. How does this change affect jsdoc? The tests make it seem like unicode identifiers (and escapes?) now work, but his question implies that they don't.

sandersn · 2019-08-06T18:31:36Z

src/compiler/scanner.ts

-                    pos++;
+                let nextChar: number;
+                while (isIdentifierPart(nextChar = codePointAt(text, pos), ScriptTarget.Latest) && pos < end) {
+                    pos += charSize(nextChar);


stupid question: why isn't charSize(c) === 2 when c >= 0x10000 instead of c > 0x10000?
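
For context on the boundary in this question: code points from U+10000 upward are stored as two UTF-16 code units in a JavaScript string, so U+10000 itself is the lowest code point that needs a surrogate pair. The sketch below illustrates that. `codePointAtSketch` and `utf16Length` are hypothetical helpers written for this illustration; they are not the scanner's `codePointAt` or `charSize`.

```typescript
// Hypothetical helpers for illustration; not the scanner's actual code.

// Reads the full code point at `pos`, combining a surrogate pair if present.
function codePointAtSketch(text: string, pos: number): number {
    const first = text.charCodeAt(pos);
    if (first >= 0xD800 && first <= 0xDBFF && pos + 1 < text.length) {
        const second = text.charCodeAt(pos + 1);
        if (second >= 0xDC00 && second <= 0xDFFF) {
            return (first - 0xD800) * 0x400 + (second - 0xDC00) + 0x10000;
        }
    }
    return first;
}

// Number of UTF-16 code units a code point occupies in a JS string.
function utf16Length(codePoint: number): number {
    return codePoint >= 0x10000 ? 2 : 1;
}

const firstAstral = "\uD800\uDC00"; // U+10000, the lowest code point needing a surrogate pair
const lastBmp = "\uFFFF";           // U+FFFF, the highest code point that fits in one unit

console.log(codePointAtSketch(firstAstral, 0).toString(16), utf16Length(0x10000));
console.log(codePointAtSketch(lastBmp, 0).toString(16), utf16Length(0xFFFF));
```

With a strict `>` comparison, a helper like this would report one unit for U+10000 and a scanning loop would land on the low surrogate on its next step.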

sandersn · 2019-08-06T18:33:59Z

src/compiler/scanner.ts

-                while (isIdentifierPart(text.charCodeAt(pos), ScriptTarget.Latest) && pos < end) {
-                    pos++;
+                let nextChar: number;
+                while (isIdentifierPart(nextChar = codePointAt(text, pos), ScriptTarget.Latest) && pos < end) {


I'm confused. Doesn't this change cause us to allow escape sequences in JSDoc identifiers?

jsdoc.app doesn't mention unicode as far as google can tell, so we are free to do whatever we want. I vote that JSDoc identifiers work the same as normal ones, which should mean allowing unicode escape sequences.

weswigham · 2019-08-06T19:45:44Z

The tests make it seem like unicode identifiers (and escapes?) now work, but his question implies that they don't.

Unicode characters now scan as identifiers in jsdoc; escapes are still not parsed.
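
A rough sketch of the behaviour described in this comment, under stated assumptions: the loop advances by whole code points, so characters outside the BMP are consumed as part of the identifier, while a backslash simply ends the identifier because escapes are not decoded. All names here are hypothetical, and `looksLikeIdentifierPart` is a crude stand-in for the real `isIdentifierPart` tables.

```typescript
// Illustrative sketch only; names and the identifier predicate are assumptions.

function readCodePoint(text: string, pos: number): number {
    const hi = text.charCodeAt(pos);
    if (hi >= 0xD800 && hi <= 0xDBFF && pos + 1 < text.length) {
        const lo = text.charCodeAt(pos + 1);
        if (lo >= 0xDC00 && lo <= 0xDFFF) {
            return (hi - 0xD800) * 0x400 + (lo - 0xDC00) + 0x10000;
        }
    }
    return hi;
}

// Crude stand-in for isIdentifierPart: ASCII letters, digits, `$`, `_`, and
// (as a simplification) every code point above U+007F.
function looksLikeIdentifierPart(cp: number): boolean {
    return (cp >= 0x30 && cp <= 0x39) || (cp >= 0x41 && cp <= 0x5A) ||
        (cp >= 0x61 && cp <= 0x7A) || cp === 0x24 || cp === 0x5F || cp > 0x7F;
}

// Returns the identifier-like prefix of `text` starting at `start`.
function scanIdentifierPrefix(text: string, start: number): string {
    let pos = start;
    while (pos < text.length) {
        const cp = readCodePoint(text, pos);
        if (!looksLikeIdentifierPart(cp)) break;
        pos += cp >= 0x10000 ? 2 : 1; // step over both halves of a surrogate pair
    }
    return text.slice(start, pos);
}

console.log(scanIdentifierPrefix("a\uD835\uDC9Cb rest", 0)); // consumes "a", U+1D49C, "b"
console.log(scanIdentifierPrefix("a\\u0061b", 0));           // stops at the backslash
```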

weswigham · 2019-08-06T19:46:05Z

Doesn't this change cause us to allow escape sequences in JSDoc identifiers?

No.

sandersn · 2019-08-06T19:54:52Z

That makes sense. It would be nice to have JSDoc support escapes as well, but I think the right way to do that is to make JSDoc use the normal scanner. That's a complete rewrite of the low-level parsing code too, so I'm not sure it's worth it to us.

weswigham · 2019-08-06T20:14:51Z

@sandersn Oh, too late. I already pushed a commit that adds unicode escape support into jsdoc.
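
To make "unicode escape support" concrete, here is a minimal sketch of decoding the two escape forms, `\uXXXX` and `\u{X...}`, at a given position. It is not the code from the pushed commit; `decodeUnicodeEscape` and `DecodedEscape` are hypothetical names invented for this illustration.

```typescript
// Illustrative sketch only; this is not the code from the pushed commit.

interface DecodedEscape {
    codePoint: number; // value encoded by the escape
    length: number;    // number of source characters the escape occupies
}

// Tries to decode `\uXXXX` or `\u{X...}` at `pos`; returns undefined if
// the text there is not a well-formed unicode escape.
function decodeUnicodeEscape(text: string, pos: number): DecodedEscape | undefined {
    if (text.charAt(pos) !== "\\" || text.charAt(pos + 1) !== "u") return undefined;
    const rest = text.slice(pos + 2);
    const braced = /^\{([0-9a-fA-F]+)\}/.exec(rest);
    if (braced) {
        const value = parseInt(braced[1], 16);
        // Code points above U+10FFFF are not valid Unicode.
        return value <= 0x10FFFF ? { codePoint: value, length: 2 + braced[0].length } : undefined;
    }
    const fixed = /^[0-9a-fA-F]{4}/.exec(rest);
    return fixed ? { codePoint: parseInt(fixed[0], 16), length: 6 } : undefined;
}

console.log(decodeUnicodeEscape("\\u0061bc", 0));  // code point 0x61, six source characters
console.log(decodeUnicodeEscape("\\u{1D49C}", 0)); // code point 0x1D49C, nine source characters
console.log(decodeUnicodeEscape("\\x61", 0));      // not a unicode escape
```

A scanner could then pass the decoded code point to its identifier check and advance by `length` source characters rather than by the code point's UTF-16 size.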

orta · 2019-08-10T10:13:58Z

I'm pretty sure this deprecates #32631 - can someone double check me?

weswigham · 2019-08-10T20:31:15Z

I don't think so - in jsdoc, at least, we're still going to stop parsing before the hyphen.

HiEv · 2019-10-05T08:11:50Z

I don't think so - in jsdoc, at least, we're still going to stop parsing before the hyphen.

I'm a bit confused by this reply. In JSDoc you shouldn't stop parsing before the hyphen if there's no space. The JSDoc documentation says that it should be able to handle dashes in its member names. For example:

/**
 * @typedef {object} node
 * @property {string} aria-activedescendant - Description.
 */
/** @type {node} */
var node;

That does work in JSDoc from the command line, but not in VSCode. (See the comments here.)

In VSCode, if you hover the mouse over the "node" on the "@typedef" line, it should show:

type node = {
	aria-activedescendant: string;
}

However, instead you get:

type node = {
	aria: string;
}

with the "-activedescendant" part missing.

If you put quotes around "aria-activedescendant", then you get this instead:

type node = {
	(Missing): string;
}

#32631 was supposed to fix that problem (#14395).

Fix unicode escapes in jsx identifiers and extended unicode characters in jsdoc

5e66535

weswigham requested review from sandersn and RyanCavanaugh August 5, 2019 19:28

ajafff reviewed Aug 5, 2019

sandersn requested changes Aug 6, 2019

Merge branch 'master' into simplify-jsx-identifier-scanning

ae474aa

sandersn approved these changes Aug 6, 2019

Support unicode escapes in JSDoc

c19545e

weswigham added 2 commits August 6, 2019 14:44

Merge branch 'master' into simplify-jsx-identifier-scanning

49bd003

Add tests for extended escapes

628454e

sandersn approved these changes Aug 6, 2019

weswigham merged commit f333684 into microsoft:master Aug 6, 2019

weswigham deleted the simplify-jsx-identifier-scanning branch August 6, 2019 22:14

sandersn mentioned this pull request Aug 7, 2019

🤖 User test baselines have changed #32753

Closed
