Implement is_identifier_character using JS regexp (wasm) #113

strager · 2020-12-26T11:21:25Z

After compilation, lex-unicode.cpp is huge. This increases download time for quick-lint-js, making the demo less fun.

JavaScript regular expressions can match based on Unicode character properties (e.g. /\p{L}/u for letters). For wasm-compiled code which runs in Node.js and in the browser, implement lexer::is_identifier_character and lexer::is_initial_identifier_character using JavaScript regexps instead of the existing lookup table.

I estimate this'll reduce uncompressed download sizes by about 130 KiB, and reduce compressed download sizes by about 30 KiB.

The text was updated successfully, but these errors were encountered:

CodeItQuick · 2021-01-06T05:05:27Z

Dibs!

strager · 2022-04-15T09:02:02Z

Given #542, I think this task is a bad idea. We want our own tables to support different Unicode versions.

strager added the good third issue label Dec 26, 2020

strager added this to the public release (v1.0) milestone Dec 26, 2020

strager mentioned this issue Dec 27, 2020

Optimize wasm build binary size #120

Closed

strager assigned CodeItQuick Jan 11, 2021

strager removed this from the public release (v1.0) milestone Jan 26, 2021

strager unassigned CodeItQuick Apr 29, 2021

strager removed the good third issue label May 22, 2021

strager added the performance Slowness or potential optimization label Dec 2, 2021

strager closed this as completed Apr 15, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement is_identifier_character using JS regexp (wasm) #113

Implement is_identifier_character using JS regexp (wasm) #113

strager commented Dec 26, 2020

CodeItQuick commented Jan 6, 2021

strager commented Apr 15, 2022

Implement is_identifier_character using JS regexp (wasm) #113

Implement is_identifier_character using JS regexp (wasm) #113

Comments

strager commented Dec 26, 2020

CodeItQuick commented Jan 6, 2021

strager commented Apr 15, 2022