Consistently perform bootstrap and encode Brotli config for improved caching and reduced code complexity #1836

devversion · 2022-07-13T18:44:33Z

Note to self: consider closing #1177 when this is merged, if it does indeed apply to worker threads -- @cspotcode

1. Cleans up the BootstrapState for the new approach, removing all the flags that detect whether we are in the CLI.
2. Introduces a new type for the bootstrap state that can be encoded into the Brotli config.
   - This is a subset of the bootstrap state with explicit fields that can be cached.
   - This makes it obvious what can be cached/encoded, and encourages thinking before new state is passed through.
   - It also reduces the number of bytes encoded in the payload.
3. getEntryPointInfo (from the previous ESM fix PR) no longer takes the full bootstrap state; it takes only the fields it needs. This is necessary because only the required properties are cached, so it cannot expect the full parsed arguments (see 2).
4. As we discussed, we now always encode the Brotli state into process.execArgv, regardless of whether ESM is enabled. This ensures that phases 1-3 do not need to be repeated when forking, and it makes the mental model simpler and more predictable.
   - A shared function is used to compute the child process script and execArgv. This logic can be used both for setting execArgv and for getting the args needed to go directly from bin.js into the child process when ESM is needed.
5. The existing recursive-forking execArgv test in index.spec.ts had to be updated:
   - So that it now knows that forked processes always go to the child entry-point with the Brotli config (see 4).
   - So that we also test the forking execArgv for ESM (in addition to the actual worker fork tests we have).
6. Some of the forking tests I added with my first PR have been renamed and adjusted to not rely on --cwd, and instead use the working directory when spawning ts-node (like cd dir && ts-node).
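
For readers unfamiliar with the payload idea, here is a minimal sketch of encoding a cacheable subset of bootstrap state as a Brotli-compressed, base64-encoded argument. The `CacheableBootstrapState` type, its fields, the `ARG_PREFIX` value, and the helper names are assumptions made for illustration; they are not taken from this PR's diff. Only the `node:zlib` calls are real Node.js APIs.

```typescript
import { brotliCompressSync, brotliDecompressSync, constants } from "node:zlib";

// Hypothetical subset of the bootstrap state: only explicit, serializable fields.
interface CacheableBootstrapState {
  cwd: string;
  esm: boolean;
  projectPath?: string;
}

// Hypothetical argument prefix used to recognize the payload in an argument list.
const ARG_PREFIX = "--brotli-base64-config=";

// Serialize to JSON, compress with Brotli at the fastest quality, then base64-encode.
function encodeState(state: CacheableBootstrapState): string {
  const json = Buffer.from(JSON.stringify(state), "utf8");
  const compressed = brotliCompressSync(json, {
    params: { [constants.BROTLI_PARAM_QUALITY]: constants.BROTLI_MIN_QUALITY },
  });
  return ARG_PREFIX + compressed.toString("base64");
}

// Reverse of encodeState: strip the prefix, base64-decode, decompress, parse.
function decodeState(arg: string): CacheableBootstrapState {
  if (!arg.startsWith(ARG_PREFIX)) {
    throw new Error("Unexpected argument: " + arg);
  }
  const compressed = Buffer.from(arg.slice(ARG_PREFIX.length), "base64");
  return JSON.parse(brotliDecompressSync(compressed).toString("utf8"));
}
```

Keeping the type small and explicit means anything added to the payload has to be added to the interface first, which is the "think before passing new state through" property described in item 2.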

devversion · 2022-07-13T18:47:48Z

cc. @cspotcode. I'm super open to not always going to the child entry-point (when e.g. ESM is not enabled), but I believe this is worth considering as it makes all of the bootstrapping more readable / obvious (while we talk about feature creep and code smell).

cspotcode · 2022-07-13T19:04:55Z

Thanks, I'll have to give this some thought.

It's a breaking change to use a child process for all users. It affects PIDs and signal handling and whatnot. And it's a performance regression for users who only need CJS.

Some of us in the ecosystem believe that the child process will eventually be unnecessary, once node adds the appropriate features.
nodejs/node#33903
sindresorhus/import-from#9
gulpjs/rechoir#43 (comment)
A heated discussion cropped up a few hours ago:
nodejs/node#43818

...so I kinda want to treat the child process as a hack that's only done when absolutely necessary.

To get some of the code quality benefits you describe, I suppose we could pass the brotli payload in-process even when we don't spawn a child. So like, create the payload, and pass it in-memory to the next phase of bootstrapping.

That gives us some of the safety and code readability guarantees you describe without imposing the breaking change / perf hit on CJS consumers.
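
A rough sketch of that suggestion, under assumptions: the payload is built once, and the only branching is whether it is handed to a spawned child or to the next phase in the same process. `BootstrapPayload`, `continueBootstrap`, and the two callbacks are hypothetical names used for illustration, not ts-node APIs.

```typescript
// Hypothetical payload shape; the real one would carry the cacheable bootstrap state.
interface BootstrapPayload {
  esm: boolean;
  entryPoint: string;
}

type PayloadConsumer = (payload: BootstrapPayload) => void;

// Decide where the final bootstrap phase runs. Both paths receive the same
// serialized-then-parsed payload, so the in-process path cannot accidentally
// depend on state that a child process would never see.
function continueBootstrap(
  payload: BootstrapPayload,
  runFinalPhase: PayloadConsumer,
  spawnChild: PayloadConsumer
): "child" | "in-process" {
  const transported: BootstrapPayload = JSON.parse(JSON.stringify(payload));
  if (transported.esm) {
    // A child process is only used when it is strictly needed (here: ESM).
    spawnChild(transported);
    return "child";
  }
  // CJS users stay in the same process: no extra PID, no spawn cost.
  runFinalPhase(transported);
  return "in-process";
}
```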

devversion · 2022-07-13T20:09:11Z

@cspotcode Yeah, it's trivial to just jump into the final phase in-memory (from the same process). Good points you raised there. I will make this change and rebase the PR. Should be ready for your review/thoughts tomorrow.

codecov · 2022-07-14T11:57:54Z

Codecov Report

Merging #1836 (9c0a5a9) into main (97f9afd) will decrease coverage by 0.00%.
The diff coverage is 93.58%.

Impacted Files	Coverage Δ
src/child/spawn-child-with-esm.ts	`82.35% <82.35%> (ø)`
src/bin.ts	`88.29% <95.74%> (-0.17%)`	⬇️
src/child/child-entrypoint.ts	`87.50% <100.00%> (-5.36%)`	⬇️
src/child/child-exec-args.ts	`100.00% <100.00%> (ø)`
src/child/argv-payload.ts	`83.33% <0.00%> (-16.67%)`	⬇️

cspotcode · 2022-07-14T13:17:33Z

Only skimming the code, so I may have misinterpreted. But FWIW it's ok to make the breaking change to stop resolving entrypoint relative to our --cwd. Not sure if that simplifies things.

cspotcode · 2022-07-14T13:18:38Z

Also feel free to push a long commit history, no need to force-push if you don't want to. We will squash merge it to main anyway.

devversion · 2022-07-14T13:29:39Z

@cspotcode Thanks for looking. Sounds good. I will not squash the fixup commits (just a habit from other repos). Regarding --cwd: it's low-effort to keep it working as part of this PR, and it will be easy to remove it in a follow-up. thx

cspotcode · 2022-07-14T13:32:29Z

Sounds good. FWIW I'm going to start merging breaking changes to main, so we're fully in breakage mode.

devversion · 2022-07-14T14:03:46Z

Nice! just for your information: This PR is now green and ready for review when you find some time. No rush, thanks!

cspotcode · 2022-07-14T15:08:15Z

Thanks, can I get a quick bulleted list of the changes and their motivation, to assist review? In another comment, or by updating the issue description?

ts-node --esm avoids loading the typescript compiler in the parent process, which is good for performance. Loading TS compiler is slow. Does this PR preserve that optimization?

devversion · 2022-07-14T16:29:36Z

@cspotcode This PR:

1. Cleans up the BootstrapState for the new approach, removing all the flags that detect whether we are in the CLI.
2. Introduces a new type for the bootstrap state that can be encoded into the Brotli config.
   - This is a subset of the bootstrap state with explicit fields that can be cached.
   - This makes it obvious what can be cached/encoded, and encourages thinking before new state is passed through.
   - It also reduces the number of bytes encoded in the payload.
3. getEntryPointInfo (from the previous ESM fix PR) no longer takes the full bootstrap state; it takes only the fields it needs. This is necessary because only the required properties are cached, so it cannot expect the full parsed arguments (see 2).
4. As we discussed, we now always encode the Brotli state into process.execArgv, regardless of whether ESM is enabled. This ensures that phases 1-3 do not need to be repeated when forking, and it makes the mental model simpler and more predictable.
   - A shared function is used to compute the child process script and execArgv. This logic can be used both for setting execArgv and for getting the args needed to go directly from bin.js into the child process when ESM is needed.
5. The existing recursive-forking execArgv test in index.spec.ts had to be updated:
   - So that it now knows that forked processes always go to the child entry-point with the Brotli config (see 4).
   - So that we also test the forking execArgv for ESM (in addition to the actual worker fork tests we have).
6. Some of the forking tests I added with my first PR have been renamed and adjusted to not rely on --cwd, and instead use the working directory when spawning ts-node (like cd dir && ts-node).
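
To illustrate item 4, here is a hedged sketch of what a shared helper for the child script and execArgv computation could look like. All names (`ChildInvocation`, `getChildInvocation`, `toExecArgv`, `toSpawnArgs`) and the example paths are hypothetical; the only real Node.js detail relied on is the `--loader` flag.

```typescript
// Hypothetical description of how to enter the child entry-point.
interface ChildInvocation {
  nodeExecArgs: string[]; // flags for node itself
  childScriptPath: string; // path to the child entry-point script
  childScriptArgs: string[]; // arguments for the child script (the encoded payload)
}

// Single source of truth, computed once from the encoded payload.
function getChildInvocation(opts: {
  esm: boolean;
  encodedPayload: string;
  childScriptPath: string;
  loaderUrl: string;
}): ChildInvocation {
  return {
    nodeExecArgs: opts.esm ? ["--loader", opts.loaderUrl] : [],
    childScriptPath: opts.childScriptPath,
    childScriptArgs: [opts.encodedPayload],
  };
}

// Use 1: a value for process.execArgv, so that forked processes re-enter
// the child entry-point with the already-computed payload.
function toExecArgv(inv: ChildInvocation): string[] {
  return [...inv.nodeExecArgs, inv.childScriptPath, ...inv.childScriptArgs];
}

// Use 2: the full argument list for spawning node directly from bin.js.
function toSpawnArgs(inv: ChildInvocation, userArgs: readonly string[]): string[] {
  return [...toExecArgv(inv), ...userArgs];
}
```

Deriving both argument lists from one `ChildInvocation` value keeps the fork path and the direct-spawn path from drifting apart.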

is this sufficient information?

> ts-node --esm avoids loading the typescript compiler in the parent process, which is good for performance. Loading TS compiler is slow. Does this PR preserve that optimization?

I'm not sure this is worth doing. It would reintroduce some complexity for something that you also acknowledged to be just a workaround in the long term. I do realize that loading TS is usually a little slow, but is it such a big deal that people would notice in the meantime? Note also that TS already loads twice if people enable esm in the tsconfig. The added complexity is that we would need to pass the full parseArgv result into the initial child entry-point (needed for phase3), and so on.

…caching/reduced complexity Additionally, this PR streamlines the bootstrap mechanism to always call into the child script, resulting in reduced complexity and improved caching for user-initiated forked processes, i.e. the tsconfig resolution is not repeated multiple times because forked processes are expected to preserve the existing ts-node project. More details can be found in TypeStrong#1831. Fixes TypeStrong#1812.

…proved caching/reduced complexity Do not use `Nodenext` as `esnext` is sufficient and makes it work with all TS versions we have in the matrix.

…proved caching/reduced complexity Re-add fast-path for ESM when we detect it before phase3. And simplify code.

devversion · 2022-07-14T18:55:27Z

@cspotcode Please find the bullet-point list you requested in the above comment. I also added the fast-path to avoid loading the TS compiler when we detect --esm before loading the tsconfig. I was able to actually re-add this without making the code more complicated as I reworked some of the types/structs I created with the initial commits. See the latest fixup.

I also rebased on top of your breaking changes in main
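
As a loose illustration of the fast-path idea (not the code from the fixup commit), the decision can be made from the raw CLI flags before any tsconfig or compiler is loaded. `routeBeforeConfig`, the `Route` type, and the `loadCompiler` callback are hypothetical, and the flag scan assumes, for simplicity, that CLI flags do not take a separate value token.

```typescript
// Hypothetical outcome of the early routing decision.
type Route = "spawn-child-early" | "continue-in-parent";

function routeBeforeConfig(
  argv: readonly string[],
  loadCompiler: () => unknown
): Route {
  // Only flags before the first positional argument (the script) belong to
  // the CLI; anything after it is passed through to the user's script.
  const firstPositional = argv.findIndex((arg) => !arg.startsWith("-"));
  const cliFlags = firstPositional === -1 ? argv : argv.slice(0, firstPositional);

  if (cliFlags.includes("--esm")) {
    // Fast path: hand off to the child without loading the compiler here.
    return "spawn-child-early";
  }

  // Slow path: the compiler is needed in this process to read the tsconfig.
  loadCompiler();
  return "continue-in-parent";
}
```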

devversion · 2022-07-31T21:03:15Z

Hey @cspotcode, just wanted to check if there is any way I can help move this forward. No rush, just a friendly check-in

devversion · 2022-08-26T20:32:47Z

Friendly ping on this again. Happy to make any changes, and also happy to discuss if this doesn't look reasonable.

devversion · 2023-02-09T13:58:03Z

It's been a while. I'm ready to rebase this, or close this, but would like to get it off my plate. Please give an update.

devversion · 2023-03-02T13:56:32Z

@cspotcode It's unfortunate that I need to close this due to no response at all. It would have been totally fine to say that it's not worth the review effort given lack of time, etc.

In general, I think the current code in main could definitely benefit from improved type-safety and from avoiding unnecessary work. At least we were able to merge #1814 to fix forking separately.

devversion force-pushed the always-encode-brotli-and-fix-forking branch from 7c0678f to 3812d56 Compare July 13, 2022 18:46

devversion force-pushed the always-encode-brotli-and-fix-forking branch 4 times, most recently from a7c46a4 to cbdf2c2 Compare July 14, 2022 11:50

devversion force-pushed the always-encode-brotli-and-fix-forking branch 3 times, most recently from a7e6486 to 6f0ee70 Compare July 14, 2022 13:15

devversion added 2 commits July 14, 2022 18:48

fixup! Consistently perform bootstrap and encode Brotli config for im…

e43b665

…proved caching/reduced complexity Do not use `Nodenext` as `esnext` is sufficient and makes it work with all TS versions we have in the matrix.

devversion force-pushed the always-encode-brotli-and-fix-forking branch from 44ab40b to fbc7754 Compare July 14, 2022 18:49

fixup! Consistently perform bootstrap and encode Brotli config for im…

9c0a5a9

…proved caching/reduced complexity Re-add fast-path for ESM when we detect it before phase3. And simplify code.

devversion force-pushed the always-encode-brotli-and-fix-forking branch from fbc7754 to 9c0a5a9 Compare July 14, 2022 18:53

devversion closed this Mar 2, 2023
