Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

chore: patch jsii-rosetta to limit the number of worker threads #14389

Merged
merged 1 commit into from
Apr 27, 2021

Conversation

njlynch
Copy link
Contributor

@njlynch njlynch commented Apr 27, 2021

jsii-rosetta uses multiple worker threads by default to speed up sample
extraction. It defaults to spawning workers equal to half the number of
cores. On extremely powerful build machines (e.g., CodeBuild X2_LARGE compute
with 72 vCPUs), the high number of workers (36) results in thrash and high
memory usage due to duplicate loads of source files. This was causing the v2
builds to fail with:
"FATAL ERROR: NewSpace::Rebalance Allocation failed - JavaScript heap out of memory"

The patch simply limits the top number of worker threads to an
arbitrarily-chosen maximum limit of 16. We could simply disable the worker
threads, but this takes much longer to process. With single-threading, rosetta
takes ~35 minutes to extract samples from the CDK; with 16 workers, it takes
~3.5 minutes.

I absolutely want to find the proper way to backfeed this into rosetta, but
opting for the simple patch approach at the moment because our v2 builds have
been blocked for about a week already.


By submitting this pull request, I confirm that my contribution is made under the terms of the Apache-2.0 license

jsii-rosetta uses multiple worker threads by default to speed up sample
extraction.  It defaults to spawning workers equal to half the number of
cores. On extremely powerful build machines (e.g., CodeBuild X2_LARGE compute
with 72 vCPUs), the high number of workers (36) results in thrash and high
memory usage due to duplicate loads of source files. This was causing the v2
builds to fail with:
"FATAL ERROR: NewSpace::Rebalance Allocation failed - JavaScript heap out of memory"

The patch simply limits the top number of worker threads to an
arbitrarily-chosen maximum limit of 16. We could simply disable the worker
threads, but this takes much longer to process. With single-threading, rosetta
takes ~35 minutes to extract samples from the CDK; with 16 workers, it takes
~3.5 minutes.

I absolutely want to find the proper way to backfeed this into rosetta, but
opting for the simple patch approach at the moment because our v2 builds have
been blocked for about a week already.
@njlynch njlynch requested a review from a team April 27, 2021 08:24
@njlynch njlynch self-assigned this Apr 27, 2021
@gitpod-io
Copy link

gitpod-io bot commented Apr 27, 2021

@mergify mergify bot added the contribution/core This is a PR that came from AWS. label Apr 27, 2021
@aws-cdk-automation
Copy link
Collaborator

AWS CodeBuild CI Report

  • CodeBuild project: AutoBuildProject89A8053A-LhjRyN9kxr8o
  • Commit ID: 6720966
  • Result: SUCCEEDED
  • Build Logs (available for 30 days)

Powered by github-codebuild-logs, available on the AWS Serverless Application Repository

@mergify
Copy link
Contributor

mergify bot commented Apr 27, 2021

Thank you for contributing! Your pull request will be updated from master and then merged automatically (do not update manually, and be sure to allow changes to be pushed to your fork).

@mergify mergify bot merged commit 07fe771 into v2-main Apr 27, 2021
@mergify mergify bot deleted the njlynch/pack-build-fix branch April 27, 2021 08:46
@njlynch
Copy link
Contributor Author

njlynch commented Apr 27, 2021

@Mergifyio backport master

mergify bot pushed a commit that referenced this pull request Apr 27, 2021
jsii-rosetta uses multiple worker threads by default to speed up sample
extraction.  It defaults to spawning workers equal to half the number of
cores. On extremely powerful build machines (e.g., CodeBuild X2_LARGE compute
with 72 vCPUs), the high number of workers (36) results in thrash and high
memory usage due to duplicate loads of source files. This was causing the v2
builds to fail with:
"FATAL ERROR: NewSpace::Rebalance Allocation failed - JavaScript heap out of memory"

The patch simply limits the top number of worker threads to an
arbitrarily-chosen maximum limit of 16. We could simply disable the worker
threads, but this takes much longer to process. With single-threading, rosetta
takes ~35 minutes to extract samples from the CDK; with 16 workers, it takes
~3.5 minutes.

I absolutely want to find the proper way to backfeed this into rosetta, but
opting for the simple patch approach at the moment because our v2 builds have
been blocked for about a week already.

----

*By submitting this pull request, I confirm that my contribution is made under the terms of the Apache-2.0 license*

(cherry picked from commit 07fe771)
@mergify
Copy link
Contributor

mergify bot commented Apr 27, 2021

Command backport master: success

Backports have been created

njlynch added a commit to aws/jsii that referenced this pull request Apr 27, 2021
The current behavior of rosetta (extract) is to use a number of node worker
threads equal to half the number of CPU cores. On large CI/CD build hardware,
this may mean creating a huge number of worker threads which causes more
contention than benefit.

Introduce JSII_ROSETTA_MAX_WORKER_COUNT to limit the maximum number of workers
(defaults to 16).

See aws/aws-cdk#14389 for more motivation from the AWS CDK.
mergify bot added a commit that referenced this pull request Apr 27, 2021
…port #14389) (#14393)

This is an automatic backport of pull request #14389 done by [Mergify](https://mergify.io).

---


<details>
<summary>Mergify commands and options</summary>

<br />

More conditions and actions can be found in the [documentation](https://docs.mergify.io/).

You can also trigger Mergify actions by commenting on this pull request:

- `@Mergifyio refresh` will re-evaluate the rules
- `@Mergifyio rebase` will rebase this PR on its base branch
- `@Mergifyio update` will merge the base branch into this PR
- `@Mergifyio backport <destination>` will backport this PR on `<destination>` branch

Additionally, on Mergify [dashboard](https://dashboard.mergify.io/) you can:

- look at your merge queues
- generate the Mergify configuration with the config editor.

Finally, you can contact us on https://mergify.io/
</details>
njlynch added a commit to aws/jsii that referenced this pull request Apr 27, 2021
The current behavior of rosetta (extract) is to use a number of node worker
threads equal to half the number of CPU cores. On large CI/CD build hardware,
this may mean creating a huge number of worker threads which causes more
contention than benefit.

Introduce JSII_ROSETTA_MAX_WORKER_COUNT to limit the maximum number of workers
(defaults to 16).

See aws/aws-cdk#14389 for more motivation from the AWS CDK.
mergify bot pushed a commit to aws/jsii that referenced this pull request Apr 27, 2021
The current behavior of rosetta (extract) is to use a number of node worker
threads equal to half the number of CPU cores. On large CI/CD build hardware,
this may mean creating a huge number of worker threads which causes more
contention than benefit.

Introduce `JSII_ROSETTA_MAX_WORKER_COUNT` to limit the maximum number of workers
(defaults to `16`).

See aws/aws-cdk#14389 for more motivation from the AWS CDK.

---

By submitting this pull request, I confirm that my contribution is made under the terms of the [Apache 2.0 license].

[Apache 2.0 license]: https://www.apache.org/licenses/LICENSE-2.0
john-tipper pushed a commit to john-tipper/aws-cdk that referenced this pull request May 10, 2021
…port aws#14389) (aws#14393)

This is an automatic backport of pull request aws#14389 done by [Mergify](https://mergify.io).

---


<details>
<summary>Mergify commands and options</summary>

<br />

More conditions and actions can be found in the [documentation](https://docs.mergify.io/).

You can also trigger Mergify actions by commenting on this pull request:

- `@Mergifyio refresh` will re-evaluate the rules
- `@Mergifyio rebase` will rebase this PR on its base branch
- `@Mergifyio update` will merge the base branch into this PR
- `@Mergifyio backport <destination>` will backport this PR on `<destination>` branch

Additionally, on Mergify [dashboard](https://dashboard.mergify.io/) you can:

- look at your merge queues
- generate the Mergify configuration with the config editor.

Finally, you can contact us on https://mergify.io/
</details>
hollanddd pushed a commit to hollanddd/aws-cdk that referenced this pull request Aug 26, 2021
…port aws#14389) (aws#14393)

This is an automatic backport of pull request aws#14389 done by [Mergify](https://mergify.io).

---


<details>
<summary>Mergify commands and options</summary>

<br />

More conditions and actions can be found in the [documentation](https://docs.mergify.io/).

You can also trigger Mergify actions by commenting on this pull request:

- `@Mergifyio refresh` will re-evaluate the rules
- `@Mergifyio rebase` will rebase this PR on its base branch
- `@Mergifyio update` will merge the base branch into this PR
- `@Mergifyio backport <destination>` will backport this PR on `<destination>` branch

Additionally, on Mergify [dashboard](https://dashboard.mergify.io/) you can:

- look at your merge queues
- generate the Mergify configuration with the config editor.

Finally, you can contact us on https://mergify.io/
</details>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
contribution/core This is a PR that came from AWS.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants