Test load balancing - parallelize jest tests

### Specification

Since we are expanding our testing to include other platforms like macos and windows, this creates an issue for VERY large tests that we have in PK.

In PK we are spinning up a test job for each subdirectory under `tests`, this is not efficient under the 10 minute startup routines for windows and mac runners. This is also done via child pipelines, which relies on a script that generates the child pipeline jobs.

The only way to do this would be to do test load balancing instead. This is where we create N number of linux, mac, windows runners, and then distribute tests across the N runners. The N number is determined based a tradeoff between fastest pipeline completion time vs total minutes being used. That is we want highest possible pipeline completion with the lowest total minutes being used. The intersection between the 2 lines is the sweet spot. This would have to be figured out manually & iteratively. The test suite here is very small though, so this should be done on PK instead.

However upon using load balancing, we lose track of individual jobs tackling test directories. Right now we are able to see that only certain areas fail, while other areas succeed. To regain this ability, we have to use the test reporters (which is currently integrated already as `junit`). There are junit options that can be utilised to maximise the utility of these reports:

- `classNameTemplate: '{classname}'` - sets the `suite` column on gitlab to the name of the top-level describe of the test file
- `titleTemplate: '{title}'` - sets the `name` field on gitlab to the name of the test
- `addFileAttribute: 'true'` - adds the filename to the junit reports
- `includeConsoleOutput: 'true'` - includes console output in the report

Our test reporters now can tell us which test/tests are failing without us having to dig through the logs.

But jest is not capturing STDOUT or STDERR anymore.

The alternative is to change our `logger` to use the `console.error` instead, which is intercepted by jest and buffered up and should be shown to the test reporters. The `includeConsoleOutout` option only affects stdout though, so we need a way to work around this.

The buffering causes an issue during development though, it's nice to be able to see the logs immediately when we are debugging test issues, especially for the networking code where the ordering matters.

If we can get jest to unbuffer the logs, or change back to `process.stderr` when the `--ci` flag is not set, then we can see the logs unbuffered when running the tests. Right now the `--ci` flag is set when running the tests in the CI/CD.

Finally within our test runners, because each runner only 1 CPU, we can use `--runInBand` to keep it simple.

### Additional context

* According to this discussion: https://github.com/jest-community/jest-junit/discussions/208, the STDERR and STDOUT isn't being captured by jest anymore, so test reporters aren't able to show them.
* https://gitlab.com/gitlab-org/gitlab/-/issues/201869 - Gitlab themselves have used a custom scheduler to split up their jest tests so that it can be parallelised, see associated issues and the MR that they did
* https://docs.gitlab.com/ee/user/gitlab_com/index.html#gitlab-cicd - CI/CD shared runner limitations
* https://forum.gitlab.com/t/what-are-resource-limits-for-ci-jobs-on-gitlab-com-ram-cpus/29821/4 - discussion claiming that there's only 1 CPU for each job
* https://github.com/MatrixAI/js-polykey/issues/380#issuecomment-1150617925 conditional testing PR for polykey with discussion of jest CLI options including `--shard`
* https://github.com/MatrixAI/js-polykey/issues/380#issuecomment-1150623434 - consider using file size metric for bin-packing on the first time run, then using timing data for subsequent runs

### Tasks

1. Investigate and implement sharding
2. Optimise junit reports to include as much useful info as possible on gitlab


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Test load balancing - parallelize jest tests #58

Specification

Additional context

Tasks

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Test load balancing - parallelize jest tests #58

Description

Specification

Additional context

Tasks

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions