We currently run the submodules in the `tests/` directory in separate jobs:

- `benchmarks`
- `runtime`
- `stability`
Each of those jobs has its own test matrix, so we wind up with a large number of sub-jobs for every workflow run.
I'm not convinced that is serving us well today:
- Each job is parallelized using `pytest-xdist` with ten workers. That makes a lot of sense, since the vast majority of the work is done by Coiled clusters and the CI runner mostly sits around waiting for results. However, we aren't getting the full value out of `pytest-xdist` because we run it with `loadscope`, which treats each test module as a single unit. In practice, we don't have enough modules to saturate the xdist workers.
- The different jobs have different test matrices. In particular, `stability` has the largest matrix, testing a few combinations that are not covered elsewhere. I think this is because, at the time of writing, `stability` was the fastest submodule to run, so we could afford the extra matrix entries (is this right @jrbourbeau?). But since Integration tests for spilling #229, the stability tests are quite slow. @hendrikmakait may have ideas to speed them up a bit, but I'm still not convinced it's a good idea to have different test matrices for different submodules.
- There is a lot of code duplication in the workflow YAML for each submodule, which adds cognitive overhead.
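To illustrate the `loadscope` issue: the scheduling unit comes down to the `--dist` flag passed to `pytest-xdist`. A minimal sketch of a workflow step, assuming the step name and worker count shown here are illustrative rather than this repository's actual configuration:

```yaml
# Hypothetical workflow step. With --dist loadscope, whole test modules are
# the unit of scheduling, so a handful of modules cannot keep 10 workers busy.
# The default --dist load balances individual tests across workers instead.
- name: Run tests
  run: pytest -n 10 --dist load tests/
```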
I propose we consolidate the jobs above into a single "test" job, itself parameterized over a consistent test matrix. I doubt this would change the overall runtime much, and it might even speed things up, since the xdist workers would be better able to balance work. It would also simplify maintenance going forward.
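A rough sketch of what the consolidated job could look like. The matrix values, action versions, and worker count below are hypothetical placeholders, not the repository's actual settings:

```yaml
# Hypothetical consolidated workflow: one "test" job parameterized over a
# single shared matrix, running all tests/ submodules in one pytest session
# so xdist can balance work across all of them.
jobs:
  test:
    strategy:
      matrix:
        os: [ubuntu-latest]          # placeholder matrix dimensions
        python-version: ["3.9", "3.10"]
    runs-on: ${{ matrix.os }}
    steps:
      - uses: actions/checkout@v3
      - name: Run all submodules together
        run: pytest -n 10 --dist load tests/
```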