A/B test jobs #314

crusaderky · 2022-09-08T09:28:03Z

Partially closes Automation of benchmark comparison #292
Complements A/B test reports #309

This PR adds the ability to run the whole test suite on arbitrary additional coiled software environments.

crusaderky · 2022-09-08T09:31:07Z

AB_environments/AB_sample.dask.yaml

+# Leave empty if you don't want to override anything.
+distributed:
+  scheduler:
+    worker-saturation: 1.2


Notably, if #235 will go through we'll lose the ability to pass environment variables.

The alternative to forcing the developer to create an empty file every time was to ignore the file if it doesn't exist, but the risk of misspellings producing unexpected results was too high. Explicit is better than implicit.

crusaderky · 2022-09-09T09:43:47Z

Ready for final review.
Run with files in AB_environments/: https://github.com/coiled/coiled-runtime/actions/runs/3018905647

crusaderky · 2022-09-09T10:15:12Z

Question:

In the context of A/B performance tests, do we care about the runtime and stability test folders?
CC @ncclementi @ian-r-rose

hayesgb · 2022-09-09T11:05:30Z

I can see a solid argument for skipping the runtime folder, but seems to me it would preferable to keep the stability folder in scope.

fjetter

full disclosure: I haven't reviewed ab_tests in much detail but rather the broad strokes.

My comments are mostly nitpicks. If there are issues or missing features we can iterate.

Looking forward to see this in action!

fjetter · 2022-09-09T09:56:45Z

AB_environments/README.md

+      - dask==2022.8.1
+      - distributed=2022.8.1
+```
+- `AB_environments/AB_baseline.dask.yaml`: (empty file)


Does there need to be an empty file?

Yes. The alternative to forcing the developer to create an empty file every time was to ignore the file if it doesn't exist, but the risk of misspellings quietly resulting in unexpected results was too high. Explicit is better than implicit.

Why not include empty file with appropriate name in repo? That would mean you'd get no custom config as the default, but also make it easier to not accidentally make a new file w/ wrong name (e.g., you'd either edit existing file or you'd use cp your-file AB_[tab] and autocomplete to correct filename).

fjetter · 2022-09-09T11:09:35Z

ci/scripts/dask_config_to_env.py

+import yaml
+
+
+def main(fname: str) -> None:


I believe we're now able to ship dask config using the dask.config ctx manager. That might be a better interface than this. is there a reason why we use env vars instead?

I believe we're now able to ship dask config using the dask.config ctx manager.

Just to be clear, change to ship dask config to cluster is not yet deployed, presumably will go out next week.

? This is news to me.
Are you saying that when you call coiled.create_software_environment it will pick up the local config?
Also, I needed something that could be passed to coiled env create. Ideally one would want to pass it a dask config file directly, but I believe there's no such feature?

Now I'm confused. The feature is that when you create cluster it ships local config. I don't see any connection between dask config and the software environment... am I missing something?

Just to be clear, change to ship dask config to cluster is not yet deployed, presumably will go out next week.

This is exactly what we need, please ping me when it's generally available

Now I'm confused. The feature is that when you create cluster it ships local config. I don't see any connection between dask config and the software environment... am I missing something?

The script is currently calling coiled env create -e DASK_DISTRIBUTED_...=VALUE

The script is currently calling coiled env create -e DASK_DISTRIBUTED_...=VALUE

I could be wrong but I think that just applies when creating the software environment and not when you make a cluster using that software environment.

It works fine for me?

import coiled import dask.config import distributed !coiled env create --name crusaderky/test_vars --conda AB_environments/AB_baseline.conda.yaml -e DASK_TESTVAR=123 cluster = coiled.Cluster(name="test_vars", n_workers=0, software="crusaderky/test_vars") client = distributed.Client(cluster) client.run_on_scheduler(lambda: dask.config.get("testvar"))

123

Oh, thanks, good to know!

crusaderky self-assigned this Sep 8, 2022

crusaderky marked this pull request as draft September 8, 2022 09:29

crusaderky mentioned this pull request Sep 8, 2022

[WIP] Run test suite on arbitrary coiled environments #296

Closed

crusaderky commented Sep 8, 2022

View reviewed changes

crusaderky changed the title ~~A/B testing jobs~~ A/B test jobs Sep 8, 2022

crusaderky mentioned this pull request Sep 8, 2022

A/B test reports #309

Merged

crusaderky force-pushed the ab_testing branch from 042b39d to f32e100 Compare September 8, 2022 17:29

crusaderky closed this Sep 8, 2022

crusaderky reopened this Sep 8, 2022

crusaderky force-pushed the ab_testing branch 2 times, most recently from ff71818 to b008604 Compare September 9, 2022 09:41

crusaderky marked this pull request as ready for review September 9, 2022 09:43

crusaderky requested review from ian-r-rose, jrbourbeau, ncclementi, fjetter, gjoseph92 and hendrikmakait September 9, 2022 09:46

Run test suite on arbitrary coiled environments

88b3a4e

crusaderky force-pushed the ab_testing branch from b008604 to 88b3a4e Compare September 9, 2022 10:13

fjetter approved these changes Sep 9, 2022

View reviewed changes

crusaderky added 2 commits September 9, 2022 15:40

Merge branch 'main' into ab_testing

d6de2c1

Merge #309

8fcb346

crusaderky merged commit a08904c into main Sep 9, 2022

crusaderky deleted the ab_testing branch September 9, 2022 14:42

hendrikmakait pushed a commit that referenced this pull request Sep 14, 2022

A/B test jobs (#314)

8c8fa38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

A/B test jobs #314

A/B test jobs #314

crusaderky commented Sep 8, 2022 •

edited

Loading

crusaderky Sep 8, 2022

crusaderky commented Sep 9, 2022

crusaderky commented Sep 9, 2022

hayesgb commented Sep 9, 2022

fjetter left a comment

fjetter Sep 9, 2022

crusaderky Sep 9, 2022

ntabris Sep 9, 2022

fjetter Sep 9, 2022

ntabris Sep 9, 2022

crusaderky Sep 9, 2022

ntabris Sep 9, 2022

crusaderky Sep 9, 2022

crusaderky Sep 9, 2022

ntabris Sep 9, 2022

crusaderky Sep 11, 2022

ntabris Sep 12, 2022

A/B test jobs #314

A/B test jobs #314

Conversation

crusaderky commented Sep 8, 2022 • edited Loading

Choose a reason for hiding this comment

crusaderky commented Sep 9, 2022

crusaderky commented Sep 9, 2022

Question:

hayesgb commented Sep 9, 2022

fjetter left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

crusaderky commented Sep 8, 2022 •

edited

Loading