Ensure shuffle split default durations uses proper prefix #4991

Merged 1 commit on Jun 30, 2021

Conversation

@fjetter (Member) commented on Jun 29, 2021

The test is a bit heavy considering that I simply want to verify that a change to the shuffle split prefix name doesn't go unnoticed, but it works and still runs fast, so I'm fine with it 😬

default_time = parse_timedelta(
    dask.config.get("distributed.scheduler.default-task-durations")[split_prefix]
)
assert default_time <= 1e-6

@fjetter (Member, Author) commented:

I cannot assert on the actual known runtime since, at this point, it has already been measured. I want to ensure that the actual config value is correct. There are other tests ensuring that these config values are used if they exist.
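
For context, a rough sketch of what that lookup amounts to; `dask.config.get` and `dask.utils.parse_timedelta` are the real APIs, but the "split-shuffle" key shown here is only an assumed example of a prefix entry (check distributed.yaml for the exact names):

```python
import distributed  # noqa: F401  importing registers distributed's config defaults
import dask
from dask.utils import parse_timedelta

# The scheduler config maps task-name prefixes to default duration estimates,
# used before any real runtime for that prefix has been measured.
durations = dask.config.get("distributed.scheduler.default-task-durations")

# "split-shuffle" is an assumed example key; the point of the PR is that the
# key must match the prefix dask actually generates for shuffle split tasks.
default_time = parse_timedelta(durations.get("split-shuffle", "1us"))
print(default_time)  # parse_timedelta("1us") == 1e-06 seconds
```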

@@ -1845,6 +1845,37 @@ async def test_get_task_duration(c, s, a, b):
assert len(s.unknown_durations["slowinc"]) == 1


@gen_cluster(client=True)
async def test_default_task_duration_splits(c, s, a, b):

A reviewer (Member) commented on the diff:

Hmm, this test seems to raise TimeoutErrors in a few CI builds.

@fjetter (Member, Author) replied:

I'm pretty sure this is connected to the improper cluster shutdowns I've been observing recently. This uses almost identical code to the stealing test, except that I'm not awaiting the computation here. I'll add a wait and hope the timeouts are gone afterwards.
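
For illustration, a minimal sketch of how such a test could await the shuffle before inspecting the scheduler. `gen_cluster`, `dask.dataframe`, and `Scheduler.task_prefixes` are real APIs, but the test body below is an assumption, not the PR's actual code:

```python
import pandas as pd

import dask.dataframe as dd
from distributed.utils_test import gen_cluster


@gen_cluster(client=True)
async def test_default_task_duration_splits(c, s, a, b):
    # Hypothetical sketch: run a small task-based shuffle on the test cluster.
    pdf = pd.DataFrame({"x": range(100), "y": range(100)})
    ddf = dd.from_pandas(pdf, npartitions=10)
    shuffled = ddf.shuffle("x", shuffle="tasks")

    # Awaiting the computation (the fix mentioned above) keeps the test from
    # tearing the cluster down while shuffle tasks are still in flight.
    await c.compute(shuffled)

    # The scheduler tracks one TaskPrefix per task-name prefix; the shuffle's
    # split tasks should show up under a prefix containing "split".
    assert any("split" in prefix for prefix in s.task_prefixes)
```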

@fjetter (Member, Author) commented on Jun 30, 2021

I had one green-ish build after the last commit. I retriggered again to see if the test causes trouble. For reference, the one failure I had before was test_statistical_profiling, which is likely unrelated.

@mrocklin (Member) commented:

One CI failure. Maybe ok?

@fjetter (Member, Author) commented on Jun 30, 2021

This CI failure is #4859

I would declare the test "stable enough" after two reruns and go ahead with merging this.

@jrbourbeau (Member) left a review comment:

Thanks @fjetter!
