Replace `test_(do_not_)steal_communication_heavy_tasks` tests with more robust versions #7243

hendrikmakait · 2022-11-02T16:18:02Z

This PR drops the flaky test_steal_communication_heavy_tasks since it contradicts test_do_not_steal_communication_heavy_tasks and should be outdated after the latest changes to work-stealing.

Tests added / passed
Passes pre-commit run --all-files

github-actions · 2022-11-02T17:58:27Z

Unit Test Results

See test report for an extended history of previous test failures. This is useful for diagnosing flaky tests.

      15 files ±  0       15 suites ±0 6h 19m 22s ⏱️ - 17m 42s
  3 172 tests +  3   3 088 ✔️ +  3   83 💤 ±0 1 ❌ ±0
23 472 runs +24 22 569 ✔️ +25 900 💤 - 1 3 ❌ ±0

For more details on these failures, see this check.

Results for commit 2510468. ± Comparison against base commit aa1c6d8.

♻️ This comment has been updated with latest results.

fjetter · 2022-11-03T10:14:35Z

If they are contradicting tests, why is it flaky? Shouldn't it reliably fail? This still sounds like there is a bug somewhere

hendrikmakait · 2022-11-03T10:35:43Z

The entire test makes fairly little sense at the moment. I think it should have been dropped in #7075 and yet I missed it. We could spend time cleaning up the test, but the way it is written right now, I think it is prone to a bunch of race conditions and does not test anything useful. Also, any useful behavior should be tested in other places (e.g. test_do_not_steal_communication_heavy_tasks, test_steal_expensive_data_slow_computation)

crusaderky · 2022-11-03T12:32:04Z

distributed/tests/test_steal.py

@@ -825,44 +825,6 @@ def block_reduce(x, y, event):
    assert not b.data


test_do_not_steal_communication_heavy_tasks doesn't look right.

Firstly, there's a race condition. The test waits for x and y to enter running state on the worker, polling every 100ms.
The first iteration of while not a.state.tasks will always fail (because we never yielded the event loop since submit).
The second iteration, 100ms later, will most of the time find that x and y are in memory and two block_reduce tasks are executing. However, nothing guarantees it, so when you call steal.balance() on the next line you might have x or y still running, or in memory but without the scheduler knowing yet. Unlikely, but possible.

The test implicitly relies on tasks of unknown duration being stolen (#5572). It should be changed not to rely on this specific use case.

Finally, why bother calling steal.stop()? If the block_reduce tasks can't be stolen, it should be inconsequential.

Overhauled test in #7250. This PR can be merged as is.
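
The race described above can be illustrated without a cluster. The following is a minimal sketch in plain asyncio, not the test from this PR: `run_task`, `wait_for_state_toy` and the `states` dict are hypothetical stand-ins for a worker's task table. It contrasts a loop that exits as soon as anything is registered with a wait on the exact state the test depends on.

```python
import asyncio


async def run_task(states: dict, key: str, duration: float) -> None:
    """Toy stand-in for a worker executing one task (not distributed's API)."""
    states[key] = "executing"
    await asyncio.sleep(duration)
    states[key] = "memory"


async def wait_for_state_toy(
    states: dict, key: str, state: str, interval: float = 0.005
) -> None:
    """Hypothetical helper: return only once `key` has reached `state`."""
    while states.get(key) != state:
        await asyncio.sleep(interval)


async def demo() -> tuple:
    states: dict = {}
    jobs = [asyncio.create_task(run_task(states, k, 0.05)) for k in ("x", "y")]

    # Racy condition: becomes true as soon as *any* task is registered,
    # regardless of whether x and y have finished.
    while not states:
        await asyncio.sleep(0.005)
    racy_snapshot = dict(states)

    # Explicit condition: wait for the exact state the test depends on.
    await wait_for_state_toy(states, "x", "memory")
    await wait_for_state_toy(states, "y", "memory")
    explicit_snapshot = dict(states)

    await asyncio.gather(*jobs)
    return racy_snapshot, explicit_snapshot


racy, explicit = asyncio.run(demo())
print("after racy loop:    ", racy)
print("after explicit wait:", explicit)
```

In this toy setup the racy loop exits while both tasks are still executing. In a real cluster the outcome depends on timing, which is why the condition cannot be relied upon.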

crusaderky · 2022-11-03T12:32:54Z

> If they are contradicting tests, why is it flaky? Shouldn't it reliably fail? This still sounds like there is a bug somewhere

It doesn't fail because it doesn't actually test where the tasks end up being executed.
Unsure why it's flaky - @hendrikmakait any insight? Do you have a log?

hendrikmakait · 2022-11-03T13:53:24Z

> Unsure why it's flaky - @hendrikmakait any insight? Do you have a log?

I've run it a couple thousand times locally without being able to reproduce the flake. Since the implementation of test_steal_communication_heavy_tasks does not do anything useful (and is weird), I would not want to spend more time debugging it.

> test_do_not_steal_communication_heavy_tasks doesn't look right.

You have a good point there. I have dropped that one as well and added three other tests that should cover the functionality that we initially wanted to test here. I like your use of wait_for_state in #7250 and will create a follow-up PR that introduces it to the assert_balance and _run_dependency_test helpers to get rid of the awkward looping.

> Finally, why bother calling steal.stop()? If the block_reduce tasks can't be stolen, it should be inconsequential.

steal.stop() functions as a glorified wait() that blocks until all stealing requests have been dealt with and are no longer in_flight.
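
To make the "glorified wait()" idea concrete, here is a toy model in plain asyncio. It is only a sketch under stated assumptions: `ToyStealing`, `request` and `_confirm` are hypothetical names, and nothing here reproduces the actual work-stealing extension. It only mirrors the behavior described above, namely that `stop()` returns once no request is left in `in_flight`.

```python
import asyncio


class ToyStealing:
    """Toy model only; not the actual work-stealing extension."""

    def __init__(self) -> None:
        self.in_flight: dict = {}
        self._idle = asyncio.Event()
        self._idle.set()  # nothing in flight initially

    def request(self, key: str, delay: float) -> None:
        """Register a steal request that is confirmed after `delay` seconds."""
        self._idle.clear()
        self.in_flight[key] = asyncio.create_task(self._confirm(key, delay))

    async def _confirm(self, key: str, delay: float) -> None:
        await asyncio.sleep(delay)  # stands in for the worker's response
        del self.in_flight[key]
        if not self.in_flight:
            self._idle.set()

    async def stop(self) -> None:
        """Block until every outstanding request has been dealt with."""
        await self._idle.wait()


async def demo() -> tuple:
    steal = ToyStealing()
    steal.request("task-1", 0.02)
    steal.request("task-2", 0.04)
    pending_before = len(steal.in_flight)
    await steal.stop()
    return pending_before, len(steal.in_flight)


before, after = asyncio.run(demo())
print(before, after)
```

In a test, awaiting such a barrier before asserting on task placement ensures that no steal is still pending when the assertion runs.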

crusaderky · 2022-11-08T13:08:38Z

distributed/tests/test_steal.py

@@ -1446,6 +1446,57 @@ def func(*args):
    assert (ntasks_per_worker < ideal * 1.5).all(), (ideal, ntasks_per_worker)


+def test_balance_steal_communication_heavy_tasks():
+    dependencies = {"a": 10, "b": 10}


If I change this line to {"a": 1e-6, "b": 1e-6} the test remains green, and if I go lower than that I instead get "ValueError: Expected a value larger than 16 integer but got 10."
Are we sure we're actually testing anything here?

This test breaks if we double the cost of the dependencies to {"a": 20, "b": 20}, as the tasks have become too expensive to move in this setup. By reducing the cost of the dependencies, we are checking whether we would move tasks that are cheap to move, which is the desired behavior. Reducing the cost below 1e-6 creates trouble since we are calculating the actual size of the dependencies as the product of their specified cost and the available bandwidth.
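
The size arithmetic can be worked through in a few lines. This is a sketch, not the helper used in test_steal.py: `dependency_nbytes` is a hypothetical function, the bandwidth of 100 MB/s is an assumption (the thread does not state it), and the 16-byte minimum is taken from the error message quoted above. Under these assumptions, a cost of 1e-7 yields 10 bytes, which is consistent with the "but got 10" in that error.

```python
BANDWIDTH = 100_000_000  # assumed bytes per second; not stated in the thread
MIN_NBYTES = 16  # lower bound taken from the quoted error message


def dependency_nbytes(cost: float, bandwidth: int = BANDWIDTH) -> int:
    """Hypothetical helper: size of a dependency as cost times bandwidth."""
    # round() rather than int() so floating-point error cannot truncate,
    # for example, 99.999... down to 99.
    nbytes = round(cost * bandwidth)
    if nbytes < MIN_NBYTES:
        raise ValueError(
            f"Expected a value larger than {MIN_NBYTES} integer but got {nbytes}."
        )
    return nbytes


for cost in (10, 1e-6, 1e-7):
    try:
        print(cost, "->", dependency_nbytes(cost), "bytes")
    except ValueError as exc:
        print(cost, "->", exc)
```

With these numbers, any cost at or below 1e-7 falls under the minimum size, so the range of costs that can be tested has a hard floor that is unrelated to the stealing logic itself.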

crusaderky · 2022-11-08T13:09:34Z

I think I was too hasty in approving.

See comment above
You forgot to actually delete test_steal_communication_heavy_tasks
Isn't test_do_not_steal_communication_heavy_tasks redundant now?

hendrikmakait · 2022-11-08T13:22:48Z

> You forgot to actually delete test_steal_communication_heavy_tasks
>
> Isn't test_do_not_steal_communication_heavy_tasks redundant now?

Yikes, looks like merging main went wrong and brought those back in, sorry for missing that! See #7269 for a follow-up that fixes that.

Drop test_steal_communication_heavy_tasks since it contradicts test_d…

f1b50eb

…o_not_steal_communication_heavy_tasks

hendrikmakait self-assigned this Nov 2, 2022

crusaderky reviewed Nov 3, 2022

crusaderky mentioned this pull request Nov 3, 2022

Review test_do_not_steal_communication_heavy_tasks #7250

Merged

crusaderky approved these changes Nov 3, 2022

Write new tests for desired functionality under test

bd5ee26

hendrikmakait changed the title ~~Drop test_steal_communication_heavy_tasks since it contradicts test_d…~~ Replace test_(do_not_)steal_communication_heavy_tasks tests with more robust versions Nov 3, 2022

Merge branch 'main' into drop-test_steal_communication_heavy_tasks

2510468

hendrikmakait requested a review from crusaderky November 3, 2022 17:09

crusaderky approved these changes Nov 8, 2022

crusaderky merged commit d88d175 into dask:main Nov 8, 2022

crusaderky reviewed Nov 8, 2022

hendrikmakait mentioned this pull request Nov 8, 2022

Drop test_(do_not_)steal_communication_heavy_tasks #7269

Merged

2 tasks

This was referenced Nov 8, 2022

Fix test_balance_expensive_tasks and improve helper functions in test_steal.py #7253

Merged

Improved test for balancing expensive tasks #7272

Merged
