Allow unknown tasks to be stolen #5572

fjetter · 2021-12-08T13:44:02Z

This effectively reverts #5392, which enforced that unknown tasks couldn't be stolen. This was flagged as a regression since a test asserting this behaviour was introduced in the very early stages of stealing development (#278). Back then we didn't have any intermediate runtime information about the task, but this is no longer true, so this behaviour can be revised.

Closes Task stealing regression in 2021-11-0+ (preventing task load balancing) #5564

For testability, I added WorkStealing.start and WorkStealing.stop, since I believe that for proper timing-based tests we should be able to disable the background PC (PeriodicCallback). Otherwise it is very hard to assert specifically what's happening. If preferred, I can factor this out into a dedicated PR.
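To illustrate why a start/stop pair helps timing-based tests, here is a minimal sketch using plain asyncio. The class `PeriodicBalancer` and its `balance` method are hypothetical stand-ins, not the actual WorkStealing implementation; the point is only that once the background loop is stopped, a test can trigger one cycle by hand and assert on exactly that cycle.

```python
import asyncio
from typing import Optional


class PeriodicBalancer:
    """Hypothetical stand-in for an extension that rebalances on a timer."""

    def __init__(self, interval: float = 0.1) -> None:
        self.interval = interval
        self.calls = 0  # number of balance() invocations, used for assertions
        self._task: Optional[asyncio.Task] = None

    def balance(self) -> None:
        # A real extension would move tasks between workers here.
        self.calls += 1

    async def _run(self) -> None:
        # Background loop: call balance() once per interval until cancelled.
        while True:
            await asyncio.sleep(self.interval)
            self.balance()

    async def start(self) -> None:
        if self._task is None:
            self._task = asyncio.create_task(self._run())

    async def stop(self) -> None:
        if self._task is not None:
            self._task.cancel()
            try:
                await self._task
            except asyncio.CancelledError:
                pass
            self._task = None


async def demo() -> int:
    balancer = PeriodicBalancer(interval=0.01)
    await balancer.start()
    await balancer.stop()       # the background loop no longer fires
    before = balancer.calls
    await asyncio.sleep(0.05)   # nothing runs in the background in this window
    assert balancer.calls == before
    balancer.balance()          # the test drives one deterministic cycle
    return balancer.calls - before


if __name__ == "__main__":
    print(asyncio.run(demo()))
```

With the loop stopped, the only balancing that happens is the one the test requests, so assertions no longer race against the timer.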

gjoseph92

This looks good to me. Having start and stop is nice!

gjoseph92 · 2021-12-08T19:01:44Z

distributed/tests/test_steal.py

+
+    steal = s.extensions["stealing"]
+    await steal.stop()
+    lock = await Semaphore(max_leases=1)


Why a Semaphore instead of a Lock?

I had issues with the lock and have had some in the past. It somehow didn't lock. I may have forgotten to await something, idk. fwiw, I think we should throw lock away and replace it with the semaphore implementation.

Lock = lambda: Semaphore(max_leases=1) seems reasonable to me. Nice to have fewer codepaths.

I would do something like the following but that's a matter of taste :)

class Lock(Semaphore):
    def __init__(self, *args, **kwargs):
        super().__init__(*args, **kwargs, max_leases=1)

The semaphore also implements lease timeouts such that it handles dying workers gracefully by releasing the lock.

IIRC, the reason why I didn't do this from the beginning is that there are subtle differences in the API that would cause the Lock API to break. For instance, you need to await a semaphore object before using it, while you don't need to do this with a lock, etc. Also, a lot of the tests are specific about the extension. However, we can probably just delete all Lock-specific tests since the semaphore tests should cover it anyhow.
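As a rough illustration of the "a lock is a semaphore with one lease" idea from this thread, the sketch below uses `asyncio.Semaphore` as a stand-in. This is an assumption made to keep the example self-contained: it does not model the distributed Semaphore's lease timeouts or the need to await the object before use, which are exactly the API differences mentioned above.

```python
import asyncio


class Lock(asyncio.Semaphore):
    """A lock expressed as a semaphore that hands out a single lease."""

    def __init__(self) -> None:
        super().__init__(1)


async def demo() -> list[str]:
    lock = Lock()
    events: list[str] = []

    async def worker(name: str) -> None:
        # Only one worker can hold the single lease at a time.
        async with lock:
            events.append(f"{name} enter")
            await asyncio.sleep(0.01)
            events.append(f"{name} exit")

    await asyncio.gather(worker("a"), worker("b"))
    return events


if __name__ == "__main__":
    print(asyncio.run(demo()))
```

The critical sections never interleave: each worker's "enter" is followed by its own "exit" before the other worker gets in.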

fjetter · 2021-12-09T18:05:40Z

~~There appears to be a genuine test failure with test_allow_tasks_stolen_before_first_completes~~ Fixed

fjetter added 2 commits December 8, 2021 14:39

Allow unknown tasks to be stolen

a349920

Set work stealing interval as part of config

03e2111

fjetter force-pushed the allow_unknown_tasks_stolen branch from 0ddda3e to 03e2111 Compare December 8, 2021 18:55

gjoseph92 approved these changes Dec 8, 2021

Fix stealing teardown

8724408

fix test failures

82e6638

fjetter linked an issue Dec 10, 2021 that may be closed by this pull request

Task stealing regression in 2021-11-0+ (preventing task load balancing) #5564

Closed

fjetter merged commit 96a4cea into dask:main Dec 10, 2021

gjoseph92 mentioned this pull request Dec 16, 2021

workload not balancing during scale up on dask-gateway #5599

Open

fjetter added regression stealing labels Jun 20, 2022

fjetter mentioned this pull request Aug 11, 2022

Root-ish tasks all schedule onto one worker #6573

Closed

crusaderky mentioned this pull request Nov 3, 2022

Replace test_(do_not_)steal_communication_heavy_tasks tests with more robust versions #7243

Merged

2 tasks

hendrikmakait mentioned this pull request Nov 8, 2022

Fix test_balance_expensive_tasks and improve helper functions in test_steal.py #7253

Merged

2 tasks
