-
-
Notifications
You must be signed in to change notification settings - Fork 719
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
AssertionError: waiting on released dep
when worker-saturation < .inf
#7204
Comments
I think this failure is a red herring. I don't think there's actually any issue here. The test in #7065 is not appropriate to run with queuing on. The test asserts many things that are specific to no-queuing co-assignment. What happens is simply that this assertion fails with queuing on (since there's obviously no last worker):
Then the test hangs for a while because it's trying to clean up the cluster, but the workers' threadpools are all busy, so eventually it just times out. I don't think the validation errors you're seeing here are real problems, I think they're artifacts of an ugly cluster shutdown. I haven't been able to come up with a way to formulate this test into something that would be meaningful with queuing. This test is relying on a couple of bad behaviors:
We already have some queueing tests around paused workers. Maybe we should add one for retiring workers too, just to be safe, though it should be the same code path. But I don't think there's a bug here; instead, we should just make the test in #7065 only run when queuing is turned off. |
By the current definition of rootish, the only way a rootish group could depend on another rootish group would be to have exactly 1 worker with 1 thread:
the above cannot be satisfied for both dependent and dependency already with total_nthreads=2. |
Correct. I'm saying the current definition of rootish is a bit silly because something that is definitively a root task (has no dependencies) is not considered rootish. And something that shouldn't be considered rootish (the |
test_decide_worker_rootish_while_last_worker_is_retiring
(#7065) fails withworker-saturation: 1.5
:The text was updated successfully, but these errors were encountered: