-
Notifications
You must be signed in to change notification settings - Fork 94
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
tests/integration: data store mgr tests unstable #4175
Comments
Have run it twenty times locally and no failure... I will see if there's a more robust way to test the update. |
I'd have to understand how the test harness works at a deeper level to know why it's inconsistent.. i.e. why does this:
Sometimes yield the scheduler with no task proxies? Did the tasks not exist when the hold command was issued? Hard to tell unless I can reproduce the issue somehow.. (the test in question still worked when I commented out the hold) |
I think the same harness yield gets shared amongst the tests.. So perhaps create another harness object (if possible), and see if it occurs again ... Again, easier if I could reproduce.. |
On a side note, this also works:
So I don't think the key lookup is working..
|
Ah, perhaps we have to explicitly get the pool to release tasks, otherwise there is a race condition between the |
I assume this is still an issue? |
I still run into this one, it seems to happen more often with a higher level of parallelism in the tests. Still have no idea why it's happening. |
Was playing with a small change to the integration tests which caused these tests to break badly so took a look in. I think the problem is:
I've addressed (1) on my branch by adding a new fixture that lets us start scheduler without running the main loop to give these tests a fighting chance. I've fixed the resulting failures bar one, which I don't understand in
The test resets a task state to failed, then expects the number of |
Done (1) in #4620, may have helped, however, I still get flaky failures ~50% of the time for:
With high parallelism. |
Well, not sure what to do about (2) other than have tests operate on different workflows, as it's not something that would happen in the wild.. Also I'm not sure the tests could "undo" their changes, |
The data store mgr tests or a subset thereof sometimes fail and sometimes pass.
Some example traceback:
The test file:
The text was updated successfully, but these errors were encountered: