re-implement the task retry state using xtriggers #3423
That was quick!
I'm part way through reviewing this. I think it's all good, but there are a few caveats to think through, as you've pointed out.
Looks good, tests as working. Trying to think through the polling logic question...
bash failures are related to docker and memory, see #3856 for fix
All functional and unit tests passing now. The bash test
(The last 3 commits just modify comments, via the GH UI "commit suggestion" button).
(All tests passing 😁 )
A few typos and pydocs issues. No major issues with rest of the code (unless that change in rundb is a legit issue). I did find a few interesting things in other parts of the code, but they were not in this pull request (which means I've got more post-its on my screen now to investigate later 😬 )
@hjoliver I tested it with Cylc UIServer, and found no exceptions in the logs. With Cylc UI, however, the complex workflow logged that error due to missing parents.
I tested master and the errors didn't occur. But given how little we know about that error, it could be that the same happens on master, but intermittently. I have no idea if complex has anything special about it with regards to the change of retry state in this PR 👍
Thanks!
B
I just checked, it doesn't look like any tasks in the complex suite will retry so it must be a coincidence.
Had a look at the files that GH UI showed as changed since my last review (thanks GH for such handy functionality). Looking good to me, +1
@kinow - the same flaky bash test failed again. I think you can approve and merge this if the functional tests all pass.
(@hjoliver I'm guessing CI is still a bit unstable, but approved since the conflict changed files were only 3, from a PR that I reviewed yesterday; no issues that I could see)
Missed this comment by 23 seconds, while I was typing. But I managed to approve before your comment 😄
(Two approvals 😁 and the bash test fail is a known flaky ... merging)
Wow, did that really happen.
TASK_STATUS_RETRYING
TASK_STATUS_SUBMIT_RETRYING
cylc[_submission]_retry_<task>.<cycle>
cylc namespace for special triggers.
Caveats:
<cyclepoint>/*:retrying)
TASK_STATUS_ORDERED
These changes partially address cylc/cylc-admin#47
Requirements check-list
CONTRIBUTING.md and added my name as a Code Contributor.