Task Manager Resiliency: Rebuild the submit and job queues when the task manager comes back up. #674

Closed
aquan9 opened this issue Apr 10, 2023 · 2 comments

aquan9 commented Apr 10, 2023

Pieces broken up from #614


jtronge commented Jan 16, 2024

Does this already happen on restart? It looks like the task manager is just loading up the old database file, which should still have the submit and job queue data, unless the whole database gets deleted.
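
To illustrate the point above, here is a minimal sketch of a queue backed by a database file, assuming SQLite storage. The `PersistentQueue` class, the table names `submit_queue` and `job_queue`, and the schema are all hypothetical and are not taken from the task manager's actual code. The sketch only shows why reopening the same file after a restart would bring the queued entries back without an explicit rebuild step.

```python
import sqlite3


class PersistentQueue:
    """FIFO queue of task names kept in one SQLite table (hypothetical schema)."""

    # Only these table names are accepted, so the name is safe to
    # interpolate into the SQL statements below.
    ALLOWED = ("submit_queue", "job_queue")

    def __init__(self, db_path, name):
        if name not in self.ALLOWED:
            raise ValueError(f"unknown queue name: {name}")
        self.name = name
        self.conn = sqlite3.connect(db_path)
        # Creating the table only if it is missing means an existing
        # database file is reused as-is after a restart.
        self.conn.execute(
            f"CREATE TABLE IF NOT EXISTS {name} "
            "(id INTEGER PRIMARY KEY AUTOINCREMENT, task TEXT NOT NULL)"
        )
        self.conn.commit()

    def push(self, task):
        """Append a task and commit, so it survives a crash after this call."""
        self.conn.execute(f"INSERT INTO {self.name} (task) VALUES (?)", (task,))
        self.conn.commit()

    def pop(self):
        """Remove and return the oldest task, or None if the queue is empty."""
        row = self.conn.execute(
            f"SELECT id, task FROM {self.name} ORDER BY id LIMIT 1"
        ).fetchone()
        if row is None:
            return None
        self.conn.execute(f"DELETE FROM {self.name} WHERE id = ?", (row[0],))
        self.conn.commit()
        return row[1]

    def items(self):
        """Return all queued tasks, oldest first, without removing them."""
        cursor = self.conn.execute(f"SELECT task FROM {self.name} ORDER BY id")
        return [row[0] for row in cursor]

    def close(self):
        self.conn.close()
```

Under these assumptions, creating a second `PersistentQueue` on the same path after the first one is closed (or its process dies) sees the same entries in the same order. The data would only be lost if the database file itself were deleted, which matches the caveat in the comment.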

jtronge self-assigned this Jan 24, 2024

jtronge commented Jan 30, 2024

I did some testing with examples/clamr-ffmpeg-build and it looks like the task manager is able to recover the submit/job queues after failures and successfully update task states. I'm going to close this for now.

Projects
Status: Done