Scheduler should not be considered idle while a client submits new work #8876
Labels
enhancement
Improve existing functionality or make things work better
good expert issue
Clearly described but requires someone extremely familiar with the project to implement successfully
scheduler
Describe the issue:
I have seen several instances where a cluster with an idle timeout shut down because it took an excessive amount of time for the client to submit new work. In these cases, the scheduler should not have shut down because but rather anticipated that new work will arrive shortly.
As far as I can tell, we can address this in two steps:
Scheduler.update_graph
executes. This method is the main entry point for submitting new work to the cluster and it can take a while when encountering large or complex task graphs, resulting in a cluster shutting down while the scheduler is already preparing future work.The text was updated successfully, but these errors were encountered: