You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When we lose a worker, any time spent partially executing a task is lost. The task transitions back from processing to released on the scheduler, so that it may be executed somewhere else.
When the scheduler receives metrics from the heartbeat (#7666), it normally immediately forgets which worker they come from. It should make an execption for currently-running (#7677) and keep track of which worker they come from.
When a worker dies, it should subtract all currently-running time for the worker and reclassify it as ("execute", <prefix>, "killed-worker", "seconds").
Additionally, it should add to this measure all tasks that were currently executing as of the last received heartbeat, minus those that completed in the meantime, multiplied by the time between the last heartbeat and the worker death.
The text was updated successfully, but these errors were encountered:
When we lose a worker, any time spent partially executing a task is lost. The task transitions back from
processing
toreleased
on the scheduler, so that it may be executed somewhere else.When the scheduler receives metrics from the heartbeat (#7666), it normally immediately forgets which worker they come from. It should make an execption for
currently-running
(#7677) and keep track of which worker they come from.When a worker dies, it should subtract all
currently-running
time for the worker and reclassify it as("execute", <prefix>, "killed-worker", "seconds")
.Additionally, it should add to this measure all tasks that were currently executing as of the last received heartbeat, minus those that completed in the meantime, multiplied by the time between the last heartbeat and the worker death.
The text was updated successfully, but these errors were encountered: