You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I had searched in the DSIP and found no similar DSIP.
Motivation
Right now, there exist some case might cause the task duplicated dispatched.
e.g.
The master dispatch task a to worker A first, but receive a timeout response, this might happen when the worker rpc is busy, then master will select a new worker B and retry the dispatch.
Then there might exist two situations:
The task has been received by worker A, then take will duplicate exist in worker A and worker B, both the two worker will execute the task, a worser case is the task might duplicated in more worker.
The task hasn't been received by worker A, then task will not duplicate executed.
The first situation is not accepted.
Design Detail
In order to solve this, we should change the dispatch logic.
ruanwenjun
changed the title
[DSIP-66] Fix master dispatch task timeout might cause task duplicate running in worker
[DSIP-69] Fix master dispatch task timeout might cause task duplicate running in worker
Aug 28, 2024
Search before asking
Motivation
Right now, there exist some case might cause the task duplicated dispatched.
The master dispatch task a to worker A first, but receive a timeout response, this might happen when the worker rpc is busy, then master will select a new worker B and retry the dispatch.e.g.
Then there might exist two situations:
The first situation is not accepted.
Design Detail
In order to solve this, we should change the dispatch logic.
Compatibility, Deprecation, and Migration Plan
No response
Test Plan
No response
Code of Conduct
The text was updated successfully, but these errors were encountered: