Deadlocks on bootstrapping in distributed settings #109
Comments
I see a deadlock in my trivial test with three nodes [1], while it runs fine with two nodes. Is there anything wrong with my test code? [1] https://github.com/xinhaoyuan/morpheus-app-test/tree/master/swarm_test_simple
I also have the same issue with 3 nodes.
I think this issue is fixed by #118, please reopen if you still get it on master.
It appears the same in my test case (with commit 1430212).
I'm still having the same issue with latest master branch.
reopening for visibility
Hi,
I'm testing the concurrency of swarm using my in-house tool (to be released soon!). I've found several deadlocks when bootstrapping swarm in distributed settings. Here is a sketch of how one of them could happen:
- `node1` and `node2` are both in the `cluster_wait` state
- `node1` gets `cluster_join`, selects `node2` to sync with, and enters the `syncing` state
- `node2` puts the `sync` from `node1` into `pending_sync_req`
- `node2` gets `cluster_join`, selects `node1` to sync with (the `sync` from `node1` is still pending!), and enters the `syncing` state
- `node1` gets the `sync` from `node2` and discovers a tie; it decides to wait for a `sync_reply` from `node2`
- `node1`'s `sync` to `node2` is still pending, so it will never get handled (and thus no `sync_reply` will be sent to `node1`)
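The interleaving described in this report can be replayed with a small toy model. The sketch below is not Swarm's code: the `Node` class, the `tracking` state name, the synchronous message delivery, and the rule that parked requests are only drained once a node leaves `syncing` are all assumptions made for illustration. It only shows why two nodes that pick each other as sync targets can end up waiting on each other forever.

```python
from collections import deque


class Node:
    """Toy model of one node's bootstrap state (hypothetical, not Swarm's tracker)."""

    def __init__(self, name):
        self.name = name
        self.state = "cluster_wait"
        self.sync_target = None          # peer this node sent `sync` to
        self.pending_sync_req = deque()  # `sync` requests parked for later

    def on_cluster_join(self, peer):
        # Pick `peer` as the sync target and send it a `sync` request.
        self.state = "syncing"
        self.sync_target = peer
        peer.on_sync(self)

    def on_sync(self, sender):
        if self.state == "tracking":
            # Assumed normal path: a settled node answers right away.
            sender.on_sync_reply(self)
        elif self.state == "syncing" and self.sync_target is sender:
            # Tie: both nodes chose each other. In this model the node
            # simply keeps waiting for the sender's `sync_reply`.
            pass
        else:
            # Not ready to answer yet: park the request.
            self.pending_sync_req.append(sender)

    def on_sync_reply(self, sender):
        # Assumption: parked requests are only drained after syncing ends.
        self.state = "tracking"
        while self.pending_sync_req:
            self.pending_sync_req.popleft().on_sync_reply(self)


def deadlocked(nodes):
    # Delivery is synchronous here, so no message is ever in flight.
    # If every node is still `syncing`, nobody can make progress.
    return all(n.state == "syncing" for n in nodes)


def run_scenario():
    node1, node2 = Node("node1"), Node("node2")
    node1.on_cluster_join(node2)  # node2 parks node1's `sync`
    node2.on_cluster_join(node1)  # node1 sees the tie and waits
    return node1, node2


if __name__ == "__main__":
    n1, n2 = run_scenario()
    print(n1.state, n2.state, [p.name for p in n2.pending_sync_req])
    print("deadlocked:", deadlocked([n1, n2]))
```

In this model, one way out would be for a node entering `syncing` to first check `pending_sync_req` for a request from its chosen target and answer it; whether that matches the actual fix in Swarm is not something this sketch can confirm.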