Master service fails when orchestrator is down #203

juexun · 2019-01-15T05:04:49Z

The rebooted node failed to rejoin the cluster with the following messages.
New IP of the rebooted node: 10.233.5.244
Old IP before the reboot: 10.233.5.243
Other nodes, not rebooted: 10.233.0.160 and 10.233.17.75

Rebooted node side

2019/01/15 04:52:56 [INFO] raft: Node at 10.233.5.244:10008 [Follower] entering Follower state (Leader: "")
2019/01/15 04:52:58 [WARN] raft: Heartbeat timeout from "" reached, starting election
2019/01/15 04:52:58 [INFO] raft: Node at 10.233.5.244:10008 [Candidate] entering Candidate state
2019/01/15 04:52:58 [WARN] raft: Remote peer 10.233.0.160:10008 does not have local node 10.233.5.244:10008 as a peer
2019/01/15 04:52:58 [WARN] raft: Remote peer 10.233.17.75:10008 does not have local node 10.233.5.244:10008 as a peer

Existing nodes side

2019/01/15 04:55:55 [DEBUG] raft: Failed to contact 10.233.5.243:10008 in 3m59.07956023s
2019/01/15 04:55:56 [DEBUG] raft: Failed to contact 10.233.5.243:10008 in 3m59.528575654s
2019/01/15 04:55:56 [WARN] raft: Rejecting vote request from 10.233.5.244:10008 since we have a leader: 10.233.17.75:10008

The rebooted node failed to join the existing cluster because:

the IP of the rebooted node had changed
the other nodes kept the old IP of the rebooted node
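
The mismatch can be illustrated with a small model. This is a simplified sketch, not the code of the raft library that produced the logs: it assumes each node stores its peers as fixed `IP:port` strings and never refreshes them. The `Node` class, `addr` helper, and `handle_vote_request` method are hypothetical names used only for illustration. In the real logs the vote is rejected "since we have a leader"; the sketch only captures the stale-address part of the problem.

```python
RAFT_PORT = 10008


def addr(ip: str) -> str:
    """Build the IP:port string used to identify a peer in this model."""
    return f"{ip}:{RAFT_PORT}"


class Node:
    """Toy node that identifies peers purely by a static IP:port string."""

    def __init__(self, ip, peer_ips):
        self.address = addr(ip)
        # Assumption: the peer set is fixed at startup and never refreshed.
        self.peers = {addr(p) for p in peer_ips}

    def handle_vote_request(self, candidate_address):
        """Accept a vote request only from an address in the known peer set."""
        return candidate_address in self.peers


# The two nodes that were not rebooted still list the old IP 10.233.5.243.
existing = [
    Node("10.233.0.160", ["10.233.5.243", "10.233.17.75"]),
    Node("10.233.17.75", ["10.233.5.243", "10.233.0.160"]),
]

# The rebooted node comes back with the new IP 10.233.5.244.
rebooted = Node("10.233.5.244", ["10.233.0.160", "10.233.17.75"])

# Each existing node checks the candidate's address against its stale peer set.
results = {n.address: n.handle_vote_request(rebooted.address) for n in existing}
print(results)
```

Both existing nodes turn the candidate down because `10.233.5.244:10008` is not in their peer sets, while a request from the old address `10.233.5.243:10008` would still be accepted.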

AMecea · 2019-01-15T10:52:26Z

Hi @juexun, this issue is a duplicate of #107. It happens because orchestrator does not refresh the IPs, as you observed. I recommend using only one node for orchestrator until #107 is fixed.

juexun · 2019-01-16T07:41:20Z

@AMecea, thanks.

juexun · 2019-01-16T09:01:48Z

If this issue cannot be fixed, orchestrator will be the SPOF (single point of failure) of the cluster: the MySQL master service becomes unreachable if orchestrator dies.
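
One general way to soften this kind of single point of failure is to keep serving the last known answer while the lookup service is unreachable. The sketch below shows that pattern only; it is not the operator's code and not the change made in any pull request. `MasterResolver`, `OrchestratorUnavailable`, `fake_lookup`, and the host name `mysql-0` are all hypothetical.

```python
class OrchestratorUnavailable(Exception):
    """Raised by the hypothetical lookup when orchestrator cannot be reached."""


class MasterResolver:
    """Resolve the master of a cluster, falling back to the last known value."""

    def __init__(self, lookup):
        self._lookup = lookup      # callable: cluster name -> master host
        self._last_known = {}      # cluster name -> last master seen

    def master_for(self, cluster):
        try:
            master = self._lookup(cluster)
        except OrchestratorUnavailable:
            # Orchestrator is down: reuse the cached master, or None if
            # this cluster has never been resolved successfully.
            return self._last_known.get(cluster)
        self._last_known[cluster] = master
        return master


# Fake lookup that can be switched off to simulate an orchestrator outage.
state = {"up": True}


def fake_lookup(cluster):
    if not state["up"]:
        raise OrchestratorUnavailable(cluster)
    return "mysql-0"


resolver = MasterResolver(fake_lookup)
first = resolver.master_for("demo")    # orchestrator reachable
state["up"] = False
second = resolver.master_for("demo")   # orchestrator down, cached value used
unknown = resolver.master_for("other") # never resolved, nothing to fall back to
```

The trade-off is that a cached master can be stale if a failover happens during the outage, so this keeps the service reachable rather than guaranteeing it points at the right node.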

calind added this to the 0.2.4 milestone Jan 21, 2019

calind added the bug label Jan 21, 2019

calind assigned AMecea Jan 21, 2019

calind changed the title ~~Orchestrator: nodes failed to join cluster after it had been rebooted.~~ Master service fails when orchestrator is down Jan 28, 2019

AMecea mentioned this issue Jan 29, 2019

Keep last known state when orchestrator is down #219

Merged

calind closed this as completed in #219 Feb 1, 2019

chapsuk pushed a commit to chapsuk/mysql-operator that referenced this issue Oct 16, 2023

Add kustomize as a direct go.mod dependency (bitpoke#203)

64db013
