
Cluster panics when duplicate node url is encountered #2279

Closed
cboggs opened this issue Apr 14, 2015 · 3 comments

cboggs commented Apr 14, 2015

Steps to reproduce:

  1. Spin up a 3-node cluster with Node1 as leader and Node2/Node3 joining Node1 on initial startup
  2. Shut down Node1
  3. On Node1, rm -rf /var/opt/influxdb/*
  4. Edit Node1's config to join Node2 and Node3 on next startup
  5. Start Node1
  6. Observe a cluster-wide panic
  7. Restart InfluxDB on Node2 and Node3 - these two nodes seem to start correctly as long as Node1 stays offline

Here's what I see in the logs on Node2 and Node3:

[raft] 2015/04/14 15:30:04 apply: add node: duplicate node url
panic: apply: add node: duplicate node url

goroutine 8 [running]:
log.(*Logger).Panicf(0xc20800a730, 0x9ed850, 0x13, 0xc2083f3d58, 0x1, 0x1)
    /root/.gvm/gos/go1.4.2/src/log/log.go:200 +0xd1
github.com/influxdb/influxdb/raft.(*Log).mustApplyAddPeer(0xc20803a820, 0xc208392c90)
    /root/.gvm/pkgsets/go1.4.2/global/src/github.com/influxdb/influxdb/raft/log.go:1518 +0x2ce
github.com/influxdb/influxdb/raft.(*Log).applyNextUnappliedEntry(0xc20803a820, 0xc20800c6c0, 0x0, 0x0)
    /root/.gvm/pkgsets/go1.4.2/global/src/github.com/influxdb/influxdb/raft/log.go:1436 +0x752
github.com/influxdb/influxdb/raft.(*Log).applier(0xc20803a820, 0xc20800c6c0)
    /root/.gvm/pkgsets/go1.4.2/global/src/github.com/influxdb/influxdb/raft/log.go:1382 +0x161
created by github.com/influxdb/influxdb/raft.func·002
    /root/.gvm/pkgsets/go1.4.2/global/src/github.com/influxdb/influxdb/raft/log.go:389 +0x764
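
For anyone reading the trace: the applier goroutine escalates the duplicate-URL error to log.(*Logger).Panicf, so every member that replays the offending addPeer entry crashes, not just the re-joining node. Below is a minimal, self-contained Go sketch of that failure mode; the Peer and Log types and the body of mustApplyAddPeer are assumptions for illustration, not the actual influxdb/raft source.

package main

// A minimal sketch of the failure mode above. Only the Panicf escalation
// mirrors the trace; everything else is a hypothetical stand-in.

import (
	"log"
	"os"
)

type Peer struct {
	URL string // advertised node URL; must be unique in the cluster config
}

type Log struct {
	Logger *log.Logger
	peers  []Peer
}

// mustApplyAddPeer mimics raft/log.go:1518 in the trace: a duplicate URL
// is escalated to Logger.Panicf, which panics and takes the process down
// instead of rejecting the join.
func (l *Log) mustApplyAddPeer(p Peer) {
	for _, existing := range l.peers {
		if existing.URL == p.URL {
			// Logs "[raft] ... apply: add node: duplicate node url",
			// then panics with the same message.
			l.Logger.Panicf("apply: add node: %s", "duplicate node url")
		}
	}
	l.peers = append(l.peers, p)
}

func main() {
	l := &Log{Logger: log.New(os.Stderr, "[raft] ", log.LstdFlags)}
	l.mustApplyAddPeer(Peer{URL: "http://node1:8090"}) // first join succeeds
	// Node1 wiped its data dir and re-joins under the same URL:
	l.mustApplyAddPeer(Peer{URL: "http://node1:8090"}) // panics cluster-wide
}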

cboggs commented Apr 14, 2015

Deleting the 'failed' node via the API succeeds, and the dead server no longer shows up in "show servers" queries, but trying to start that node again causes the panic all the same.

However, deleting the server via the API before breaking it (by deleting the data dirs) actually allows the node to rejoin the cluster. Awesome!

Just need to iron out the "panic when a node dies before it is explicitly removed from the cluster" behavior, I think.
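
If it helps, here is a sketch of one direction that would iron that out (hypothetical, not a proposed patch): have the apply path return an error for a duplicate URL, so the join can be rejected at the API boundary while the existing members keep running. The types reuse the hypothetical shapes from the sketch above.

package main

import (
	"errors"
	"fmt"
)

type Peer struct{ URL string }

type Log struct{ peers []Peer }

// ErrDuplicateNodeURL can be surfaced to the joining node as an API error
// instead of crashing every member that applies the entry.
var ErrDuplicateNodeURL = errors.New("duplicate node url")

// applyAddPeer rejects a duplicate URL with an error; only the node
// attempting to re-join sees the failure.
func (l *Log) applyAddPeer(p Peer) error {
	for _, existing := range l.peers {
		if existing.URL == p.URL {
			return fmt.Errorf("apply: add node: %v", ErrDuplicateNodeURL)
		}
	}
	l.peers = append(l.peers, p)
	return nil
}

func main() {
	l := &Log{}
	fmt.Println(l.applyAddPeer(Peer{URL: "http://node1:8090"})) // <nil>
	fmt.Println(l.applyAddPeer(Peer{URL: "http://node1:8090"})) // apply: add node: duplicate node url
}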


jwilder commented Apr 14, 2015

Related: #1471, #1472

@beckettsean added this to the 0.9.0 milestone Apr 14, 2015
@jwilder self-assigned this May 1, 2015
@toddboom modified the milestones: 0.9.0, 0.9.1 May 8, 2015
@toddboom modified the milestones: 0.9.1, 0.9.2 Jun 5, 2015

otoolep commented Jun 9, 2015

No longer applicable in the new design.

@otoolep closed this as completed Jun 9, 2015