Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

delete master Tablet lead to Tablet crash #7261

Closed
xhh1989 opened this issue Jan 7, 2021 · 4 comments · Fixed by #7563
Closed

delete master Tablet lead to Tablet crash #7261

xhh1989 opened this issue Jan 7, 2021 · 4 comments · Fixed by #7563

Comments

@xhh1989
Copy link
Contributor

xhh1989 commented Jan 7, 2021

image

I0104 14:49:15.547715 151 shard_watcher.go:56] Stopping shard watch...
I0104 14:49:15.547810 151 shard_watcher.go:62] Shard watch stopped.
panic: runtime error: invalid memory address or nil pointer dereference
[signal SIGSEGV: segmentation violation code=0x1 addr=0x8 pc=0xbcf711]
goroutine 323 [running]:
vitess.io/vitess/go/vt/topo.(*Server).GetTablet(0xc0002163c0, 0x2778420, 0xc002e276e0, 0x0, 0x0, 0x0, 0x0)
/vt/src/vitess.io/vitess/go/vt/topo/tablet.go:227 +0x51
vitess.io/vitess/go/vt/vttablet/tabletmanager.(*TabletManager).setMasterLocked(0xc0003b2750, 0x2778420, 0xc002e276e0, 0x0, 0x0, 0x0, 0x0, 0xc000e08301, 0x271e760, 0xc00028eaf0)
/vt/src/vitess.io/vitess/go/vt/vttablet/tabletmanager/rpc_replication.go:564 +0x200
vitess.io/vitess/go/vt/vttablet/tabletmanager.(*TabletManager).SetMaster(0xc0003b2750, 0x2778420, 0xc002e276e0, 0x0, 0x0, 0x0, 0x0, 0xc0013e0101, 0x0, 0x0)
/vt/src/vitess.io/vitess/go/vt/vttablet/tabletmanager/rpc_replication.go:482 +0x11d
vitess.io/vitess/go/vt/vttablet/tabletmanager.(*TabletManager).abortMasterTerm(0xc0003b2750, 0x27783a0, 0xc000227480, 0x0, 0x0, 0x0)
/vt/src/vitess.io/vitess/go/vt/vttablet/tabletmanager/shard_sync.go:228 +0x65b
vitess.io/vitess/go/vt/vttablet/tabletmanager.(*TabletManager).shardSyncLoop(0xc0003b2750, 0x27783a0, 0xc000227480, 0xc0003f27e0, 0xc0003f28a0)
/vt/src/vitess.io/vitess/go/vt/vttablet/tabletmanager/shard_sync.go:117 +0x632
created by vitess.io/vitess/go/vt/vttablet/tabletmanager.(*TabletManager).startShardSync
/vt/src/vitess.io/vitess/go/vt/vttablet/tabletmanager/shard_sync.go:247 +0x173
@deepthi
Copy link
Member

deepthi commented Jan 7, 2021

Out of curiosity, why are you deleting master tablet from topo?

@xhh1989
Copy link
Contributor Author

xhh1989 commented Jan 7, 2021

Out of curiosity, why are you deleting master tablet from topo?

yes, delete master Tablet while it is serving

@biostone
Copy link

biostone commented Jan 7, 2021

Out of curiosity, why are you deleting master tablet from topo?

To simulate a software/hardware failure maybe?

@sougou
Copy link
Contributor

sougou commented Jan 8, 2021

Slack thread for context: https://vitess.slack.com/archives/C0PQY0PTK/p1609748160463200

I think the TL:DR; is that we should just gracefully exit if we discover that the tablet record is gone. And also extending this rule, we should exit if we detect that a different vttablet has taken over the record.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging a pull request may close this issue.

5 participants