[BUG] Snapshotter fail to apply watch when revision is already compacted. #599

ishan16696 · 2023-03-07T06:12:50Z

Describe the bug:

When snapshotter get close due to some error, so it tries to restart the snapshotter by apply the watch on etcd. But due to etcd's auto-compaction etcd might already compacted the revision number say X and when backup-restore tries to apply the watch on revision which is <=X (means revision number is already compacted) this will leads to error in watch connection hence watch channel get close, and backup-restore never able to restart the snapshotter.

Expected behavior:
If snapshotter fail to apply watch when revision is already compacted then it should take a full snapshot to come out of this situation.

How To Reproduce (as minimally and precisely as possible):

Start the etcd and backup-restore.
Let the backup-restore take one full-snapshot and apply watch from revision 2.
Close the backup-restore
Put some dummy data in etcd
Run compaction on etcd using etcdctl compact <Revision no>
Start the backup-restore:

INFO[0019] Applied watch on etcd from revision: 2        actor=snapshotter
WARN[0019] Failed to collect events for first delta snapshot(s): etcdserver: mvcc: required revision has been compacted  actor=backup-restore-server
INFO[0019] Starting the garbage collector...             actor=backup-restore-server
INFO[0019] Starting snapshotter...                       actor=backup-restore-server
INFO[0019] Will take next full snapshot at time: 2023-03-07 20:16:00 +0530 IST  actor=snapshotter
INFO[0019] Starting the Snapshot EventHandler.           actor=snapshotter
INFO[0019] Closing the Snapshotter...                    actor=snapshotter
ERRO[0019] Snapshotter failed with error: watch channel closed  actor=backup-restore-server
INFO[0019] Snapshotter stopped.                          actor=backup-restore-server

Screenshots (if applicable):

Environment (please complete the following information):

Etcd version/commit ID :
Etcd-backup-restore version/commit ID: v0.22.0
Cloud Provider [All/AWS/GCS/ABS/Swift/OSS]: All

Anything else we need to know?:

The text was updated successfully, but these errors were encountered:

ishan16696 · 2023-03-07T06:18:54Z

/assign

ishan16696 added the kind/bug Bug label Mar 7, 2023

gardener-robot assigned ishan16696 Mar 7, 2023

ishan16696 added the priority/1 Priority (lower number equals higher priority) label Mar 7, 2023

This was referenced Mar 7, 2023

Take a full snapshot when backup-restore fails to apply watch if required revision has been compacted. #600

Merged

[Enhancement] Add a test for edge case #601

Open

ishan16696 closed this as completed in #600 Mar 7, 2023

gardener-robot added the status/closed Issue is closed (either delivered or triaged) label Mar 7, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] Snapshotter fail to apply watch when revision is already compacted. #599

[BUG] Snapshotter fail to apply watch when revision is already compacted. #599

ishan16696 commented Mar 7, 2023

ishan16696 commented Mar 7, 2023

[BUG] Snapshotter fail to apply watch when revision is already compacted. #599

[BUG] Snapshotter fail to apply watch when revision is already compacted. #599

Comments

ishan16696 commented Mar 7, 2023

ishan16696 commented Mar 7, 2023