Fix Index Deletion during Snapshot Finalization #50202

original-brownbear · 2019-12-14T11:08:24Z

With #45689 making it so that index metadata is written
after all shards have been snapshotted we can't delete indices
that are part of the upcoming snapshot finalization any longer
and it is not sufficient to check if all shards of an index have been
snapshotted before deciding that it is safe to delete it.
This change forbids deleting any index that is in the process of being
snapshot to avoid issues during snapshot finalization.

Relates #50200 (doesn't fully fix yet because we're not fixing the partial=true snapshot case here

With elastic#45689 making it so that index metadata is written after all shards have been snapshotted we can't delete indices that are part of the upcoming snapshot finalization any longer and it is not sufficient to check if all shards of an index have been snapshotted before deciding that it is safe to delete it. This change forbids deleting any index that is in the process of being snapshot to avoid issues during snapshot finalization. Closes elastic#50200

elasticmachine · 2019-12-14T11:08:26Z

Pinging @elastic/es-distributed (:Distributed/Snapshot/Restore)

original-brownbear · 2019-12-14T11:13:01Z

server/src/main/java/org/elasticsearch/snapshots/SnapshotsService.java

-                            indices.add(index);
-                        }
-                    }
+            for (IndexId index : entry.indices()) {


Admittedly this makes it a little "harder" to delete an index, but I don't see it as much of an issue relative to the complication it saves. If we don't do it this way, we'd have to add another step to write index metadata per index (once all the shards for that index have sucessfull been snapshotted) to the state machine which doesn't seem worth it?

👍 I don't think this weakens anything meaningful, since the order in which indices are snapshotted isn't specified. The existing behaviour seems overly heroic.

DaveCTurner

LGTM

DaveCTurner · 2019-12-16T07:55:27Z

server/src/main/java/org/elasticsearch/snapshots/SnapshotsService.java

-                            indices.add(index);
-                        }
-                    }
+            for (IndexId index : entry.indices()) {


👍 I don't think this weakens anything meaningful, since the order in which indices are snapshotted isn't specified. The existing behaviour seems overly heroic.

ywelsch

I think we should not do this. The reason this functionality is in place (allowing deletes during partial snapshots) is that it allows background snapshots while not interfering with user-level actions such as deletes. Note that we discussed this at the time here: #16321

original-brownbear · 2019-12-16T08:27:21Z

@ywelsch makes sense. Are you ok with this behavior for non-partial snapshots though?
If so, I would just fix the finalization of partial snapshots in such a way that missing metadata in the end is just handled as shard failures for the deleted indices. WDYT?

ywelsch · 2019-12-16T08:32:40Z

Are you ok with this behavior for non-partial snapshots though?

yes, I think that's ok, at least to get a quick fix out.

Ugh I missed the vital check for partiality.

ywelsch

We'll address partial snapshots in a follow-up. LGTM

server/src/main/java/org/elasticsearch/snapshots/SnapshotsService.java

original-brownbear · 2019-12-16T11:41:02Z

@ywelsch Thanks! 7.5.1 here now right?

ywelsch · 2019-12-16T11:46:03Z

Let's backport to 7.5 branch for now, and have a separate discussion whether it makes it to 7.5.1

With elastic#45689 making it so that index metadata is written after all shards have been snapshotted we can't delete indices that are part of the upcoming snapshot finalization any longer and it is not sufficient to check if all shards of an index have been snapshotted before deciding that it is safe to delete it. This change forbids deleting any index that is in the process of being snapshot to avoid issues during snapshot finalization. Relates elastic#50200 (doesn't fully fix yet because we're not fixing the `partial=true` snapshot case here

With #45689 making it so that index metadata is written after all shards have been snapshotted we can't delete indices that are part of the upcoming snapshot finalization any longer and it is not sufficient to check if all shards of an index have been snapshotted before deciding that it is safe to delete it. This change forbids deleting any index that is in the process of being snapshot to avoid issues during snapshot finalization. Relates #50200 (doesn't fully fix yet because we're not fixing the `partial=true` snapshot case here

* Fix Index Deletion during Snapshot Finalization (#50202) With #45689 making it so that index metadata is written after all shards have been snapshotted we can't delete indices that are part of the upcoming snapshot finalization any longer and it is not sufficient to check if all shards of an index have been snapshotted before deciding that it is safe to delete it. This change forbids deleting any index that is in the process of being snapshot to avoid issues during snapshot finalization. Relates #50200 (doesn't fully fix yet because we're not fixing the `partial=true` snapshot case here

We can simply filter out shard generation updates for indices that were removed from the cluster state concurrently to fix index deletes during partial snapshots as that completely removes any reference to those shards from the snapshot. Follow up to elastic#50202 Closes elastic#50200

* Fix Index Deletion During Partial Snapshot Create We can simply filter out shard generation updates for indices that were removed from the cluster state concurrently to fix index deletes during partial snapshots as that completely removes any reference to those shards from the snapshot. Follow up to #50202 Closes #50200

* Fix Index Deletion During Partial Snapshot Create We can simply filter out shard generation updates for indices that were removed from the cluster state concurrently to fix index deletes during partial snapshots as that completely removes any reference to those shards from the snapshot. Follow up to elastic#50202 Closes elastic#50200

We can simply filter out shard generation updates for indices that were removed from the cluster state concurrently to fix index deletes during partial snapshots as that completely removes any reference to those shards from the snapshot. Follow up to #50202 Closes #50200

With elastic#45689 making it so that index metadata is written after all shards have been snapshotted we can't delete indices that are part of the upcoming snapshot finalization any longer and it is not sufficient to check if all shards of an index have been snapshotted before deciding that it is safe to delete it. This change forbids deleting any index that is in the process of being snapshot to avoid issues during snapshot finalization. Relates elastic#50200 (doesn't fully fix yet because we're not fixing the `partial=true` snapshot case here

* Fix Index Deletion During Partial Snapshot Create We can simply filter out shard generation updates for indices that were removed from the cluster state concurrently to fix index deletes during partial snapshots as that completely removes any reference to those shards from the snapshot. Follow up to elastic#50202 Closes elastic#50200

original-brownbear added >bug :Distributed Coordination/Snapshot/Restore Anything directly related to the `_snapshot/*` APIs v8.0.0 v7.6.0 v7.5.2 labels Dec 14, 2019

original-brownbear added 2 commits December 14, 2019 12:09

remove bs comment

39e52d9

simpler

24b97d9

original-brownbear commented Dec 14, 2019

View reviewed changes

original-brownbear requested review from tlrx, ywelsch and DaveCTurner December 14, 2019 11:27

DaveCTurner previously approved these changes Dec 16, 2019

View reviewed changes

ywelsch suggested changes Dec 16, 2019

View reviewed changes

Merge remote-tracking branch 'elastic/master' into 50200

dd82390

Don't fix partial yet

8ccea00

original-brownbear requested a review from ywelsch December 16, 2019 10:39

ywelsch approved these changes Dec 16, 2019

View reviewed changes

server/src/main/java/org/elasticsearch/snapshots/SnapshotsService.java Outdated Show resolved Hide resolved

CR: restore comment

07e7302

original-brownbear merged commit aecbb2f into elastic:master Dec 16, 2019

original-brownbear deleted the 50200 branch December 16, 2019 11:47

original-brownbear mentioned this pull request Dec 16, 2019

Fix Index Deletion during Snapshot Finalization (#50202) #50227

Merged

original-brownbear mentioned this pull request Dec 16, 2019

Fix Index Deletion during Snapshot Finalization (#50202) #50228

Merged

original-brownbear mentioned this pull request Dec 16, 2019

Fix Index Deletion During Partial Snapshot Create #50234

Merged

jasontedor added v7.5.1 and removed v7.5.2 labels Dec 16, 2019

original-brownbear mentioned this pull request Dec 17, 2019

Fix Index Deletion During Partial Snapshot Create (#50234) #50266

Merged

This was referenced Feb 3, 2020

[meta] 7.6 release elastic/elasticsearch-net#4340

Closed

[meta] 7.6 release elastic/elasticsearch-net#4341

Closed

jakelandis added v8.0.0-alpha1 and removed v8.0.0 labels Jul 26, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix Index Deletion during Snapshot Finalization #50202

Fix Index Deletion during Snapshot Finalization #50202

original-brownbear commented Dec 14, 2019 •

edited

Loading

elasticmachine commented Dec 14, 2019

original-brownbear Dec 14, 2019

DaveCTurner Dec 16, 2019

DaveCTurner left a comment

DaveCTurner Dec 16, 2019

ywelsch left a comment

original-brownbear commented Dec 16, 2019

ywelsch commented Dec 16, 2019

ywelsch left a comment

original-brownbear commented Dec 16, 2019 •

edited

Loading

ywelsch commented Dec 16, 2019

Fix Index Deletion during Snapshot Finalization #50202

Fix Index Deletion during Snapshot Finalization #50202

Conversation

original-brownbear commented Dec 14, 2019 • edited Loading

elasticmachine commented Dec 14, 2019

original-brownbear Dec 14, 2019

Choose a reason for hiding this comment

DaveCTurner Dec 16, 2019

Choose a reason for hiding this comment

DaveCTurner left a comment

Choose a reason for hiding this comment

DaveCTurner Dec 16, 2019

Choose a reason for hiding this comment

ywelsch left a comment

Choose a reason for hiding this comment

original-brownbear commented Dec 16, 2019

ywelsch commented Dec 16, 2019

ywelsch left a comment

Choose a reason for hiding this comment

original-brownbear commented Dec 16, 2019 • edited Loading

ywelsch commented Dec 16, 2019

original-brownbear commented Dec 14, 2019 •

edited

Loading

original-brownbear commented Dec 16, 2019 •

edited

Loading