
Deleting an index during concurrent taking of more than one snapshot causes future restore to fail #1779

Closed
aYukiSekiguchi opened this issue Dec 20, 2021 · 8 comments · Fixed by #4570
Labels: bug (Something isn't working), distributed framework

Comments

@aYukiSekiguchi

aYukiSekiguchi commented Dec 20, 2021

Describe the bug
Elasticsearch 7.10.2, from which OpenSearch was forked, has a known issue with snapshot and restore:
https://www.elastic.co/guide/en/elasticsearch/reference/7.10/release-notes-7.10.2.html#known-issues-7.10.2

If an index is deleted while the cluster is concurrently taking more than one snapshot then there is a risk that one of the snapshots may never complete and also that some shard data may be lost from the repository, causing future restore operations to fail.

To Reproduce
Steps to reproduce the behavior:
I haven't reproduced the behavior, but I guess...

Delete an index while the cluster is concurrently taking more than one snapshot.
There is a risk that one of the snapshots may never complete and also that some shard data may be lost from the repository, causing future restore operations to fail.
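
A hypothetical reproduction against a 7.10-based cluster might look like the following (my-repo and my-index are placeholder names, and the repository must already be registered):

PUT _snapshot/my-repo/snapshot-1?wait_for_completion=false
PUT _snapshot/my-repo/snapshot-2?wait_for_completion=false
DELETE /my-index

Deleting my-index while both snapshots are still in progress is the window in which the bug can be triggered.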

Expected behavior
Concurrent snapshots complete successfully and subsequent restores work.

Plugins
I haven't reproduced the behavior, but I guess no plugin is needed.

Screenshots
None

Host/Environment (please complete the following information):
I haven't reproduced the behavior, but I guess all environments are affected.

Additional context
I checked the patch in Elasticsearch and the same file on the OpenSearch main branch. It looks like OpenSearch has the same issue.

@aYukiSekiguchi aYukiSekiguchi added the bug (Something isn't working) and untriaged labels on Dec 20, 2021
@Bukhtawar
Collaborator

This is specific to Amazon OpenSearch Service, which is documented here: https://docs.aws.amazon.com/opensearch-service/latest/developerguide/supported-operations.html#version_7_10

@dblock
Member

dblock commented Dec 21, 2021

Is there something we should/can do in OpenSearch itself? If not, should we close this?

@reta
Collaborator

reta commented Dec 21, 2021

@dblock the bare-bones OpenSearch works just fine:

{
    "acknowledged": true,
    "persistent": {
        "snapshot": {
            "max_concurrent_operations": "1"
        }
    },
    "transient": {}
}

As per @Bukhtawar, this is specific to the AWS OpenSearch offering: only limited settings are supported.
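
For reference, the acknowledgement above is what applying the mitigation from the Elasticsearch 7.10.2 release notes returns on a self-managed cluster; a request along these lines (the value 1 is the documented mitigation) should produce it:

PUT _cluster/settings
{
    "persistent": {
        "snapshot.max_concurrent_operations": 1
    }
}

With the limit at 1, a second snapshot or snapshot delete cannot run concurrently, which should close the window described in this issue.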

@dblock dblock closed this as completed Dec 22, 2021
@aYukiSekiguchi
Author

aYukiSekiguchi commented Dec 23, 2021

I understand that the mitigation setting not working is a problem with Amazon OpenSearch Service.

However, my question is "Does OpenSearch have the known issue which Elasticsearch 7.10.2 has?"

To clarify, I quote the known issue from Elasticsearch 7.10.2 Release Note:

Snapshot and restore: If an index is deleted while the cluster is concurrently taking more than one snapshot then there is a risk that one of the snapshots may never complete and also that some shard data may be lost from the repository, causing future restore operations to fail.

...

This issue is fixed in Elasticsearch versions 7.13.1 and later. It is not possible to repair a repository once it is affected by this issue, so you must restore the repository from a backup, or clear the repository by executing DELETE _snapshot/<repository>/*, or move to a fresh repository. For more details, see #73456.

@reta
Collaborator

reta commented Dec 23, 2021

@aYukiSekiguchi OpenSearch is a fork of Elasticsearch as of 7.10.2, so it is very likely that the issue is still present

@dblock
Member

dblock commented Dec 23, 2021

@aYukiSekiguchi @Bukhtawar let's reopen and re-describe the original problem. It sounds like the root issue is that “If an index is deleted while the cluster is concurrently taking more than one snapshot then there is a risk that one of the snapshots may never complete and also that some shard data may be lost from the repository, causing future restore operations to fail.” We should fix this. Please note that we cannot take non-AL2 (non-Apache-2.0-licensed) code from ES.

@dblock dblock reopened this Dec 23, 2021
@dblock dblock changed the title [BUG] Snapshot and restore known issue in Elasticsearch 7.10.2 Deleting an index during concurrent taking of more than one snapshot causes future restore to fail Dec 23, 2021
@aYukiSekiguchi
Author

I updated the description and removed the part about the mitigation setting because it confused some people.

Note for future readers:
The snapshot.max_concurrent_operations mitigation described in the Elasticsearch 7.10.2 release notes should work on bare-bones OpenSearch. However, it looks like Amazon OpenSearch Service doesn't support the setting.

@xuezhou25
Contributor

xuezhou25 commented Sep 20, 2022

Updating my thoughts on the root cause:
Snapshot creation and deletion cannot proceed at the same time.
After a snapshot delete is removed from the cluster state, actions can be triggered on in-progress snapshots, see:

* The removal of a delete from the cluster state can trigger two possible actions on in-progress snapshots:
* <ul>
* <li>Snapshots that had unfinished shard snapshots in state {@link ShardSnapshotStatus#UNASSIGNED_QUEUED} that
* could not be started because the delete was running can have those started.</li>
* <li>Snapshots that had all their shards reach a completed state while a delete was running (e.g. as a result of
* nodes dropping out of the cluster or another incoming delete aborting them) need not be updated in the cluster
* state but need to have their finalization triggered now that it's possible with the removal of the delete
* from the state.</li>
* </ul>

If a snapshot creation is queued behind the snapshot deletion (meaning there are unfinished shard snapshots in the UNASSIGNED_QUEUED state), there is a step that checks the current shard status in updatedSnapshotsInProgress, starting from this line:

// We don't have a new assignment for this shard because its index was concurrently deleted

When an index is deleted before this check, the ShardSnapshotStatus is put into the MISSING state, which is treated as a "completed" shard snapshot. The update of snapshotEntries then goes through the wrong path, in this line:

snapshotEntries.add(entry.withStartedShards(updatedAssignmentsBuilder.build()));

The method withStartedShards does not check whether the snapshot has completed (it assumes the snapshot is still running), see:
/**
* Same as {@link #withShardStates} but does not check if the snapshot completed and thus is only to be used when starting new
* shard snapshots on data nodes for a running snapshot.
*/
public Entry withStartedShards(ImmutableOpenMap<ShardId, ShardSnapshotStatus> shards) {

This leads to the completed snapshot never being finalized; this is the code that finalizes completed snapshots:

// Entry is already completed so we will finalize it now that the delete doesn't block us after
// this CS update finishes
newFinalizations.add(entry);

The code defect may cause a state-consistency issue during snapshot restore: the snapshot's overall status is not completed, while every ShardSnapshotStatus is completed.

WARNING: Uncaught exception in thread: Thread[opensearch[node_t0][clusterManagerService#updateTask][T#1],5,TGRP-DedicatedClusterSnapshotRestoreIT]
java.lang.AssertionError: Completed state must imply all shards completed but saw state [STARTED] and shards [[test-index][0]=>ShardSnapshotStatus[state=MISSING, nodeId=null, reason=missing index, generation=null]]
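
To make the failure mode concrete, here is a minimal stand-alone sketch. It is not the actual OpenSearch code; ShardState and Entry below are simplified stand-ins, and the only point it demonstrates is how a shard that becomes MISSING ("completed") while the entry is routed through the started-shards path leaves the entry in STARTED and never finalized:

// Minimal stand-in model of the problematic transition; these are NOT the real
// SnapshotsInProgress / ShardSnapshotStatus classes, just illustrative names.
import java.util.Map;

public class MissingShardSketch {

    // Simplified shard states; MISSING counts as "completed", as in SnapshotsInProgress.
    enum ShardState {
        QUEUED(false), SUCCESS(true), MISSING(true);
        final boolean completed;
        ShardState(boolean completed) { this.completed = completed; }
    }

    // Simplified snapshot entry: an overall state plus per-shard states.
    record Entry(String overallState, Map<String, ShardState> shards) {

        // Models withStartedShards: swaps in new shard assignments but never
        // re-checks whether the snapshot as a whole is now completed.
        Entry withStartedShards(Map<String, ShardState> newShards) {
            return new Entry(overallState, newShards); // overall state stays STARTED
        }

        boolean allShardsCompleted() {
            return shards.values().stream().allMatch(s -> s.completed);
        }
    }

    public static void main(String[] args) {
        // A shard queued behind a running delete; its index is deleted concurrently.
        Entry entry = new Entry("STARTED", Map.of("[test-index][0]", ShardState.QUEUED));

        // The reassignment step finds no new assignment (index gone), maps the shard
        // to MISSING, but routes the entry through the started-shards path.
        Entry updated = entry.withStartedShards(Map.of("[test-index][0]", ShardState.MISSING));

        // Every shard is completed, yet the entry still claims STARTED, so it is
        // never finalized -- the same inconsistency reported by the assertion above.
        System.out.println("all shards completed = " + updated.allShardsCompleted()
                + ", overall state = " + updated.overallState());
    }
}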
