Don't cancel allocation when a new sync id is found on shared filesystems #16357

dakrone · 2016-02-01T22:38:16Z

In ReplicaShardAllocator.processExistingRecoveries, if we find a "better" match, we cancel allocation of a replica:

// we found a better match that has a full sync id match, the existing allocation is not fully synced
// so we found a better one, cancel this one
it.moveToUnassigned(new UnassignedInfo(UnassignedInfo.Reason.REALLOCATED_REPLICA,
        "existing allocation of replica to [" + currentNode + "] cancelled, sync id match found on node [" + nodeWithHighestMatch + "]"));

However, on a shared filesystem all data nodes see the same data, so we should not cancel the allocation just because a new node with a sync id match pops up.
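To make the proposed behaviour concrete, here is a minimal, self-contained sketch of the decision, not the actual `ReplicaShardAllocator` code. The class name, method name, and boolean parameters are all hypothetical stand-ins; how the allocator would really learn that an index lives on a shared filesystem is left out and assumed to be available as a flag.

```java
// Hypothetical, simplified model of the proposal; none of these names are Elasticsearch APIs.
final class ReplicaCancelSketch {

    /**
     * Decides whether an in-progress replica recovery should be cancelled
     * in favour of another node.
     *
     * @param sharedFilesystem          assumed flag: the index data lives on a shared filesystem
     * @param currentNodeHasSyncIdMatch the node currently recovering already has a full sync id match
     * @param otherNodeHasSyncIdMatch   some other node has a full sync id match
     */
    static boolean shouldCancel(boolean sharedFilesystem,
                                boolean currentNodeHasSyncIdMatch,
                                boolean otherNodeHasSyncIdMatch) {
        if (sharedFilesystem) {
            // Every data node sees the same files, so no node is a "better" match.
            return false;
        }
        // Existing behaviour as described in the issue: cancel only when the current
        // allocation is not fully synced and another node is.
        return !currentNodeHasSyncIdMatch && otherNodeHasSyncIdMatch;
    }

    public static void main(String[] args) {
        System.out.println(shouldCancel(false, false, true)); // local storage: cancel
        System.out.println(shouldCancel(true, false, true));  // shared filesystem: keep
    }
}
```

In the real allocator the check would presumably sit before the `moveToUnassigned` call quoted above, so the shard is never moved back to unassigned on a shared filesystem.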

dakrone · 2016-02-01T22:38:41Z

@bleskes I spoke with @brwe about this and I think we agreed it was worth doing, but I'm curious about your input on this as well.

…found

Currently the message stays in the `UnassignedInfo` for the shard. However, when diagnosing an issue it would be very useful to know the exact point in time at which the cancellation happened. Relates to debugging elastic#16357

dakrone · 2017-05-26T18:53:34Z

Shadow replicas have been removed, so this is no longer applicable.

dakrone added >enhancement :Allocation labels Feb 1, 2016

dakrone self-assigned this Feb 1, 2016

dakrone mentioned this issue Mar 8, 2016

Log when cancelling allocation of a replica because a new syncid was found #17008

Merged

bleskes mentioned this issue Dec 7, 2016

Remove Shadow Replicas #22024

Closed

dakrone closed this as completed May 26, 2017

lcawl added :Distributed Indexing/Distributed and removed :Allocation labels Feb 13, 2018
