Cannot force allocate primary to a node where the shard already exists #22031

abeyad · 2016-12-07T13:58:55Z

Before, it was possible that the SameShardAllocationDecider would allow
force allocation of an unassigned primary to the same node on which an
active replica is assigned. This could only happen with shadow replica
indices, because when a shadow replica primary fails, the replica gets
promoted to primary but in the INITIALIZED state, not in the STARTED
state (because the engine has specific reinitialization that must take
place in the case of shadow replicas). Therefore, if the now promoted
primary that is initializing fails also, the primary will be in the
unassigned state, because replica to primary promotion only happens when
the failed shard was in the started state. The now unassigned primary
shard will go through the allocation deciders, where the
SameShardsAllocationDecider would return a NO decision, but would still
permit force allocation on the primary if all deciders returned NO.

This commit implements canForceAllocatePrimary on the
SameShardAllocationDecider, which ensures that a primary cannot be
force allocated to the same node on which an active replica already
exists.

Relates #22021

Before, it was possible that the SameShardAllocationDecider would allow force allocation of an unassigned primary to the same node on which an active replica is assigned. This could only happen with shadow replica indices, because when a shadow replica primary fails, the replica gets promoted to primary but in the INITIALIZED state, not in the STARTED state (because the engine has specific reinitialization that must take place in the case of shadow replicas). Therefore, if the now promoted primary that is initializing fails also, the primary will be in the unassigned state, because replica to primary promotion only happens when the failed shard was in the started state. The now unassigned primary shard will go through the allocation deciders, where the SameShardsAllocationDecider would return a NO decision, but would still permit force allocation on the primary if all deciders returned NO. This commit implements canForceAllocatePrimary on the SameShardAllocationDecider, which ensures that a primary cannot be force allocated to the same node on which an active replica already exists.

ywelsch

Can you add a test in SameShardRoutingTests?

abeyad · 2016-12-07T18:57:55Z

@ywelsch I pushed a test in 82c8137

ywelsch

Left one suggestion. I think it's also ok if this doesn't go to 5.0.3

ywelsch · 2016-12-08T15:10:18Z

core/src/test/java/org/elasticsearch/cluster/routing/allocation/SameShardRoutingTests.java

+        );
+
+        // can't force allocate same shard copy to the same node
+        ShardRouting newPrimary = ShardRouting.newUnassigned(primaryShard.shardId(), true,


TestShardRouting automatically provides some of the randomness that you explicitly do here.

abeyad · 2016-12-08T16:03:48Z

I pushed 9603046 which uses TestShardRouting. Thanks for the review @ywelsch

#22031) Before, it was possible that the SameShardAllocationDecider would allow force allocation of an unassigned primary to the same node on which an active replica is assigned. This could only happen with shadow replica indices, because when a shadow replica primary fails, the replica gets promoted to primary but in the INITIALIZED state, not in the STARTED state (because the engine has specific reinitialization that must take place in the case of shadow replicas). Therefore, if the now promoted primary that is initializing fails also, the primary will be in the unassigned state, because replica to primary promotion only happens when the failed shard was in the started state. The now unassigned primary shard will go through the allocation deciders, where the SameShardsAllocationDecider would return a NO decision, but would still permit force allocation on the primary if all deciders returned NO. This commit implements canForceAllocatePrimary on the SameShardAllocationDecider, which ensures that a primary cannot be force allocated to the same node on which an active replica already exists.

abeyad · 2016-12-08T17:28:56Z

5.x commit: 6d3e6f2

* master: Skip IP range query REST test prior to 5.1.2 Bump version to 5.1.2 Don't allow yaml tests with `warnings` that don't skip `warnings` (elastic#21989) Cannot force allocate primary to a node where the shard already exists (elastic#22031) Fix REST test for ip range aggregations. Build: NORELEASE is the same as norelease (elastic#22006) S3/Azure snapshot repo documentation wrong for "read_only"

abeyad added :Allocation >bug v5.0.3 v5.1.2 v5.2.0 v6.0.0-alpha1 labels Dec 7, 2016

ywelsch reviewed Dec 7, 2016

View reviewed changes

adds a test

82c8137

ywelsch approved these changes Dec 8, 2016

View reviewed changes

abeyad removed the v5.0.3 label Dec 8, 2016

Use TestShardRouting

9603046

abeyad merged commit 3da0429 into elastic:master Dec 8, 2016

abeyad deleted the same_shard_alloc_cant_force branch December 8, 2016 17:21

abeyad removed the v5.1.2 label Dec 8, 2016

lcawl added :Distributed Indexing/Distributed A catch all label for anything in the Distributed Area. Please avoid if you can. and removed :Allocation labels Feb 13, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cannot force allocate primary to a node where the shard already exists #22031

Cannot force allocate primary to a node where the shard already exists #22031

abeyad commented Dec 7, 2016

ywelsch left a comment

abeyad commented Dec 7, 2016

ywelsch left a comment

ywelsch Dec 8, 2016

abeyad commented Dec 8, 2016

abeyad commented Dec 8, 2016

Cannot force allocate primary to a node where the shard already exists #22031

Cannot force allocate primary to a node where the shard already exists #22031

Conversation

abeyad commented Dec 7, 2016

ywelsch left a comment

Choose a reason for hiding this comment

abeyad commented Dec 7, 2016

ywelsch left a comment

Choose a reason for hiding this comment

ywelsch Dec 8, 2016

Choose a reason for hiding this comment

abeyad commented Dec 8, 2016

abeyad commented Dec 8, 2016