Auto-expand replicas should clamp to closest value #84788

DaveCTurner · 2022-03-09T08:17:13Z

Elasticsearch Version

All

Installed Plugins

N/A

Java Version

bundled

OS Version

N/A

Problem Description

Today if an index has an index.auto_expand_replicas setting which cannot be realised in the cluster then the number of replicas is left unchanged by AllocationService#adaptAutoExpandReplicas(). Instead, I think we should change the number of replicas to the closest value within the permitted range.

For instance: if a cluster has a single data node which is marked as shutting down for restart then auto-expand replicas considers this cluster to have zero data nodes, and hence requires -1 replicas. If an index is created with index.auto_expand_replicas: "0-1" then it will not be adjusted to have zero replicas, even though this would be better than leaving it at the default value of 1 replica, and will ultimately be the correct setting.

Steps to Reproduce

Create a cluster with one data node.
Put a RESTART shutdown marker on the data node.
Create an index with index.auto_expand_replicas: "0-1"
Observe that this results in an unassigned replica.

Logs (if relevant)

No response

The text was updated successfully, but these errors were encountered:

elasticmachine · 2022-03-09T08:17:16Z

Pinging @elastic/es-distributed (Team:Distributed)

henningandersen · 2022-05-09T11:26:48Z

For instance: if a cluster has a single data node which is marked as shutting down for restart

Notice that this example is no longer valid since #85277, but the general problem persists.

Ensure that the number of replicas chosen for an auto-expand-able shard is within the range of the available data nodes, i.e., excluding those nodes that cannot be assigned a replica. Closes elastic#84788

Ensure that the number of replicas chosen for an auto-expand-able shard is within the range of the available data nodes, i.e., excluding those nodes that cannot be assigned a replica. Closes #84788

DaveCTurner added >bug :Distributed Coordination/Allocation All issues relating to the decision making around placing a shard (both master logic & on the nodes) labels Mar 9, 2022

elasticmachine added the Team:Distributed (Obsolete) Meta label for distributed team (obsolete). Replaced by Distributed Indexing/Coordination. label Mar 9, 2022

romain-chanu mentioned this issue Mar 10, 2022

migrations fail with "Not enough active copies to meet shard count of [ALL] (have 1, needed 2)" elastic/kibana#127136

Closed

rudolf mentioned this issue Apr 12, 2022

Reduce Kibana upgrade plan failures on Cloud elastic/kibana#129899

Closed

pxsalehi self-assigned this Jun 2, 2022

pxsalehi mentioned this issue Jun 8, 2022

Clamp auto-expand replicas to the closest value #87505

Merged

pxsalehi closed this as completed in #87505 Jun 13, 2022

lukeelmers mentioned this issue Jun 22, 2022

[migrations] Add descriptive logs for unavailable_shards_exception elastic/kibana#134951

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Auto-expand replicas should clamp to closest value #84788

Auto-expand replicas should clamp to closest value #84788

DaveCTurner commented Mar 9, 2022 •

edited

Loading

elasticmachine commented Mar 9, 2022

henningandersen commented May 9, 2022

Auto-expand replicas should clamp to closest value #84788

Auto-expand replicas should clamp to closest value #84788

Comments

DaveCTurner commented Mar 9, 2022 • edited Loading

Elasticsearch Version

Installed Plugins

Java Version

OS Version

Problem Description

Steps to Reproduce

Logs (if relevant)

elasticmachine commented Mar 9, 2022

henningandersen commented May 9, 2022

DaveCTurner commented Mar 9, 2022 •

edited

Loading