Sample data creation fails for the new feature #2633

Arpit-Bandejiya · 2022-10-20T11:02:42Z

Describe the bug

Due to the changes done here in Opensearch : https://github.com/opensearch-project/OpenSearch/pull/3462/files#diff-013717f93370bf1d9635d1b84aee81e7e003e3fd6c6bb7c74b9890a1327a04b6

We are seeing that the sample data creation is failing due to low replica count

To Reproduce
Steps to reproduce the behavior:

create a cluster with the feature mentioned above enabled( set thecluster.routing.allocation.awareness.balance in the opensearch.yaml file to enable the feature).
click on sample data creation in dashboard.

Expected behaviour
We should be able to create the sample data from the dashboard.

OpenSearch Version
latest version

Dashboards Version
Any dashboard version supported

Plugins

Please list all plugins currently enabled.

Host/Environment (please complete the following information):

OS: Mac OS
Browser: Chrome

The text was updated successfully, but these errors were encountered:

ananzh · 2022-10-20T18:00:51Z

@Arpit-Bandejiya do we when this will be release in OS? is it v2.4? Thx

AMoo-Miki · 2022-10-20T18:12:01Z

@Arpit-Bandejiya Does it produce any logs or errors during the failure?
Also, when you say latest version, do you mean main branch of OpenSearch or the 2.3 release?

kavilla · 2022-10-20T18:13:28Z

@Arpit-Bandejiya can you provide insight on what the fix here? And if our sample data is failing could it be possible that others will experience this in other ingest software? Therefore, is this not technically a breaking change and it should a 3.x change?

Arpit-Bandejiya · 2022-10-21T06:06:17Z

@Arpit-Bandejiya can you provide insight on what the fix here? And if our sample data is failing could it be possible that others will experience this in other ingest software? Therefore, is this not technically a breaking change and it should a 3.x change?

Replica count enforcement is done only when cluster.routing.allocation.awareness.balance is enabled. This feature is disabled by default. Hence it is not a breaking change.

@Arpit-Bandejiya Does it produce any logs or errors during the failure?
Also, when you say latest version, do you mean main branch of OpenSearch or the 2.3 release?

error reponse:

{
  "error" : {
    "root_cause" : [
      {
        "type" : "invalid_index_template_exception",
        "reason" : "index_template [template_1] invalid, cause [Validation Failed: 1: expected total copies needs to be a multiple of total awareness attributes [3];]"
      }
    ],
    "type" : "invalid_index_template_exception",
    "reason" : "index_template [template_1] invalid, cause [Validation Failed: 1: expected total copies needs to be a multiple of total awareness attributes [3];]"
  },
  "status" : 400
}

This feature is present in main as well as in 2.3 release

joshuarrrr · 2022-10-25T20:58:01Z

Triage - does the sample data need to support any combination of user settings/options (what's the purpose and use-case for sample data)?

UX: should sample data support non-default cluster configurations?

ohltyler · 2023-02-13T22:00:56Z

@Arpit-Bandejiya can you describe more on what the fix should be? Does the sample data indices settings need to have a dynamic way for specifying certain settings fields, such as auto_expand_replicas? From the error messages, it seems it will need to be dynamic based on the cluster's total awareness attributes

ohltyler · 2023-02-14T16:45:55Z

Update - I've learned the key cluster setting to be aware of is the max awareness attribute value, of which there could be multiple (AZs, rack IDs, etc.). The upper limit of auto_expand_replicas must be a multiple of that. Note that this setting by default does not take into account awareness attributes. From documentation:

Note that the auto-expanded number of replicas only takes allocation filtering rules into account, but ignores any other allocation rules such as shard allocation awareness snd total shards per node

Because of this, if cluster.routing.allocation.awareness.balance is set to true, and a user ingests sample data, there is no current way (I believe) to easily read the total awareness attribute value and update the index setting before index creation, and so the ingestion may fail if the replica count isn't a multiple of the max AZ count.

Maybe just adding documentation around this setting is sufficient. @gbbafna can you point me to the current documentation for this setting? I can't seem to find it in the OpenSearch docs.

I will defer the decision to the feature owner and Dashboards team for deciding on the path forward. From a plugin owner perspective, it is more logical and maintainable to maintain the same sample data index configuration as that of core Dashboards, and so I will work on a fix in the AD plugin to consume such settings.

gbbafna · 2023-02-15T07:23:19Z

Hi @ohltyler : Please find the documentation in https://opensearch.org/docs/latest/tuning-your-cluster/cluster/ . Search for Replica count enforcement in here.

gbbafna · 2023-02-15T07:25:56Z

We have also added default_replica_count as a cluster level setting : opensearch-project/OpenSearch#5610 . For sample data, it should be fine to use that instead of using auto expand replica at all . Using that , AD won't need to bother about all of the cluster settings used as well .

ohltyler · 2023-02-15T16:38:04Z

Yes- totally agree. Thanks for providing this option!

We can eliminate this setting and consume cluster defaults. I will work on making that change on the AD plugin side.

ohltyler · 2023-02-15T18:48:35Z

Update: AD-related changes have been merged & backported - see opensearch-project/anomaly-detection-dashboards-plugin#423

Arpit-Bandejiya added bug Something isn't working untriaged labels Oct 20, 2022

ananzh removed the bug Something isn't working label Oct 20, 2022

Arpit-Bandejiya changed the title ~~[BUG] Sample data creation fails for the new feature~~ Sample data creation fails for the new feature Oct 21, 2022

joshuarrrr added low priority enhancement New feature or request help wanted Community development is encouraged needs research ux / ui Improvements or additions to user experience, flows, components, UI elements and removed untriaged labels Oct 25, 2022

ohltyler mentioned this issue Feb 13, 2023

Sample data ingestions fails when cluster.routing.allocation.awareness.balance enabled opensearch-project/anomaly-detection-dashboards-plugin#417

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sample data creation fails for the new feature #2633

Sample data creation fails for the new feature #2633

Arpit-Bandejiya commented Oct 20, 2022 •

edited

Loading

ananzh commented Oct 20, 2022

AMoo-Miki commented Oct 20, 2022 •

edited

Loading

kavilla commented Oct 20, 2022

Arpit-Bandejiya commented Oct 21, 2022

joshuarrrr commented Oct 25, 2022

ohltyler commented Feb 13, 2023

ohltyler commented Feb 14, 2023

gbbafna commented Feb 15, 2023

gbbafna commented Feb 15, 2023 •

edited

Loading

ohltyler commented Feb 15, 2023

ohltyler commented Feb 15, 2023 •

edited

Loading

Sample data creation fails for the new feature #2633

Sample data creation fails for the new feature #2633

Comments

Arpit-Bandejiya commented Oct 20, 2022 • edited Loading

ananzh commented Oct 20, 2022

AMoo-Miki commented Oct 20, 2022 • edited Loading

kavilla commented Oct 20, 2022

Arpit-Bandejiya commented Oct 21, 2022

joshuarrrr commented Oct 25, 2022

ohltyler commented Feb 13, 2023

ohltyler commented Feb 14, 2023

gbbafna commented Feb 15, 2023

gbbafna commented Feb 15, 2023 • edited Loading

ohltyler commented Feb 15, 2023

ohltyler commented Feb 15, 2023 • edited Loading

Arpit-Bandejiya commented Oct 20, 2022 •

edited

Loading

AMoo-Miki commented Oct 20, 2022 •

edited

Loading

gbbafna commented Feb 15, 2023 •

edited

Loading

ohltyler commented Feb 15, 2023 •

edited

Loading