Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[ML] Transforms/DFA: Wizard performance degrades with indices with a lot of fields #78590

Closed
walterra opened this issue Sep 28, 2020 · 3 comments · Fixed by #98259
Closed

[ML] Transforms/DFA: Wizard performance degrades with indices with a lot of fields #78590

walterra opened this issue Sep 28, 2020 · 3 comments · Fixed by #98259
Assignees
Labels
bug Fixes for quality problems that affect the customer experience Feature:Transforms ML transforms :ml v7.13.0

Comments

@walterra
Copy link
Contributor

With indices with a lot of fields (e.g. filebeat with 4000+ fields), performance of the transform wizard can significantly drop, resulting in page freezes of several seconds on each rerender. Some investigation is needed to identify root causes of the problem. One suspicion is that the dropdowns of the pivot configuration cause the freezes as the dropdown option might get recalculated on each render.

@walterra walterra added bug Fixes for quality problems that affect the customer experience :ml Feature:Transforms ML transforms labels Sep 28, 2020
@walterra walterra self-assigned this Sep 28, 2020
@elasticmachine
Copy link
Contributor

Pinging @elastic/ml-ui (:ml)

@walterra walterra changed the title [ML] Transform: Wizard performance degrades with indices with a lot of fields [ML] Transforms/DFA: Wizard performance degrades with indices with a lot of fields Oct 22, 2020
@walterra
Copy link
Contributor Author

Further investigation shows that this is not caused by the dropdown but the data grid.

image

Tested with a metricbeat index with 3500+ fields but just a small amount of documents. Loading the wizard takes 10+ seconds, with freezes of multiple seconds on every rerender that touches data grid.

Removing the data grid component for the source index gets the page back to a working state.

@walterra
Copy link
Contributor Author

walterra commented Feb 22, 2021

Proposed solution/workaround:

Similar to Discover, fetch a sample of e.g. 500 docs, find out which fields are populated/actually used in documents, and filter the list of available fields passed on to the data grid based on that. A toggle should allow the user to switch off that filtering. In Discover, this reduces the fields returned from 3500+ to ~125 based on the metricbeat index above.

This can be also used to filter the available options in the groupby/aggs-dropdowns.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Fixes for quality problems that affect the customer experience Feature:Transforms ML transforms :ml v7.13.0
Projects
None yet
2 participants