Downsampling performance analysis and improvement #90226

salvatore-campagna · 2022-09-22T10:11:31Z

Description

We would like to measure performances of downsampling operations using Rally. For this purpose we need to include a new challenge to the existing tsdb Rally track. The new challenge will measure latency for a limited set of downsampling operations using different values for the fixed_interval parameter. As part of the analysis we need to collect JFR recordings and flame graphs so that we can spot areas of the code we can improve.

Right now the tsdb track uses a dataset including more than 116M documents for a total JSON file size of more than 120 GB, which results in a 32.5 GB index. The plan is to measure downsampling latency with a single thread implementation, a single node Elasticsearch cluster and a single shard.

The text was updated successfully, but these errors were encountered:

elasticsearchmachine · 2022-09-22T10:11:59Z

Pinging @elastic/es-analytics-geo (Team:Analytics)

salvatore-campagna · 2022-09-22T10:15:08Z

Attaching flame graph collected while running two downsample operations and JFR recording collected for the whole challenge:

downsample-1h: using fixed_interval: 1h
downsample-1d:using fixed_interval: 1d

downsample-flamegraph-d6e36b58-52ff-495f-b64d-a765c368f7ad.html.zip

profile-d6e36b58-52ff-495f-b64d-a765c368f7ad.jfr.zip

salvatore-campagna · 2022-09-22T10:25:02Z

As a result of analyzing both the JFR recording and the flame graph I see two improvements which are worth working on:

making sure we don't decode keyword fields (BytesRef to UTF) and just use the BytesRef (see RollupShardIndexer)
making sure we get rid of hash map access and iteration while collecting fields (see RollupShardIndexer)

Other than that, time is spent reading doc values which is expected.

NOTE: after merging PR #90088 we see consistent and significant improvements in latency. Latest tests show that both the downsampling operations (1h and 1d fixed interval) take around 30 minutes to complete.

salvatore-campagna · 2022-09-22T12:31:09Z

Downsampling the same source index using 1m fixed interval took about 1.5 hours producing an index with about 7M documents. Attaching JFR recording and flames graph.

profile-9c51f1f9-816b-4fa7-8739-aa140dd6d3e6.jfr.zip

downsample-flamegraph-9c51f1f9-816b-4fa7-8739-aa140dd6d3e6.html.zip

salvatore-campagna · 2022-12-29T11:31:54Z

Closing after the following PR has been merged #92494

salvatore-campagna added >enhancement :StorageEngine/Rollup Turn fine-grained time-based data into coarser-grained data Team:Analytics Meta label for analytical engine team (ESQL/Aggs/Geo) labels Sep 22, 2022

salvatore-campagna self-assigned this Sep 22, 2022

salvatore-campagna mentioned this issue Sep 22, 2022

Add better support for metric data types (TSDB) #74660

Closed

salvatore-campagna closed this as completed Dec 29, 2022

craigtaverner changed the title ~~Downsamplig performance analysis and improvement~~ Downsampling performance analysis and improvement Feb 10, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Downsampling performance analysis and improvement #90226

Downsampling performance analysis and improvement #90226

salvatore-campagna commented Sep 22, 2022 •

edited

Loading

elasticsearchmachine commented Sep 22, 2022

salvatore-campagna commented Sep 22, 2022 •

edited

Loading

salvatore-campagna commented Sep 22, 2022 •

edited

Loading

salvatore-campagna commented Sep 22, 2022 •

edited

Loading

salvatore-campagna commented Dec 29, 2022

Downsampling performance analysis and improvement #90226

Downsampling performance analysis and improvement #90226

Comments

salvatore-campagna commented Sep 22, 2022 • edited Loading

Description

elasticsearchmachine commented Sep 22, 2022

salvatore-campagna commented Sep 22, 2022 • edited Loading

salvatore-campagna commented Sep 22, 2022 • edited Loading

salvatore-campagna commented Sep 22, 2022 • edited Loading

salvatore-campagna commented Dec 29, 2022

salvatore-campagna commented Sep 22, 2022 •

edited

Loading

salvatore-campagna commented Sep 22, 2022 •

edited

Loading

salvatore-campagna commented Sep 22, 2022 •

edited

Loading

salvatore-campagna commented Sep 22, 2022 •

edited

Loading