
[pkg/translator/prometheusremotewrite] inefficient memory use during conversion to native histograms #24405

Closed
krajorama opened this issue Jul 20, 2023 · 12 comments

@krajorama (Contributor)

Component(s)

pkg/translator/prometheusremotewrite

Describe the issue you're reporting

During the implementation of #17565 I noticed that much (~50%) of the conversion time is spent on reallocating slices for the resulting spans and deltas.

Background

From a conversation with @charleskorn:
I see your PR grafana/mimir#5531; there's a similar issue in translating exponential histograms to native histograms, where the spans and deltas are getting reallocated all the time. Except it's hard to give a good estimate there of the size needed (although the upper bound is obvious), due to the difference between the OTel and Prometheus representations, plus possible downscaling.

How bad is it if we take the upper bound as the estimate?

Haven't calculated yet, it's kind of on my backlog. For deltas the reduction is potentially 2^n, where n is the scale difference between the exponential and the native histogram (in my code it's the scaledown): https://github.com/open-telemetry/opentelemetry-collector-contrib/blob/65e6596c028[…]22439f45e788/pkg/translator/prometheusremotewrite/histograms.go. But there's a chance some buckets are not neatly mergeable, so you cannot say that the reduction is exactly 2^n.
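
To make the 2^n figure concrete, here is a minimal Go sketch of the merge, assuming a plain (offset, counts) bucket representation. The function name and signature are hypothetical, and it ignores the off-by-one between the OTel and Prometheus bucket boundaries that the real translator has to account for:

```go
// downscaleCounts merges bucket counts from scale s down to scale s-scaleDown.
// Bucket index i at the higher scale lands in index i>>scaleDown at the lower
// scale, so up to 2^scaleDown adjacent buckets collapse into one.
// Hypothetical helper for illustration only.
func downscaleCounts(offset int32, counts []uint64, scaleDown int32) (int32, []uint64) {
	if scaleDown <= 0 || len(counts) == 0 {
		return offset, counts
	}
	firstIdx := offset >> scaleDown // arithmetic shift == floor division by 2^scaleDown
	lastIdx := (offset + int32(len(counts)) - 1) >> scaleDown
	out := make([]uint64, lastIdx-firstIdx+1) // never larger than len(counts)
	for i, c := range counts {
		out[((offset+int32(i))>>scaleDown)-firstIdx] += c
	}
	return firstIdx, out
}
```

Whether the reduction is exactly 2^scaleDown depends on how the populated buckets line up with the coarser boundaries, which is why the upper bound is easy to state but the exact output size is not.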

For spans it might be that all buckets fit in a single span because they are consecutive. On the other hand, you might find that every bucket needs its own span. It might be worth calculating the number of deltas and spans in a first pass and just filling them in later. Or maybe a better strategy is to allocate in batches (+10%? double?), no idea.

It'd be interesting to compare doing a first pass to calculate the number of deltas and spans (so we can allocate accurately) against guessing or assuming the worst case.
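
As a rough sketch of the first-pass idea, a counting pass over the already-downscaled bucket counts could size both slices exactly before anything is filled in. The helper below is hypothetical, and the gap > 2 rule stands in for the usual trade-off between emitting a few zero deltas and starting a new span; it is not the translator's actual heuristic:

```go
// countSpansAndDeltas walks the bucket counts once and reports how many spans
// and deltas the native-histogram encoding would need.
func countSpansAndDeltas(counts []uint64) (spans, deltas int) {
	gap := 0
	inSpan := false
	for _, c := range counts {
		if c == 0 {
			gap++
			continue
		}
		if !inSpan || gap > 2 {
			// First populated bucket, or the run of empty buckets is too long
			// to bridge with explicit zero deltas: start a new span.
			spans++
			inSpan = true
		} else {
			deltas += gap // zero deltas bridging a small gap inside the current span
		}
		gap = 0
		deltas++ // one delta for this populated bucket
	}
	return spans, deltas
}
```

A second, identical walk can then write the spans and deltas into slices allocated with exactly these lengths, avoiding any growth-driven reallocation.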

For the batches, something that worked well for a similar problem when streaming chunks from [Mimir] ingesters to queriers was to use a linked list of batches of a fixed size, then create one final slice with the correct size and copy the elements from the batches. This saved a bunch of repetitive work copying elements into expanded slices, only for them to be copied again the next time the slice grew. These batches can also be pooled, so over time you only pay the cost of the final slice.

But given the difference in the amount of data involved here, it might not be worth it (in the ingesters, we were dealing with substantial structs, if I remember correctly).
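
For illustration, a minimal Go sketch of the batch-and-copy idea described above, using a slice of fixed-size batches (rather than a literal linked list) together with a sync.Pool; all names are hypothetical:

```go
package histconv // hypothetical package name

import "sync"

const batchSize = 256 // arbitrary fixed batch size

// batchPool recycles fixed-size batches across conversions.
var batchPool = sync.Pool{
	New: func() any { return new([batchSize]int64) },
}

// deltaBuffer accumulates values in fixed-size batches and produces one
// exactly-sized slice at the end, so elements are copied at most once.
type deltaBuffer struct {
	full []*[batchSize]int64 // completed batches, in order
	cur  *[batchSize]int64   // batch currently being filled
	n    int                 // number of values in cur
}

func (b *deltaBuffer) append(v int64) {
	if b.cur == nil || b.n == batchSize {
		if b.cur != nil {
			b.full = append(b.full, b.cur)
		}
		b.cur = batchPool.Get().(*[batchSize]int64)
		b.n = 0
	}
	b.cur[b.n] = v
	b.n++
}

// finish copies everything into a single slice of the exact size and returns
// the batches to the pool for reuse.
func (b *deltaBuffer) finish() []int64 {
	out := make([]int64, 0, len(b.full)*batchSize+b.n)
	for _, batch := range b.full {
		out = append(out, batch[:]...)
		batchPool.Put(batch)
	}
	if b.cur != nil {
		out = append(out, b.cur[:b.n]...)
		batchPool.Put(b.cur)
	}
	b.full, b.cur, b.n = nil, nil, 0
	return out
}
```

Values are copied exactly once into the final, exactly-sized slice, and the batch arrays are reused across conversions, at the cost of a little extra bookkeeping compared to a plain append.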

krajorama added the needs triage label Jul 20, 2023
@github-actions (bot)

Pinging code owners:

See Adding Labels via Comments if you do not have permissions to add labels yourself.

@github-actions (bot)

Pinging code owners for exporter/prometheusremotewrite: @Aneurysm9 @rapphil. See Adding Labels via Comments if you do not have permissions to add labels yourself.

@frzifus (Member)

frzifus commented Jul 26, 2023

cc @sh0rez

@github-actions (bot)

This issue has been inactive for 60 days. It will be closed in 60 days if there is no activity. To ping code owners by adding a component label, see Adding Labels via Comments, or if you are unsure of which component this issue relates to, please ping @open-telemetry/collector-contrib-triagers. If this issue is still relevant, please ping the code owners or leave a comment explaining why it is still relevant. Otherwise, please close it.

Pinging code owners:

See Adding Labels via Comments if you do not have permissions to add labels yourself.

github-actions bot added the Stale label Sep 25, 2023
frzifus removed the Stale label Sep 25, 2023
github-actions bot added the Stale label Nov 27, 2023
@jmichalek132 (Contributor)

still relevant.

github-actions bot added the Stale label Jan 29, 2024
@jmichalek132 (Contributor)

still relevant.

github-actions bot removed the Stale label Jan 30, 2024
github-actions bot added the Stale label Apr 1, 2024
@aknuds1 (Contributor)

aknuds1 commented Apr 24, 2024

I can have a look at this.

github-actions bot removed the Stale label Apr 25, 2024
github-actions bot added the Stale label Jun 25, 2024
@github-actions (bot)

This issue has been closed as inactive because it has been stale for 120 days with no activity.

github-actions bot closed this as not planned Aug 24, 2024