tsdb: Avoid chunks with >120 samples in MergeOverlappingChunks #5862

Closed
codesome opened this issue Aug 2, 2019 · 14 comments

@codesome
Member

codesome commented Aug 2, 2019

Currently, during vertical compaction, we merge overlapping chunks directly, so the resulting chunk can exceed the limit of 120 samples. A merged chunk needs to be broken down into smaller chunks if it crosses 120 samples.

The piece of code where the fix goes: https://github.com/prometheus/tsdb/blob/d5b3f0704379a9eaca33b711aa0097f001817fc2/chunks/chunks.go#L208-L240
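
For illustration, here is a minimal sketch of the kind of splitting that could happen at that point, assuming the merged, time-sorted samples have already been collected into a slice. The `sample` type, the `maxSamplesPerChunk` constant, and the `splitIntoChunks` helper are hypothetical names; only `chunkenc.NewXORChunk`, its `Appender`, and `chunks.Meta` come from the tsdb packages.

```go
// Sketch only: split an already-merged, time-sorted sample slice into
// XOR chunks holding at most 120 samples each.
package chunksplit

import (
	"github.com/prometheus/tsdb/chunkenc"
	"github.com/prometheus/tsdb/chunks"
)

const maxSamplesPerChunk = 120

// sample is a hypothetical (timestamp, value) pair used for illustration.
type sample struct {
	t int64
	v float64
}

// splitIntoChunks re-encodes the samples into chunks capped at maxSamplesPerChunk.
func splitIntoChunks(samples []sample) ([]chunks.Meta, error) {
	var metas []chunks.Meta
	for len(samples) > 0 {
		n := len(samples)
		if n > maxSamplesPerChunk {
			n = maxSamplesPerChunk
		}
		chk := chunkenc.NewXORChunk()
		app, err := chk.Appender()
		if err != nil {
			return nil, err
		}
		for _, s := range samples[:n] {
			app.Append(s.t, s.v)
		}
		metas = append(metas, chunks.Meta{
			MinTime: samples[0].t,
			MaxTime: samples[n-1].t,
			Chunk:   chk,
		})
		samples = samples[n:]
	}
	return metas, nil
}
```

In practice the split would also have to respect the existing chunk boundaries and time ranges in `MergeOverlappingChunks`, which this sketch glosses over.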

@bwplotka
Member

bwplotka commented Aug 2, 2019

Happy to tackle this in the same PR as well. Thanks for pointing out the place for this one 👍

@bwplotka bwplotka self-assigned this Aug 2, 2019
@bwplotka bwplotka transferred this issue from prometheus-junkyard/tsdb Aug 13, 2019
@bwplotka bwplotka changed the title Avoid chunks with >120 samples in MergeOverlappingChunks tsdb: Avoid chunks with >120 samples in MergeOverlappingChunks Aug 13, 2019
@nidhidhamnani
Contributor

I would like to work on this issue

@codesome
Member Author

@nidhidhamnani go ahead!

@Sudhar287

Can you please elaborate on why this modification is necessary?

@bwplotka
Member

bwplotka commented Jan 29, 2020

Sure, @Sudhar287. The reason is that our compression algorithm is currently tuned so that, statistically, the best compression ratio is reached at around 120 samples per chunk. Anything beyond that increases query latency (and memory use) because of decoding, without really improving the stored size.

That's why we stick to a maximum of 120.

@Sudhar287

Okay, understood. Thanks @bwplotka

@brancz
Member

brancz commented Jan 29, 2020

I think a max of 120 alone might not really reflect the entirety of what we want. At the same time, we want to avoid chunks with a low number of samples, since, extrapolated over the long run, they cause larger disk usage and thus more space that needs to be mapped into memory. I think we need heuristics for both lower-bound and upper-bound merge decisions.

@codesome
Member Author

@brancz yes, there is another issue tracking the merging of small chunks #6332

@bboreham
Member

Is there a paper somewhere that details how 120 was obtained as the right figure?

@brancz
Member

brancz commented Feb 12, 2020

That would be the Gorilla paper (graphic on page 6): https://www.vldb.org/pvldb/vol8/p1816-teller.pdf

That said, while we know compression statistically becomes optimal there, I'm not sure 120 samples is exactly the right heuristic here, as I'd prefer to have a buffer rather than slip into likely suboptimal territory.

@hdost
Contributor

hdost commented Sep 29, 2020

So it looks like this is some sort of cursed issue. There have been multiple PRs, all of which have been closed. Is the consensus still that the samples should be split into separate chunks?

@codesome
Member Author

@hdost Consensus, yes. They were closed for different reasons :). More than splitting, it's about avoiding bigger chunks when merging (we can leave the already-bigger chunks alone; it's not worth breaking them down now given the additional complexity that adds).

With some refactoring that @bwplotka did, it should be easier to do now. I think the relevant code is in storage/merge.go (look for NewCompactingChunkSeriesMerger).
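
Not the actual implementation, but a rough sketch of the idea in that area: re-encode an oversized merged chunk into chunks of at most 120 samples, which is the kind of cap a compacting merger would enforce. The `rechunk` helper and `maxSamplesPerChunk` constant are hypothetical names, and the sketch assumes the pre-histogram chunkenc iterator API (`Next() bool`, `At() (int64, float64)`).

```go
// Sketch only: re-encode one (possibly oversized) merged chunk into
// chunks of at most 120 samples each.
package rechunk

import (
	"github.com/prometheus/prometheus/tsdb/chunkenc"
	"github.com/prometheus/prometheus/tsdb/chunks"
)

const maxSamplesPerChunk = 120

func rechunk(chk chunkenc.Chunk) ([]chunks.Meta, error) {
	var (
		out        []chunks.Meta
		cur        chunkenc.Chunk
		app        chunkenc.Appender
		n          int
		mint, maxt int64
		err        error
	)
	it := chk.Iterator(nil)
	for it.Next() {
		t, v := it.At()
		// Start a new chunk when none is open or the current one is full.
		if cur == nil || n == maxSamplesPerChunk {
			if cur != nil {
				out = append(out, chunks.Meta{MinTime: mint, MaxTime: maxt, Chunk: cur})
			}
			cur = chunkenc.NewXORChunk()
			if app, err = cur.Appender(); err != nil {
				return nil, err
			}
			n = 0
			mint = t
		}
		app.Append(t, v)
		maxt = t
		n++
	}
	if err := it.Err(); err != nil {
		return nil, err
	}
	if cur != nil {
		out = append(out, chunks.Meta{MinTime: mint, MaxTime: maxt, Chunk: cur})
	}
	return out, nil
}
```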

@hdost
Contributor

hdost commented Oct 7, 2020

So essentially at this point we just want to make sure we don't compact chunks such that they end up over 120 samples ✔️

@yeya24
Contributor

yeya24 commented May 18, 2021

Closed by #8582

@prometheus prometheus locked as resolved and limited conversation to collaborators Nov 15, 2021