Short-circuit chunk dedup in proxy #5816

Merged 1 commit on Oct 25, 2022

Conversation

fpetkovski
Contributor

@fpetkovski fpetkovski commented Oct 25, 2022

The store proxy deduplicates series by calculating a hash of each aggregate chunk. If at least one hash is not present in the dedup-map, the entire series chunk is added to the response, including all of its aggregate chunks.

Because of this, we can short-circuit chunk deduplication and stop early as soon as any aggregate chunk has not been seen before.
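
As a rough illustration of the idea (not the actual proxy code), the short-circuit could look something like the sketch below. The `seriesChunk` type, `hashBytes`, `isUnseen`, and `dedupChunks` are hypothetical names made up for this example, and FNV-1a is only a stand-in hash. Recording every aggregate hash of a kept chunk is also a choice of this sketch, so that exact duplicates are dropped later.

```go
package main

import (
	"fmt"
	"hash/fnv"
)

// seriesChunk is a hypothetical stand-in for a series chunk that carries
// several aggregate chunks covering the same time range.
type seriesChunk struct {
	Aggregates [][]byte
}

// hashBytes hashes one aggregate chunk. FNV-1a is an assumption of this
// sketch; any stable 64-bit hash would do.
func hashBytes(b []byte) uint64 {
	h := fnv.New64a()
	h.Write(b)
	return h.Sum64()
}

// isUnseen reports whether at least one aggregate of c has a hash that is
// not in seen. It returns on the first such aggregate, which is the
// short-circuit: the remaining aggregates are not looked up.
// A chunk with no aggregates is reported as already seen.
func isUnseen(c seriesChunk, seen map[uint64]struct{}) bool {
	for _, aggr := range c.Aggregates {
		if _, ok := seen[hashBytes(aggr)]; !ok {
			return true
		}
	}
	return false
}

// dedupChunks keeps a chunk, with all of its aggregates, when isUnseen
// reports true, and then records the hashes of all its aggregates.
// Aggregates are hashed a second time here, which keeps the sketch short.
func dedupChunks(chunks []seriesChunk) []seriesChunk {
	seen := make(map[uint64]struct{})
	out := make([]seriesChunk, 0, len(chunks))
	for _, c := range chunks {
		if !isUnseen(c, seen) {
			continue
		}
		out = append(out, c)
		for _, aggr := range c.Aggregates {
			seen[hashBytes(aggr)] = struct{}{}
		}
	}
	return out
}

func main() {
	a := seriesChunk{Aggregates: [][]byte{[]byte("raw-1"), []byte("count-1")}}
	b := seriesChunk{Aggregates: [][]byte{[]byte("raw-1"), []byte("count-1")}} // exact duplicate of a
	c := seriesChunk{Aggregates: [][]byte{[]byte("raw-1"), []byte("count-2")}} // one new aggregate

	kept := dedupChunks([]seriesChunk{a, b, c})
	fmt.Println(len(kept)) // 2: a and c are kept, b is dropped
}
```

In this example `c` is kept in full even though its first aggregate was already seen, because its second aggregate is new.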

Signed-off-by: Filip Petkovski <filip.petkovsky@gmail.com>

  • I added CHANGELOG entry for this change.
  • Change is not relevant to the end user.

Changes

Verification

@fpetkovski fpetkovski requested a review from GiedriusS October 25, 2022 07:31
@fpetkovski fpetkovski force-pushed the short-circuit-chunk-dedup branch from cc4e607 to dd263fe Compare October 25, 2022 09:15
GiedriusS
GiedriusS previously approved these changes Oct 25, 2022
Member

@GiedriusS GiedriusS left a comment


👍 nice cleanup, need to rebase against newest main

@fpetkovski
Contributor Author

Should be green now :)

@matej-g matej-g merged commit fec2b8c into thanos-io:main Oct 25, 2022
GiedriusS pushed a commit to vinted/thanos that referenced this pull request Oct 31, 2022
ngraham20 pushed a commit to ngraham20/thanos that referenced this pull request May 18, 2023
3 participants