-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Thanos Receive can create overlapping blocks #5461
Comments
Hello 👋 Looks like there was no activity on this issue for the last two months. |
@fpetkovski are you seeing increase in persistent volume utilization as well for this particular receiver instance/pod? |
Unfortunately I don't have the data anymore, and we resolve the problem by enabling overlapping blocks. |
It looks like receive overlapping blocks is enbaled by default with prometheus 2.39+ dependencies: I think it would be worth to cleanup the flag now. BTW I am experiencing some compaction issues with blocks uploaded to buckets. And then, what can we do with this on compactor? is it a new issue I should open?
thank you, |
hum it looks like I have to ignore compaction for OOO blocks with flag introduced with #3442
|
Thanos Receiver can fail head compaction due to overlapping blocks. We have seen this happen in production and it caused an instance of Receiver to completely halt compaction and accumulate memory forever.
The root cause could be a race condition bug in TSDB itself: prometheus/prometheus#8055
It is possible to allow overlapping blocks in TSDB, but maybe this should be enabled by default since Thanos Query can deal with this situation. This option for enabling vertical blocks was added here: #3792
Cortex seems to solve the issue by preventing ingestion during head compaction: cortexproject/cortex#3422
Thanos version used: v0.26
What happened:
Receiver failed to compact the head. This graph shows that memory accumulation started at 01:00:00, exactly after compaction is triggered.
What you expected to happen:
Receiver successfully compacts the head and frees up memory.
How to reproduce it (as minimally and precisely as possible):
Haven't been able to reproduce it locally yet.
Full logs to relevant components:
The text was updated successfully, but these errors were encountered: