Compactor gets stuck in an infinite loop when a block is broken #621
Comments
Yup, valid issue. There was also something related regarding partial blocks: #377
Hm, it looks like moving or renaming is not possible in S3. I think it will work if all components skip partially uploaded blocks, but I'm not sure.
@bwplotka, maybe you have a plan for how to fix this? Please let me know.
Hey, wow, quite a long time since the initial response, sorry for the delay. The way we want to solve this is specified here: https://github.com/improbable-eng/thanos/blob/master/docs/proposals/approved/201901-read-write-operations-bucket.md
... there is no timeline for the above, so we need a faster fix for partial blocks. The root cause of this issue is that the compactor crashed or restarted in the middle of an upload and did not have time to finish it. We need to handle this case.
Fix: #1053
Thanos, Prometheus and Golang version used
What happened
I tried to reproduce the situation where the compactor is shut down in the middle of uploading newly compacted blocks. The problem is that if a block is only partly uploaded, then when the compactor starts again it gets stuck while syncing metas. I think it should be possible to add a check: if a block is corrupted, remove it from the list of queryable blocks or, in this particular case, just skip it.
What you expected to happen
I expected broken blocks to be skipped, with an error or warning printed saying that one of the blocks is broken.
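To illustrate the skip behaviour I have in mind, here is a rough sketch in Go. It is not Thanos code: `splitBlocks` is a hypothetical helper, the flat `<blockID>/<file>` listing is a simplification of a bucket listing, and it assumes that a block counts as complete only when its `meta.json` object is present.

```go
package main

import (
	"sort"
	"strings"
)

// metaFile is the per-block metadata object. Assumption for this sketch:
// a block is treated as complete only when this object exists.
const metaFile = "meta.json"

// splitBlocks is a hypothetical helper, not part of Thanos. It takes a flat
// listing of object names shaped like "<blockID>/<file>" and returns the IDs
// of blocks that contain meta.json (complete) and of those that do not
// (partial). Both result slices are sorted for stable output.
func splitBlocks(objects []string) (complete, partial []string) {
	hasMeta := map[string]bool{}
	for _, name := range objects {
		id, file, ok := strings.Cut(name, "/")
		if !ok || id == "" {
			continue // object is not inside a block directory
		}
		if _, seen := hasMeta[id]; !seen {
			hasMeta[id] = false // remember the block even without meta.json
		}
		if file == metaFile {
			hasMeta[id] = true
		}
	}
	for id, ok := range hasMeta {
		if ok {
			complete = append(complete, id)
		} else {
			partial = append(partial, id)
		}
	}
	sort.Strings(complete)
	sort.Strings(partial)
	return complete, partial
}
```

A caller would carry on syncing with the `complete` IDs only and log a warning for each ID in `partial`, instead of retrying the sync forever.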
How to reproduce it (as minimally and precisely as possible):
Start the compactor, wait until it compacts and starts uploading, then kill the compactor in the middle of the upload.
Full logs to relevant components
Anything else we need to know