Replies: 2 comments 19 replies
-
Thanks for sharing this doc @xBis7 Do you have a jira for this? |
Beta Was this translation helpful? Give feedback.
-
@devmadhuu @sumitagrawl To sum up the doc. The current idea about cleaning up a missing container is There is a CLI command
Recon's periodic health task picks up that the unhealthy container no longer exists in the SCM and cleans up the tables. Health task doesn't see that the container is deleted but that it can't be found, it's lost entirely from the SCM and there is a mismatch. We can't update a container's state when the datanode that has the container isn't available but we can have an implementation for updating the state just in the SCM, instead of deleting the container. The background service idea is similar to Community syncWhat I got from the community meeting discussion is that before deleting the data from Recon we should move them to new tables so that the user can inspect them at a later point in time. This way the data will still be available and the UI will appear normal again. I also got that we might need to reconsider key deletion. Keys can spread across multiple containers and their remaining blocks might still be recoverable. |
Beta Was this translation helpful? Give feedback.
-
There is insufficient functionality in Ozone for cleaning up missing containers. Once a container is missing then it stays in the system forever.
I have been looking into providing a way to address this issue and I have written a document. If anyone is interested in this, please take a look at the doc and put comments on it. I'd appreciate any feedback before finalizing the implementation.
doc: Ozone Missing Containers Cleanup
Beta Was this translation helpful? Give feedback.
All reactions