Potential index corruption on exit #198
Comments
What version of the parity-db crate is used in the project? Is there a way to reproduce this?
This is basically equivalent to deleting the whole database.
The latest version, 0.4.6. TBH, I tested ParityDB on dozens of machines with no issues, so I don't know how these users ran into the problem.
The indexes don't seem very big, only ~2G; the total DB is 47G. One of the reporters gave me his bad indexes. I'm not sure it will help, but could you help examine them? https://storage.googleapis.com/phala-misc/phala-node-db-index.zip
they're using
The binary is based on Phala-Network/khala-parachain#263. I have already tested ParityDB for at least 6 months on dozens of machines with no issues, so I'm not sure it can be reproduced reliably...
It's hard to say what's going on from looking at screenshots.
So the fix is paritytech/cumulus#2461. I'll try to backport it and let users retry; if there are no further issues, I'll close this issue.
We (maintainers of parity-db) are not aware of any serious issues. We would appreciate a bug report with logs and samples of broken databases. This particular issue seems to be caused by some inefficiencies in cumulus.
I just backported paritytech/cumulus#2461 and can confirm the node is no longer stuck on boot. Sorry for wasting your time, and thank you for your patience!
I'm promoting a switch to ParityDB to our user (Phala), and I then got 3 separate reports that after a reboot (a normal exit then restart, I believe) the node is stuck on "Is collating: No". According to htop there is no IO and one CPU core is at 100%; enabling RUST_LOG="DEBUG" gives no extra info.
A reporter tried deleting all index_ files; it seems those indexes were rebuilt, and the node is back to normal.
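For anyone collecting data for a similar report, here is a minimal, read-only sketch that measures how much of a database directory is taken up by index_ files, without deleting anything. It assumes the index files sit directly in the database directory and that their names start with index_, as described above. The function names are made up for illustration and are not part of parity-db.

```python
from pathlib import Path


def index_file_sizes(db_dir):
    """Return {file name: size in bytes} for files named index_* directly in db_dir.

    Assumption: index files live at the top level of the database directory
    and share the "index_" prefix mentioned in the report.
    """
    return {
        p.name: p.stat().st_size
        for p in sorted(Path(db_dir).iterdir())
        if p.is_file() and p.name.startswith("index_")
    }


def summarize(db_dir):
    """Return (index bytes, total bytes) over the regular files directly in db_dir."""
    index_bytes = sum(index_file_sizes(db_dir).values())
    total_bytes = sum(
        p.stat().st_size for p in Path(db_dir).iterdir() if p.is_file()
    )
    return index_bytes, total_bytes
```

Calling summarize on the node's database directory returns the two byte counts, which can be attached to a bug report together with logs and a sample of the affected files.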