-
Notifications
You must be signed in to change notification settings - Fork 24.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Deadlock in FileSettingsService #92812
Comments
Pinging @elastic/es-core-infra (Team:Core/Infra) |
The deadlock is (on master thread) |
@DaveCTurner has worked out why this didn't happen in our test cases... The relevant code path is only hit when we set initial state timeout to zero (which we believe ECK has)1. The This is the patch to reproduce this in the integration tests:
Footnotes
|
The
FileSettingsService
blocks while waiting for some async calls to happen. On a newly-elected master, this blocking happens on the cluster applier thread, preventing the master service from completing its current publication:The trouble is, one of the things on which we're waiting is another cluster state update:
elasticsearch/server/src/main/java/org/elasticsearch/reservedstate/service/FileSettingsService.java
Line 425 in 8ae63c3
This update can never complete, because it needs the blocked applier thread to unblock, completing the current publication.
Relates elastic/cloud-on-k8s#6303
Workaround
Remove the settings file(s) and restart the master.
The text was updated successfully, but these errors were encountered: