-
Notifications
You must be signed in to change notification settings - Fork 448
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support differentiating between static and dynamic resource groups #5162
Comments
From what I can tell, the logic in This information is only shown on the Monitor. We could add a Monitor property that is a list of resource groups to ignore, then use that when calling |
Another way to approach this would be to remove the tablet servers ZooKeeper entry on a normal shutdown. I believe that the tserver did not do this in earlier (< 4.0) versions because the configuration when using cluster.yaml because the deployment was meant to be static. In fact, this is a larger issue when a users uses something like Kubernetes. It's possible that old server paths might linger for all the server processes. We may want to consider removing the server paths from ZooKeeper for all server types when performing a normal shutdown. |
In the ServiceLock.unlock code (below) in the case of Compactors, Scan Servers, and TabletServers We could modify accumulo/core/src/main/java/org/apache/accumulo/core/lock/ServiceLock.java Lines 531 to 558 in e5370d1
|
I created #5226 to suppress the display of dead servers for configured resource groups in the Monitor. I think this is the simplest approach at this point. My earlier comments about removing the lock in ZooKeeper may not fully resolve the issue you raise. |
Agreed, I think that #5226 is a good solution for this issue as it's not tied to the cluster.yaml definition and also covers the situation where the user might want to shut down tservers faster than the "graceful shutdown" process allows and would end up generating dead tserver locks. |
In the past accumulo has tracked dead tservers.
For resource groups that have dynamic scaling workloads, there should be an option to disable this tracking of "dead" servers.
We probably don't want to remove this functionality entirely because there might be a static group of tservers which could have issues and failures for those machines should be tracked.
The text was updated successfully, but these errors were encountered: