-
Notifications
You must be signed in to change notification settings - Fork 1.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG][Segment Replication] testClusterGreenAfterPartialRelocationNoPreferenceShardMovementPrimaryFirstEnabled test suite timeout #9178
Comments
The timeout is being caused in this test due to the latch countdown not being released. In the case of this case, the deprecated setting In this case, we respect the However, the latch countdown never occurs since some shard is prematurely being closed when it's expected to be in started state on the new zone. This causes a timeout waiting for the latch and a thread leak in further tests. |
When the applyClusterState is called on a clusterChangedEvent, it calls removeShards to remove any local shards that doesn't match what the cluster-manager expects. |
Issue: Throttling of primaries wasn't being recognized in the case of |
Describe the bug
org.opensearch.cluster.routing.ShardMovementStrategyTests.testClusterGreenAfterPartialRelocationNoPreferenceShardMovementPrimaryFirstEnabled is occasionally running into a test suite timeout.
This is also causing a related failure of a thread leak in this test class:
To Reproduce
Steps to reproduce the behavior:
Expected behavior
Test should pass within timeout.
Plugins
Please list all plugins currently enabled.
Screenshots
If applicable, add screenshots to help explain your problem.
Host/Environment (please complete the following information):
Additional context
Add any other context about the problem here.
The text was updated successfully, but these errors were encountered: