-
Notifications
You must be signed in to change notification settings - Fork 50
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Behavior when tags configured is not available or propagated #28
Comments
@mesutcelik we added configurable With PR: #30 |
Can you please reference the PR here? |
Chaps, are you sure the original issue is resolved by this PR? |
I am reopening the issue. You basically say we need to retry if we see empty list here. It has to return at least itself in case of a single-member or first member case. |
Yes, correct. |
I think the detection of AWS tags are already propagated shouldn't be part of the responsibilities of the AWS Discovery. Tag is not propagated, so the instance is not discovered, which may result in the Split Brain, but the nodes will rediscover each other after some time. After merging #64, it will work as follows.
Split Brain is an issue, however, I think it's the correct behavior, nodes don't see each other, so they don't connect. Or am I missing something? Technically, we could check that the node detects itself and if it doesn't then wait some time and retry, however such behavior would cause other issues, because tags can be propagates with different delays, so we can end up with Split Brain anyway. |
@mesutcelik So you described two cases:
So IMO, the current behavior is fine. I'd just test that it's fine and (maybe) document it. Wdyt? |
The original issues created is hazelcast/hazelcast#10537 (comment) and the delay mechanism that is used by @dimas was probably because they didn't want to see temporary Split Brain. @dimas Can you please comment on this? |
Closing due to inactivity. |
How temporary is Temporary Split Brain? I am not sure cluster ever healed in our case but, again, it was years ago. |
We need to define a behavior if tags can't be retrieved for some period of time and fail fast at the end if tag can't be resolved.
see the issue --> hazelcast/hazelcast#10537
The text was updated successfully, but these errors were encountered: