Behavior when tags configured is not available or propagated #28

mesutcelik · 2017-05-29T21:22:15Z

We need to define a behavior if tags can't be retrieved for some period of time and fail fast at the end if tag can't be resolved.

see the issue --> hazelcast/hazelcast#10537

ghost · 2017-07-20T12:01:43Z

@mesutcelik we added configurable timeout for queries. I believe we can close this issue.

With PR: #30

mesutcelik · 2017-07-20T18:37:29Z

Can you please reference the PR here?

emrahkocaman · 2017-07-24T20:01:05Z

Closing this issue, resolved by #30. (This line to be specific.)

dimas · 2018-02-04T01:59:35Z

Chaps, are you sure the original issue is resolved by this PR?
This comment hazelcast/hazelcast#10537 (comment) explains how I see it - the important bit is not what timeout you have for AWS HTTP requests but how do you make sure if AWS response is good to use. (If the node cannot see itself in the response, it is probably just too early to trust that response - tags has not propagated yet...)

mesutcelik · 2018-02-06T17:46:35Z

I am reopening the issue. You basically say we need to retry if we see empty list here. It has to return at least itself in case of a single-member or first member case.

dimas · 2018-02-09T09:31:03Z

Yes, correct.
Thank you.
In our custom scripts we run before starting Hazelcast we do do not check that node can find itself, we only check if list is empty or not. Even if the node cannot find itself it is probably ok as long as it sees another one - it will try to cluster with it.

leszko · 2018-05-22T02:22:14Z

I think the detection of AWS tags are already propagated shouldn't be part of the responsibilities of the AWS Discovery. Tag is not propagated, so the instance is not discovered, which may result in the Split Brain, but the nodes will rediscover each other after some time.

After merging #64, it will work as follows.

Set tag on instances
Start Hazelcast members
Tags are no propagated yet, so members won't discover each other - each member will create its own cluster (Split Brain)
Tags are propagated
Hazelcast member will rediscover each other and create one cluster

Split Brain is an issue, however, I think it's the correct behavior, nodes don't see each other, so they don't connect. Or am I missing something?

Technically, we could check that the node detects itself and if it doesn't then wait some time and retry, however such behavior would cause other issues, because tags can be propagates with different delays, so we can end up with Split Brain anyway.

leszko · 2018-05-30T15:12:18Z

Moving the discussion started in #64 . Last comment from Mesut: "empty means to me either tags are not propagated as described in #28 or there is no tag configured although they are defined in hazelcast config.
It is good idea to move the discussion to #28"

leszko · 2018-05-30T15:17:10Z

@mesutcelik So you described two cases:

Tags are (yet) not propagated => the nodes will start separately (Split Brain) and rejoin when the tags are propagated (I think it's the correct behavior).
No tag is configured, but they are defined in hazelcast config (misconfiguration) - it would be nice to give a user a meaningful message how to fix it, however I don't think it's possible to distinguish between it and point 1.

So IMO, the current behavior is fine. I'd just test that it's fine and (maybe) document it. Wdyt?

mesutcelik · 2018-05-30T16:36:05Z

The original issues created is hazelcast/hazelcast#10537 (comment) and the delay mechanism that is used by @dimas was probably because they didn't want to see temporary Split Brain.

@dimas Can you please comment on this?

leszko · 2018-07-09T13:34:55Z

Closing due to inactivity.

dimas · 2018-07-11T14:27:49Z

How temporary is Temporary Split Brain? I am not sure cluster ever healed in our case but, again, it was years ago.

leszko · 2018-07-12T07:27:28Z

@dimas , I think in your case it didn't retried, but it's fixed now #71.

The possible "Temporary Split Brain" should take up to a few minutes after tags are propagated (I tried and in my case it took 4 min).

mesutcelik mentioned this issue May 29, 2017

Delaying AWS discovery until tags propagate hazelcast/hazelcast#10537

Closed

emrahkocaman added this to the 2.0.2 milestone Jul 24, 2017

emrahkocaman closed this as completed Jul 24, 2017

mesutcelik reopened this Feb 6, 2018

mesutcelik mentioned this issue Apr 9, 2018

AwsDiscoveryStrategy.discoverNodes throws exception causes node to shutdown immediately #64

Closed

degerhz modified the milestones: 2.1, 2.2 Apr 17, 2018

leszko added Type: Enhancement Estimation: M labels Apr 30, 2018

dbrimley added the Priority: High label May 3, 2018

leszko mentioned this issue May 22, 2018

Issue 64 #71

Merged

leszko closed this as completed Jul 9, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Behavior when tags configured is not available or propagated #28

Behavior when tags configured is not available or propagated #28

mesutcelik commented May 29, 2017

ghost commented Jul 20, 2017 •

edited by ghost

Loading

mesutcelik commented Jul 20, 2017

emrahkocaman commented Jul 24, 2017

dimas commented Feb 4, 2018 •

edited

Loading

mesutcelik commented Feb 6, 2018

dimas commented Feb 9, 2018

leszko commented May 22, 2018 •

edited

Loading

leszko commented May 30, 2018

leszko commented May 30, 2018 •

edited

Loading

mesutcelik commented May 30, 2018

leszko commented Jul 9, 2018

dimas commented Jul 11, 2018

leszko commented Jul 12, 2018

Behavior when tags configured is not available or propagated #28

Behavior when tags configured is not available or propagated #28

Comments

mesutcelik commented May 29, 2017

ghost commented Jul 20, 2017 • edited by ghost Loading

mesutcelik commented Jul 20, 2017

emrahkocaman commented Jul 24, 2017

dimas commented Feb 4, 2018 • edited Loading

mesutcelik commented Feb 6, 2018

dimas commented Feb 9, 2018

leszko commented May 22, 2018 • edited Loading

leszko commented May 30, 2018

leszko commented May 30, 2018 • edited Loading

mesutcelik commented May 30, 2018

leszko commented Jul 9, 2018

dimas commented Jul 11, 2018

leszko commented Jul 12, 2018

ghost commented Jul 20, 2017 •

edited by ghost

Loading

dimas commented Feb 4, 2018 •

edited

Loading

leszko commented May 22, 2018 •

edited

Loading

leszko commented May 30, 2018 •

edited

Loading