
Probe Statefulset Pods until healthy upon scale up #71

Merged · 1 commit merged into master from scale-up on Mar 26, 2021
Conversation

@metalmatze (Contributor)

This is crucial so that we don't send requests to receivers too early.
In combination with thanos-io/thanos#3845 for quicker reloading of Thanos Receive routes, I was able to achieve >99% availability in errors and latency while scaling up from 3 replicas to 20.

[benchmark results chart: 2-receiver-route-improve-controller]
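For readers coming to this later, here is a minimal, hypothetical sketch of the idea behind this PR, not the controller's actual code: when a hashring's StatefulSet scales up, probe each newly created pod until it reports Ready before regenerating the hashring ConfigMap. The function names, client-go wiring, and the 10s/5m poll timings below are assumptions.

```go
// Hypothetical sketch of "probe new StatefulSet pods until Ready on scale-up".
package receivewait

import (
	"context"
	"fmt"
	"time"

	corev1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/apimachinery/pkg/util/wait"
	"k8s.io/client-go/kubernetes"
)

// waitForReady polls a single pod until its PodReady condition is True.
func waitForReady(ctx context.Context, client kubernetes.Interface, namespace, name string) error {
	return wait.PollUntilContextTimeout(ctx, 10*time.Second, 5*time.Minute, true,
		func(ctx context.Context) (bool, error) {
			pod, err := client.CoreV1().Pods(namespace).Get(ctx, name, metav1.GetOptions{})
			if err != nil {
				return false, nil // treat transient errors as "not ready yet"
			}
			for _, cond := range pod.Status.Conditions {
				if cond.Type == corev1.PodReady {
					return cond.Status == corev1.ConditionTrue, nil
				}
			}
			return false, nil
		})
}

// waitForScaleUp probes every newly created pod of a StatefulSet, e.g.
// thanos-receive-3 through thanos-receive-19 when scaling from 3 to 20,
// and only returns once all of them are Ready (or the timeout is hit).
func waitForScaleUp(ctx context.Context, client kubernetes.Interface, namespace, sts string, oldReplicas, newReplicas int32) error {
	for i := oldReplicas; i < newReplicas; i++ {
		podName := fmt.Sprintf("%s-%d", sts, i)
		if err := waitForReady(ctx, client, namespace, podName); err != nil {
			return fmt.Errorf("pod %s never became ready: %w", podName, err)
		}
	}
	return nil
}
```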

@metalmatze metalmatze requested review from squat and kakkoyun and removed request for squat March 23, 2021 16:02
@metalmatze (Contributor, Author)

Looking into linting failures.

@kakkoyun (Member) left a comment:

Awesome 💯 LGTM. We just need to make CI green, I guess.

level.Info(logger).Log("msg", "caught interrupt")
close(sig)
})
g.Add(run.SignalHandler(context.Background(), os.Interrupt, syscall.SIGTERM))
Member: 👍
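For context, the hunk above swaps a hand-rolled interrupt channel for oklog/run's SignalHandler, which returns an execute/interrupt actor pair for the run.Group. A small self-contained sketch of that pattern follows; the placeholder main loop is an assumption, not the controller's actual actor.

```go
package main

import (
	"context"
	"fmt"
	"os"
	"syscall"

	"github.com/oklog/run"
)

func main() {
	var g run.Group

	// Actor 1: unblock the group when SIGINT or SIGTERM arrives.
	g.Add(run.SignalHandler(context.Background(), os.Interrupt, syscall.SIGTERM))

	// Actor 2: a stand-in for the controller's reconcile loop.
	ctx, cancel := context.WithCancel(context.Background())
	g.Add(func() error {
		<-ctx.Done()
		return nil
	}, func(error) {
		cancel()
	})

	// Run blocks until the first actor returns, then interrupts the rest.
	if err := g.Run(); err != nil {
		fmt.Fprintln(os.Stderr, err)
	}
}
```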

@jmichalek132 (Contributor)

One question about this: why not just look at the Endpoints of a Service pointing to the thanos-receive StatefulSet instead?

@metalmatze (Contributor, Author) commented Mar 23, 2021

> One question about this: why not just look at the Endpoints of a Service pointing to the thanos-receive StatefulSet instead?

Hm. Interesting. That might just work the same, yeah. I feel like this current approach gives us a little bit more control though. WDYT?

@jmichalek132 (Contributor)

> One question about this: why not just look at the Endpoints of a Service pointing to the thanos-receive StatefulSet instead?
>
> Hm. Interesting. That might just work the same, yeah. I feel like this approach gives us a little bit more control though. WDYT?

The behavior would be slightly different, especially if you were to also remove the instance from the ConfigMap once it's removed from the Endpoints. But it would also respect other settings such as failureThreshold, periodSeconds, etc.
Honestly, I would prefer that, but I'm not sure what consequences it would have on Thanos Receive's behavior.
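To make the alternative concrete, here is a hedged sketch (an assumption, not code from this repo) of reading ready addresses from the Endpoints of a Service in front of the thanos-receive StatefulSet. The Endpoints controller already honours the readiness probe's failureThreshold and periodSeconds, so only Ready pods appear under Addresses. The function and parameter names are illustrative.

```go
// Hypothetical sketch of the Endpoints-based alternative discussed above.
package receiveendpoints

import (
	"context"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
)

// readyReceiveAddresses returns the IPs of thanos-receive pods that currently
// pass their readiness probes, as reported in the Service's Endpoints object.
// Pods failing their probe are listed under NotReadyAddresses and are skipped.
func readyReceiveAddresses(ctx context.Context, client kubernetes.Interface, namespace, service string) ([]string, error) {
	eps, err := client.CoreV1().Endpoints(namespace).Get(ctx, service, metav1.GetOptions{})
	if err != nil {
		return nil, err
	}
	var addrs []string
	for _, subset := range eps.Subsets {
		for _, addr := range subset.Addresses { // Ready addresses only
			addrs = append(addrs, addr.IP)
		}
	}
	return addrs, nil
}
```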

@metalmatze (Contributor, Author)

I'll think about it a bit more.
If you were to hack a bit on this code to make it somewhat work, I'm happy to run it with the benchmark suite again so we get some numbers to compare 👍

@metalmatze (Contributor, Author)

Seems like this fails now because the current master is broken too.

@jmichalek132 (Contributor) commented Mar 23, 2021

I will find some time for it over the weekend. Is that okay?

@spaparaju commented Mar 24, 2021

[screenshot: test result showing the hashring out of sync with the replicas in Ready status]

For testing this PR (the current tests run against a fake Kubernetes client), I maxed out cluster resources by scaling up thanos-receive pods (on minikube, each thanos-receive pod starts with 512 MB of memory). Under these maxed-out conditions, the thanos-receive pods never reach the indicated .spec.replicas, and the Obs. hashring would need to reflect only the URLs of the replicas in Ready status. The screenshot above shows the result of this test, where the hashring is out of sync with the replicas in Ready status.

@metalmatze (Contributor, Author)

I think you're talking about a slightly different problem, more of an addition to this PR, which we should be able to handle in another PR building on top of this one. What do you think?

@@ -502,7 +507,26 @@ func (c *controller) sync() {
continue
}

// If there's an increase in replicas we poll for the new replicas to be ready
if _, ok := c.replicas[hashring]; ok && c.replicas[hashring] < *sts.Spec.Replicas {
Contributor:

For this type of thing to be truly safe, we'll eventually need to implement leader election to avoid multiple controllers reconciling this state at the same time.

Contributor (Author):

I'm not aware that anyone has tried running multiple thanos-receive-controllers yet. If we are going to support that, then yes, that should be improved. :)

Member:

Agreed with Matthias. Right now, it's definitely assumed that the controller is only one replica. Even without this change, we would really need to implement some coordination to enable more replicas to have truly predictable results.

Contributor:

While unlikely, the case I worry about more is a rollout, eviction, preemption, or some other reason why there may temporarily be two controllers. As I said, eventually this would be good, but it's not necessary in this PR.
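Purely to illustrate the leader-election idea discussed in this thread (not part of this PR), here is a sketch using client-go's leaderelection package with a Lease lock, so that only one controller replica reconciles at a time. The lock name, namespace, identity, and timings are assumptions.

```go
// Hypothetical sketch of leader election for the controller.
package leaderelect

import (
	"context"
	"time"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/tools/leaderelection"
	"k8s.io/client-go/tools/leaderelection/resourcelock"
)

// runWithLeaderElection starts the given reconcile loop only on the replica
// that currently holds the Lease, and stops it if leadership is lost.
func runWithLeaderElection(ctx context.Context, client kubernetes.Interface, identity string, run func(context.Context)) {
	lock := &resourcelock.LeaseLock{
		LeaseMeta: metav1.ObjectMeta{
			Name:      "thanos-receive-controller", // illustrative lock name
			Namespace: "thanos",                    // illustrative namespace
		},
		Client: client.CoordinationV1(),
		LockConfig: resourcelock.ResourceLockConfig{
			Identity: identity, // e.g. the pod name
		},
	}

	leaderelection.RunOrDie(ctx, leaderelection.LeaderElectionConfig{
		Lock:            lock,
		ReleaseOnCancel: true,
		LeaseDuration:   15 * time.Second,
		RenewDeadline:   10 * time.Second,
		RetryPeriod:     2 * time.Second,
		Callbacks: leaderelection.LeaderCallbacks{
			OnStartedLeading: run, // start reconciling only once we are the leader
			OnStoppedLeading: func() {
				// Stop reconciling; another replica may take over.
			},
		},
	})
}
```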

Signed-off-by: Matthias Loibl <mail@matthiasloibl.com>
@metalmatze (Contributor, Author)

I'd be happy to go forward with this and take care of other improvements in future PRs.

@brancz (Contributor) left a comment:

LGTM 🎉 👍 but it would be good to have another pair of eyes.

@kakkoyun (Member) left a comment:

LGTM 🥇

@kakkoyun kakkoyun merged commit 7770963 into master Mar 26, 2021
@metalmatze metalmatze deleted the scale-up branch March 30, 2021 12:02
@christopherzli (Contributor)

Just curious: this would not work when multiple share the same hashring, right?


if err := c.waitForPod(podName); err != nil {
level.Warn(c.logger).Log("msg", "failed polling until pod is ready", "pod", podName, "duration", time.Since(start), "err", err)
continue
Contributor:

Why do we continue here if polling failed? Shouldn't we return the error instead?
