In scaled out environments: define a policy to report stopped heartbeats only when instances count goes below a threshold #3317

mauroservienti · 2022-12-13T07:12:04Z

Allow defining a policy that sets/configs some minimum instance count and only reports when the number of instances goes below it. Instead of reporting stopped heartbeats for every instance.

eirikurharaldsson · 2023-11-06T14:03:47Z

Is there any update on this?

mauroservienti · 2023-11-13T10:12:38Z

Thanks for chiming in, @eirikurharaldsson. At the moment, there isn't an ETA or an update. Being labeled as candidate-for-next-release, it might be picked up anytime soon.

eirikurharaldsson · 2024-09-19T13:09:28Z

Is this supported in ServiceControl 5?

eirikurharaldsson · 2024-09-19T13:48:25Z

Can i override the host identifier, so that the name is stable like dev1, dev2, dev3, test1, test2...
(make it unique per container stage (dev, test, prod).
https://docs.particular.net/nservicebus/hosting/override-hostid

I noticed this issue. Do you know if heartbeat plugin support override of the hostid?
Particular/NServiceBus#7026

awright18 · 2024-09-19T19:15:38Z

The latest version of the platform (ServicePulse 1.42.1 and ServiceControl 5.9) included this fix. Can you let us know if that helps in your scenario?

johnsimons · 2024-10-18T05:32:10Z

@mauroservienti I think we can close this one now that we handle scaled-out environments better, thoughts?

mauroservienti · 2024-10-23T10:04:37Z

I'm not sure about it, @johnsimons. This is more about defining a lower bound threshold. The feedback was more about "I don't want to be bothered by an alarm if my instances count is still greater than X."

johnsimons · 2024-10-24T01:37:26Z

@mauroservienti regarding "I don't want to be bothered by an alarm if my instances count is still greater than X."

So, the new functionality enables this scenario,

I don't want to be bothered by an alarm if my instances count is still greater than 1.

It does not allow customers to set a >1 lower bound, which we discussed in the internal RFC.
The decision was that it felt like we would be reinventing what K8s does out of the box, and it would also have the risk of possible maintenance burden on customers to ensure the replica number is kept in sync with what is in K8s.

So, given we now handle scaled-out environments better, I feel we can close this issue, and if this surfaces again, we can re-evaluate it at that time, thoughts?

johnsimons · 2024-10-28T00:06:40Z

Based on my previous comment, I am closing this one.

udidahan · 2024-11-03T10:38:56Z

@eirikurharaldsson if you feel that we missed something, please do let us know and re-open the issue. Thanks.

eirikurharaldsson · 2024-11-04T10:55:35Z

If this only raises an alarm if no instance of the same endpoint is running,
and does not leave dead replicas in inactive heartbeat endpoints, then it´s fine by us.
We will upgrade to the new version of ServiceControl asap.
Thanks.

mauroservienti added Feature candidate-for-next-release labels Dec 13, 2022

johnsimons removed the candidate-for-next-release label Oct 10, 2024

johnsimons closed this as completed Oct 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

In scaled out environments: define a policy to report stopped heartbeats only when instances count goes below a threshold #3317

In scaled out environments: define a policy to report stopped heartbeats only when instances count goes below a threshold #3317

mauroservienti commented Dec 13, 2022 •

edited

Loading

eirikurharaldsson commented Nov 6, 2023

mauroservienti commented Nov 13, 2023

eirikurharaldsson commented Sep 19, 2024

eirikurharaldsson commented Sep 19, 2024

awright18 commented Sep 19, 2024 •

edited

Loading

johnsimons commented Oct 18, 2024

mauroservienti commented Oct 23, 2024

johnsimons commented Oct 24, 2024

johnsimons commented Oct 28, 2024

udidahan commented Nov 3, 2024

eirikurharaldsson commented Nov 4, 2024

In scaled out environments: define a policy to report stopped heartbeats only when instances count goes below a threshold #3317

In scaled out environments: define a policy to report stopped heartbeats only when instances count goes below a threshold #3317

Comments

mauroservienti commented Dec 13, 2022 • edited Loading

eirikurharaldsson commented Nov 6, 2023

mauroservienti commented Nov 13, 2023

eirikurharaldsson commented Sep 19, 2024

eirikurharaldsson commented Sep 19, 2024

awright18 commented Sep 19, 2024 • edited Loading

johnsimons commented Oct 18, 2024

mauroservienti commented Oct 23, 2024

johnsimons commented Oct 24, 2024

johnsimons commented Oct 28, 2024

udidahan commented Nov 3, 2024

eirikurharaldsson commented Nov 4, 2024

mauroservienti commented Dec 13, 2022 •

edited

Loading

awright18 commented Sep 19, 2024 •

edited

Loading