Skip to content
This repository has been archived by the owner on Nov 15, 2023. It is now read-only.

Lots of alerts about task that has ended unexpectedly #3786

Closed
eskimor opened this issue Aug 30, 2021 · 5 comments · Fixed by paritytech/substrate#9882
Closed

Lots of alerts about task that has ended unexpectedly #3786

eskimor opened this issue Aug 30, 2021 · 5 comments · Fixed by paritytech/substrate#9882

Comments

@eskimor
Copy link
Member

eskimor commented Aug 30, 2021

On Kusama we get logs of alerts:

 Continuous task large-statement-fetcher on node kusama-validator-aws-ue1-1-int.aws.parity.io:9615 ended unexpectedly.

producing a lot of noise. I believe @tomaka mentioned that there is some heuristic to detect whether a task is supposed to stay running or is short lived. In this particular case the task is short lived (spawned to fetch some data, once it succeeds it will stop), so the heuristic seems to be wrong.

@eskimor eskimor changed the title Lot's of alerts about task that has ended unexpectedly Lots of alerts about task that has ended unexpectedly Aug 30, 2021
@gilescope
Copy link
Contributor

Has this recently changed or been a constant niggle? I'm just wondering if we suddenly have a lot more large statements than usual (i.e. after we have rolled out some code change)?

@bkchr
Copy link
Member

bkchr commented Sep 4, 2021

This has already been discussed multiple times, the problem is that the alerting rules live in the substrate repo: https://github.com/paritytech/substrate/blob/master/.maintain/monitoring/alerting-rules/alerting-rules.yaml

However, it doesn't make any sense because they are mainly polkadot specific. We should finally move them to polkadot and modify them properly.

@bkchr
Copy link
Member

bkchr commented Sep 4, 2021

But this is really not Substrate code related.

@bkchr bkchr transferred this issue from paritytech/substrate Sep 4, 2021
@tomaka
Copy link
Contributor

tomaka commented Sep 5, 2021

We could simply remove this alert altogether. As far as I can remember it has only once detected an actual issue with a task. All the other alert triggers have been false positives.

@eskimor
Copy link
Member Author

eskimor commented Sep 6, 2021

Has this recently changed or been a constant niggle? I'm just wondering if we suddenly have a lot more large statements than usual (i.e. after we have rolled out some code change)?

That is actually a real good question. We should mostly have large statements on parachain runtime upgrades (or on large messages).

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

4 participants