Restarting the application disables monitors with interval greater than application uptime #3504

alexklibisz · 2023-07-30T16:32:16Z

❗ ❗ For those just skimming, the solution was: Push monitors get reset when the uptime-kuma application reboots. So if you restart your application at some interval (e.g., for a backup), then it will disable any push monitors which have a greater interval. In my case, I had a 25 hour push monitor and I was restarting the application once every 24 hours for a backup. I just stopped restarting the application for the backup, and the push monitors work fine again.

⚠️ Please verify that this bug has NOT been raised before.

I checked and didn't find similar issue

🛡️ Security Policy

I agree to have read this project Security Policy

Description

I have a push monitor set to a 90000 second (25 hours) interval. I have a script that runs once/day and curls the push monitor URL upon successful completion. I disabled the script and noticed that I did not get any alert.

Here's the configuration:

I have the same setup for some monitors/scripts on 60 second intervals, and they all correctly trigger when the script does not run.

👟 Reproduction steps

Setup a push monitor w/ a 90000 second interval
Curl the monitor URL once
Never curl it again
The push monitor never goes red

👀 Expected behavior

If the push URL does not receive a request in 25 hours, it should trip the alert. In other words, a push monitor should behave the same regardless its interval.

😓 Actual Behavior

See description

🐻 Uptime-Kuma Version

1.22.1-debian

💻 Operating System and Arch

Ubuntu 22.04

🌐 Browser

Chronium 114.0.5735.198

🐋 Docker Version

23.0.5, build bc4487a

🟩 NodeJS Version

No response

📝 Relevant log output

No response

louislam · 2023-07-30T18:23:03Z

Set Retries to 0.

alexklibisz · 2023-07-30T18:36:42Z

Thanks, I'll try that. Could you explain why this is necessary?

louislam · 2023-07-30T18:50:21Z

With Retries=1, the monitor will not send a notification on the first failed check. It will check one more time to confirm that. In your case, it will send in 50 hours.

If you want to keep the retries logic, you can lower Heartbeat Retry Interval to maybe 60 seconds, so it won't take too long to retry.

alexklibisz · 2023-07-30T18:51:32Z

Hmm. The problem was that it just never sends the notification. Even if the script is down for weeks.

chakflying · 2023-07-30T19:15:48Z

I swear I have seen this before. After a bit (a lot) of digging I finally found this: #2801

I will setup a push monitor to try to test this myself.

louislam · 2023-07-30T19:43:18Z

I swear I have seen this before. After a bit (a lot) of digging I finally found this: #2801

I will setup a push monitor to try to test this myself.

Oh, it's the push monitor again, I feed this monitor type's implementation becomes over complicated somehow...

Change back to bug, did not realize the monitor type.

alexklibisz · 2023-08-01T15:14:19Z

Thanks @louislam . Just a bit of feedback: as a user, I have found the Retries and Heartbeat Retry Interval parameters confusing. I'm not sure what it means to "retry" a push monitor. The monitor is just waiting for an HTTP request. I can't think of anything that it could be retrying.

alexklibisz · 2023-08-01T15:26:45Z

I think I might have found one contributing factor to the issue. I normally run a backup of my uptime-kuma container that involves stopping and restarting the container. I disabled this backup, created a new 25-hour alert, curled it once, and then left it alone. I got an email alert this time. The only difference compared to my previous setup is that I did not re-start the container. @louislam is there perhaps anything in the push metric code that would be affected by a container restart?

chakflying · 2023-08-01T16:28:27Z

Wait then this may be more dumb than I thought... On server start, we schedule a task after your defined interval to check if the push route has been called.

If you set your interval to 25 hours, then restart the server every 24 hours, then obviously the task will never get to fire 🤦🏻‍♂️

alexklibisz · 2023-08-01T16:45:32Z

Got it, that makes sense. I don't know if it's particularly obvious, though. Many applications can gracefully handle restarts without affecting behavior. I had assumed this was the case for uptime-kuma.

chakflying · 2023-08-01T17:21:31Z

I agree, we should handle this better. I meant that as I expected this to be a week long debugging session involving tracking down platform-dependent or race condition issues. Turns out it's way more "obvious" than that.

I think on restart we can compute the remaining interval like I did in #3072, then schedule the next check base on that. But it would be a slight change from the current behavior as people who have their interval set at 60s would likely see their push monitor immediately go Down after restart.

Also currently retries do not persisting across restart, and that's more difficult to fix.

CommanderStorm · 2023-08-01T17:54:26Z

likely see their push monitor immediately go Down after restart.

Do you think we need to introduce a minimum time in this case?
(Could the improvement you are thinking about solve #454 as well?)

alexklibisz · 2023-08-05T15:42:04Z

This has been running well since I stopped doing the nightly reboot. Feel free to close this issue. I'll be excited to see a fix at some point, but no rush.

chakflying · 2023-08-09T11:33:11Z

You can also change the issue title to be more descriptive of what the actual problem is, maybe we can keep this open until a fix is available.

alexklibisz · 2023-08-09T13:20:48Z

You can also change the issue title to be more descriptive of what the actual problem is, maybe we can keep this open until a fix is available.

I changed the title and will close.

@louislam feel free to re-open if this is a good place to track the fix.

alexklibisz added the bug Something isn't working label Jul 30, 2023

louislam added help and removed bug Something isn't working labels Jul 30, 2023

louislam added bug Something isn't working and removed help labels Jul 30, 2023

alexklibisz changed the title ~~Push monitor w/ interval > 1 day does not seem to work~~ Restarting the application disables monitors with interval greater than application uptime Aug 9, 2023

alexklibisz closed this as completed Aug 9, 2023

CommanderStorm mentioned this issue Aug 12, 2023

uptime kuma don't send noification with a large attempts #3392

Closed

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Restarting the application disables monitors with interval greater than application uptime #3504

Restarting the application disables monitors with interval greater than application uptime #3504

alexklibisz commented Jul 30, 2023 •

edited

Loading

louislam commented Jul 30, 2023

alexklibisz commented Jul 30, 2023

louislam commented Jul 30, 2023

alexklibisz commented Jul 30, 2023 •

edited

Loading

chakflying commented Jul 30, 2023

louislam commented Jul 30, 2023 •

edited

Loading

alexklibisz commented Aug 1, 2023

alexklibisz commented Aug 1, 2023

chakflying commented Aug 1, 2023

alexklibisz commented Aug 1, 2023

chakflying commented Aug 1, 2023

CommanderStorm commented Aug 1, 2023 •

edited

Loading

alexklibisz commented Aug 5, 2023

chakflying commented Aug 9, 2023

alexklibisz commented Aug 9, 2023

Restarting the application disables monitors with interval greater than application uptime #3504

Restarting the application disables monitors with interval greater than application uptime #3504

Comments

alexklibisz commented Jul 30, 2023 • edited Loading

⚠️ Please verify that this bug has NOT been raised before.

🛡️ Security Policy

Description

👟 Reproduction steps

👀 Expected behavior

😓 Actual Behavior

🐻 Uptime-Kuma Version

💻 Operating System and Arch

🌐 Browser

🐋 Docker Version

🟩 NodeJS Version

📝 Relevant log output

louislam commented Jul 30, 2023

alexklibisz commented Jul 30, 2023

louislam commented Jul 30, 2023

alexklibisz commented Jul 30, 2023 • edited Loading

chakflying commented Jul 30, 2023

louislam commented Jul 30, 2023 • edited Loading

alexklibisz commented Aug 1, 2023

alexklibisz commented Aug 1, 2023

chakflying commented Aug 1, 2023

alexklibisz commented Aug 1, 2023

chakflying commented Aug 1, 2023

CommanderStorm commented Aug 1, 2023 • edited Loading

alexklibisz commented Aug 5, 2023

chakflying commented Aug 9, 2023

alexklibisz commented Aug 9, 2023

alexklibisz commented Jul 30, 2023 •

edited

Loading

alexklibisz commented Jul 30, 2023 •

edited

Loading

louislam commented Jul 30, 2023 •

edited

Loading

CommanderStorm commented Aug 1, 2023 •

edited

Loading