Monitor showing offline but its online #1848

Closed
2 tasks done
Kaseax opened this issue Jun 28, 2022 · 40 comments

Comments

@Kaseax

Kaseax commented Jun 28, 2022

⚠️ Please verify that this bug has NOT been raised before.

  • I checked and didn't find similar issue

🛡️ Security Policy

📝 Describe your problem

All of my monitors except one are showing as offline, although I can reach the websites normally.
Can someone help me with that?

One monitor shows "error request aborted".

The other two show "timeout of 48000ms exceeded".

🐻 Uptime-Kuma Version

1.16.1

💻 Operating System and Arch

Debian 10 x86_64

🌐 Browser

Microsoft Edge 103.0.1264.37

🐋 Docker Version

20.10.17

🟩 NodeJS Version

No response

Kaseax added the help label Jun 28, 2022
@GVALFER

GVALFER commented Jun 28, 2022

Check if you can ping it directly from your server via ssh

@Kaseax
Author

Kaseax commented Jun 28, 2022

You mean with the ping command? If so, then yes: I can ping every server.

@dariusdumitru

I get the exact same problem every morning at 5-6 AM for an hour. It says that it's down, but it's not, with the same error "timeout of 48000ms exceeded". Usually 2 websites are affected, but sometimes only 1 or 3. I've increased the Heartbeat Interval and Retries, but no luck. Can anyone help?

@d-delaey

We got the same thing last night for maybe 40-50 Websites.

@Kaseax
Author

Kaseax commented Jun 29, 2022

For me it happens not only overnight but permanently.

@Kaseax
Author

Kaseax commented Jul 1, 2022

Now my Discord webhook sometimes reports that the services are online, but then it says they are offline.

@Nuklon

Nuklon commented Jul 5, 2022

Same issue here, see: #1621

@kfra71

kfra71 commented Aug 2, 2022

Same issue here: suddenly we are getting "timeout of 48000ms exceeded" on most of the checks.

Does anyone have a clue?

@louislam
Owner

louislam commented Aug 2, 2022

This may help:
https://github.com/louislam/uptime-kuma/wiki/Troubleshooting

@kfra71

kfra71 commented Aug 2, 2022 via email

@Nuklon

Nuklon commented Aug 2, 2022

Yeah, that works for me as well. It's really that Uptime Kuma gets bogged down, and it happens more frequently when the container is running for a longer period of time. I've not found anything useful in the logs either, it just times out.

@Nuklon

Nuklon commented Aug 2, 2022

30s monitor (both monitors have the same settings and site, except for the interval):
image

1m monitor:
image

@guydeschepper

Same issue here.
When I try to ping the URL from within the Docker container, as suggested here, I also get a timeout.

BUT... the same website can be reached and pinged from other systems AND I can still ping other monitored websites from within the Uptime Kuma docker container, so it's not a general networking issue with the container / server / hosting.

So it seems something is getting bogged down in Uptime Kuma itself.

Note
Most of our monitored websites are behind Cloudflare, but some are not.
And until now I only experienced the issue with websites that are not behind Cloudflare.
Though not all those "non-cloudflare" websites are affected at the same time.
At the moment 2 of the non-cloudflare websites are affected, but several weeks ago we had the same issue with one of those 2 websites (and another one), but not the other.

We also see different errors prior to the timeout issues:

140059089770432:error:1408F119:SSL routines:ssl3_get_record:decryption failed or bad record mac:../deps/openssl/openssl/ssl/record/ssl3_record.c:677

and

getaddrinfo EAI_AGAIN <hostname>
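
`EAI_AGAIN` is the resolver's "temporary failure in name resolution" code, so one thing worth ruling out is intermittent DNS resolution from inside the container. A minimal sketch, assuming Python 3 is available wherever it runs; `probe_dns` and its parameters are hypothetical names for illustration, not part of Uptime Kuma:

```python
import socket
import time


def probe_dns(host, attempts=5, delay=1.0, resolver=socket.getaddrinfo):
    """Resolve `host` repeatedly and tally successes and failures by kind."""
    tally = {"ok": 0, "temporary_failure": 0, "other_failure": 0}
    for attempt in range(attempts):
        try:
            resolver(host, 443)
            tally["ok"] += 1
        except socket.gaierror as exc:
            # EAI_AGAIN is the code behind "getaddrinfo EAI_AGAIN <hostname>".
            if exc.errno == socket.EAI_AGAIN:
                tally["temporary_failure"] += 1
            else:
                tally["other_failure"] += 1
        # Pause between attempts, but not after the last one.
        if attempt < attempts - 1:
            time.sleep(delay)
    return tally
```

Calling `probe_dns("<hostname>", attempts=20)` with an affected hostname while the monitor shows as down would indicate whether name resolution fails only some of the time. A non-zero `temporary_failure` count points at the resolver rather than at the monitored site.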

@d-delaey

Hey, we "fixed" that issue by simply not using Docker. We still get the timeout error sometimes, but not as often as with Docker.

@guydeschepper

@d-delaey thanks for the suggestion, but we're running Uptime Kuma as part of a dockerized environment, so not using docker is not an option for us in this case.

@github-actions

We are clearing up our old issues and your ticket has been open for 3 months with no activity. Remove stale label or comment or this will be closed in 2 days.

github-actions bot added the Stale label Dec 25, 2022
@guydeschepper

Please don't close this issue as it is still not fixed!

github-actions bot removed the Stale label Dec 25, 2022
@louislam
Owner

Make sure you are not using an Alpine-based image.

For the SSL error, please double-check http:// vs https://

@guydeschepper

> Make sure you are not using an Alpine-based image.

I'm not, since I'm running Uptime Kuma with Cloudron, which uses Ubuntu-based Docker images.

> For the SSL error, please double-check http:// vs https://

All sites in Uptime Kuma are configured with https, so there shouldn't be any reason for Uptime Kuma to try to access the sites over http.

@mrafols

mrafols commented Jan 24, 2023

Still happening for me.
image

@d-delaey

d-delaey commented Jan 24, 2023

Yes, it's still happening to me as well.

@xeruf

xeruf commented Feb 1, 2023

Getting this error when trying to send a webhook, as in #1697, running via pm2

@webysther

It started happening to me after updating the Docker image.

@webysther

> This may help: https://github.com/louislam/uptime-kuma/wiki/Troubleshooting

curl is not installed in the Docker container.

@webysther

webysther commented Feb 1, 2023

image

This is after stopping Docker, pulling the new image, and starting Docker again.

@webysther

image

@webysther

image

@webysther

ping and curl work from inside the container, but nothing works from Uptime Kuma itself.

@alwynispat

alwynispat commented Mar 19, 2023

Same here, using the latest image.
https://github.com/louislam/uptime-kuma/wiki/Troubleshooting
Following the troubleshooting guide above, the container was able to curl and ping the URLs; however, the URLs still appear as down in Kuma.
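
To compare a manual check against what the monitor reports, it can help to time the same request under the same limit. A minimal sketch, assuming Python 3 is available in the environment being tested; `check_url` is a hypothetical helper, and the 48-second default is taken only from the "timeout of 48000ms exceeded" message in this thread, not from Uptime Kuma's source:

```python
import time
import urllib.error
import urllib.request


def check_url(url, timeout=48.0, opener=urllib.request.urlopen):
    """Fetch `url` once and return (ok, detail, elapsed_seconds)."""
    started = time.monotonic()
    try:
        with opener(url, timeout=timeout) as response:
            status = response.status
        ok, detail = 200 <= status < 400, f"HTTP {status}"
    except urllib.error.HTTPError as exc:
        # The server answered, but with an error status (4xx/5xx).
        ok, detail = False, f"HTTP {exc.code}"
    except OSError as exc:
        # Covers DNS failures, refused connections, TLS errors and timeouts.
        ok, detail = False, f"{type(exc).__name__}: {exc}"
    return ok, detail, time.monotonic() - started
```

If this succeeds quickly from inside the container while the monitor for the same URL still times out, that supports the reports above that the problem is not general connectivity. The `opener` parameter exists only so the function can be exercised without network access.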

@m-primo

m-primo commented Mar 22, 2023

I have had the same issue for months. At first I thought the issue came from my VPS provider, but after big fights with multiple VPS providers, I found out it only happens with Uptime Kuma.

I think I'll just look for another tool and apologize to the VPS providers!

@XtremeOwnageDotCom

Just a ping: it's still a problem.

image

image

image

But the container logs do tell me to report it, so here it is.

image

Deleting and recreating the monitor did nothing to help.

However, after deleting the pod and letting it be recreated, the monitor did start working.

@ofifoto

ofifoto commented May 1, 2023

related: #2346

@github-actions

We are clearing up our old issues and your ticket has been open for 3 months with no activity. Remove stale label or comment or this will be closed in 2 days.

github-actions bot added the Stale label Jul 31, 2023
@ofifoto

ofifoto commented Jul 31, 2023

stalebot, really?

@CommanderStorm
Collaborator

CommanderStorm commented Jul 31, 2023

Stalebot is active on all issues using the help template.
Closing stale help requests is by design: often people don't answer, issues are not reproducible, or we fix things in later releases but forget to close the issues, leading to immortal zombies.

Until now this issue has not really provided a minimal reproducible example (see https://stackoverflow.com/help/minimal-reproducible-example).
If somebody provides such an example I can have a look, but currently this issue doesn't give enough context to debug against.
There are multiple factors (number of monitors, capacity of the host, monitored system) which can lead to timeouts.

github-actions bot removed the Stale label Jul 31, 2023
@michel-thomas

I had the problem yesterday, and finally found the cause.

After a maintenance reboot by our provider, the IPv4 address remained the same, but the IPv6 address changed. I would never have noticed this change if Uptime Kuma hadn't alerted.

I don't know why yet, but my Uptime Kuma checks https services via IPv6, while my desktop ISP still only works with IPv4.
Updating my DNS with the new IPv6 address solved the problem.
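
A stale AAAA record like this is easy to miss from an IPv4-only desktop. The following is a minimal sketch of a per-family check, assuming Python 3 on the machine that runs the monitor; `group_by_family` and `check_host` are hypothetical helper names, and port 443 is an assumption for https services:

```python
import socket
from collections import defaultdict


def group_by_family(addrinfo):
    """Group getaddrinfo() results into {'IPv4': [...], 'IPv6': [...]}."""
    names = {socket.AF_INET: "IPv4", socket.AF_INET6: "IPv6"}
    grouped = defaultdict(list)
    for family, _type, _proto, _canonname, sockaddr in addrinfo:
        label = names.get(family)
        # sockaddr[0] is the address string for both IPv4 and IPv6 entries.
        if label and sockaddr[0] not in grouped[label]:
            grouped[label].append(sockaddr[0])
    return dict(grouped)


def check_host(host, port=443, timeout=5.0):
    """Attempt a TCP connection to every resolved address of `host`."""
    infos = socket.getaddrinfo(host, port, type=socket.SOCK_STREAM)
    results = {}
    for label, addresses in group_by_family(infos).items():
        for address in addresses:
            try:
                with socket.create_connection((address, port), timeout=timeout):
                    results[(label, address)] = "ok"
            except OSError as exc:
                results[(label, address)] = f"failed: {exc}"
    return results
```

Running `check_host("<hostname>")` from the monitoring host lists each resolved address with its result. An IPv6 entry that fails while the IPv4 entry succeeds would match the situation described above.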


github-actions bot commented Nov 1, 2023

We are clearing up our old issues and your ticket has been open for 3 months with no activity. Remove stale label or comment or this will be closed in 2 days.

github-actions bot added the Stale label Nov 1, 2023

github-actions bot commented Nov 3, 2023

This issue was closed because it has been stalled for 2 days with no activity.

github-actions bot closed this as not planned (won't fix, can't repro, duplicate, stale) Nov 3, 2023
@pixelbart

Same issues here

@fantasticrabbit

Same issue here. Was stable for 16-7 months using Node+PM2, switching to Docker was stable for 2 months and then started showing all sites down every 4-10 hours. Was not able to find anything in the container logs to troubleshoot. We're planning to switch back to just running the Node app.
