-
Notifications
You must be signed in to change notification settings - Fork 1.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Server began to freeze after the last update #5661
Comments
It looks like HTTP server stop responding when I use the journal section to find some domains🤔 |
It's not only the HTTP server stop responding, connected devices also start saying "no internet access" either new or already connected. My AGH is running directly in router Xiaomi AX3600 with OpenWRT (amd64 edge build). |
It is same for me. |
It is same for me. |
Same issue here. There's a config file error. AGH modifies the config file and there's a syntax error. The only way I resolved this is by restoring a backup. |
Same here on ARMv7. |
The error also occurs here, on an x86 Pfsense system. |
Thanks for the reports, everyone. We're investigating but cannot reproduce it so far. Enabling and sharing verbose logs either here or through devteam@adguard.com, like OP did, would really help. @ammnt, if it really is the query log, does disabling it fix the issue? Also, are there any panics in e.g. @mouk, could you please elaborate? What kind of error do you see? There shouldn't be any syntax errors, but if you were following Edge, you could be affected by #5627, which shouldn't affect Beta users. |
@ainar-g, I have sent you the message just now. Check devteam@adguard.com mailbox please🙂 |
@ainar-g, I recorded a video with the problem reproducing and the log output at that point. I will send you this next 10 minutes. |
Updates #5661. Squashed commit of the following: commit 3fac63f Author: Ainar Garipov <A.Garipov@AdGuard.COM> Date: Fri Mar 31 17:24:00 2023 +0300 querylog: imp locks even more commit bf14ab9 Author: Ainar Garipov <A.Garipov@AdGuard.COM> Date: Fri Mar 31 17:09:25 2023 +0300 querylog: imp locks more commit 40e885f Author: Ainar Garipov <A.Garipov@AdGuard.COM> Date: Fri Mar 31 16:26:15 2023 +0300 querylog: imp locks, logs
@ammnt, thanks for the information. We have a few theories. The latest Edge release contains a fix for one of them. We're working on the other ones. |
v0.108.0-a.496+3575aa05 contains the second part of the tentative fix. |
same problem with v0.108.0-a.496+3575aa05 |
It may had happened to me I cant get any log as when it crashs the logs stops as well. |
@ainar-g, I have not tested the latest version on the edge channel, but I can say that my server has not freezed over the past few days, since I did not use the web interface🤔 |
I am running v0.108.0-a.496+3575aa05 on Ubuntu 22.04.2 with Docker, it runs several minutes and goes to unhealthy, please check. Thanks. |
same as Version: v0.108.0-a.497+2a0d0629 |
@ainar-g, I am sure that the Docker container fall down itself, as healthcheck knocks on HTTP port. We must try to run the container without healthcheck and if it does not fall, then it is about HTTP processing and the library that is responsible for this. |
@qqsir-dev, please report Docker healthcheck issues to #3290. Thanks. @ammnt, our current theory is that long-running querylog searches prevented new requests from being counted. We've made a few changes that address those issues. Were you (or anyone watching the issue) able to test the most recent Edge releases? |
I also encountered this problem on one of my AGH servers with a fairly high query volume, version v0.108.0-b.31. AGH is running on a Debian 11 x64 vm. Tried to get the logs, but when the problem occurs, the AGH binary no longer responds and does not output any logs. The Systemd Unit is also unable to shut down AGH, and will only force it to shut down after a timeout with SIGKILL The way to reproduce the problem is to click on "Query Log" in the WebGUI and after tens of seconds or minutes, AGH stops responding. |
Same problem here on openwrt master snapshot updating to AdGuard Home v0.108.0-a.497+2a0d0629. |
@ainar-g, I checked the latest version of the Docker image now. There are no changes - the server still falls after a while. The logs are not unusual🤷🏻♂️ |
We've been able to reproduce the issue overnight. We have a few new theories, but most of them have to do with bugs in the Safe Search feature. @ammnt, could you please disable Safe Search, restart AGH, and see if the situation improves? We'll keep investigating the code in the meantime. |
Thanks for trying it out. We are still investigating. |
I had rolled back to stable. But before it, if it does help, I had one client that uses safe-search. And I had indeed notice some "adults websites" being blocked and it can be related. |
I have tested the latest beta build. Works really better for now. Keep trying to reproduce it🫡 |
me too, its working good for about 10 more minutes. |
But I completely disabled safesearch service as @ainar-g told before👆🏼 |
I didn't use safe search, its working good now. |
But, Encryption on/off will stop http service. |
I don't use built-in encryption but it works fine now with the enabled safesearch too. |
I've tried v0.108.0-b.32. I doesn't work if safe search is enabled in client settings. It will become non responsive as before. |
@ammnt, we've made a few improvements to safe search as well as the debugging API. Could you please update to
These three files will contain the information that should help us find out, what is causing the issue. We're doing the same on our testing machines, but it'll help to have more data. |
Update: we had been able to reliably reproduce the freeze and have pushed a change that fixes the issue in our tests in Edge build |
Running v0.108.0-a.506+5d5a7295 for about one hour and no issues anymore. |
I updated to v0.108.0-a.506+5d5a7295 of my ADguard in LXC. It is runing over 1 hour and woks well till now |
Updated to v0.108.0-a.506+5d5a7295 and running fine again |
Version: v0.108.0-a.508+15bba281 running in router Xiaomi 3600 with safe search enable |
Thanks everyone for testing! Since a lot of people are indicating that AGH runs well after the fix, we'll close this issue. The fix will be included into the next Beta and Release builds, which should come in the next few days. |
Prerequisites
I have checked the Wiki and Discussions and found no answer
I have searched other issues and found no duplicates
I want to report a bug and not ask a question
Operating system type
Linux, Other (please mention the version in the description)
CPU architecture
AMD64
Installation
GitHub releases or script from README
Setup
On one machine
AdGuard Home version
v0.108.0-b.31
Description
What did you do?
Classic GH-script installation. Debian 11 5.10.0-21-amd64 #1 SMP Debian 5.10.162-1 (2023-01-21) x86_64 GNU/Linux
No LSB modules are available.
Distributor ID: Debian
Description: Debian GNU/Linux 11 (bullseye)
Release: 11
Codename: bullseye
Nothing special. Just updated to the last beta version and keep use as always. But now the server freezes. The verbose log is attached. Please start look from 22:43 MSK:
log.txt
Expected result
Actual result
Screenshots (if applicable)
Additional information
The text was updated successfully, but these errors were encountered: