-
Notifications
You must be signed in to change notification settings - Fork 4.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
auditbeat: auditd error messages after update to 7.13.2 #26668
Comments
Pinging @elastic/security-external-integrations (Team:Security-External Integrations) |
@ynirk Can I check whether it is consistently the same hosts? and whether the hosts are under load during this start up? Looking at the relevant changes between 7.12.0 and 7.13.2 there is no change that would introduce this behaviour, but the previous deadlock behaviour may have hidden it by failing out. Are you able to test increasing the number of retries or increasing the backoff on the retries? |
@efd6 sorry for the delay i totally miss the ping. |
Thanks, @ynirk. Are you able to test whether this behaviour persists with a version built with the retry relaxations I mention above? Also, are you able to provide the log lines that follow that error? The loop that handles this retries until there is catastrophic failure and no audit monitoring client can be started. It would be helpful to know how many loop iterations fail to obtain a response and interesting to know how many events are lost (log lines corresponding to this func). This latter query will be easier to address than the first. |
Hi! We're labeling this issue as |
After updating auditbeat to 7.13.2 (from 7.12.0) we start seeing the following error message:
It only happens on a small proportion of deployed servers after auditbeat restart. Exemple on a specific instance
logs started right after the update and we see some after auditbeat restart the next day.
The update has been deployed to fix kauditd deadlock issue (#26031) we were experiencing on some hosts. I'm wondering if it could be the same root cause ?
The text was updated successfully, but these errors were encountered: