-
Notifications
You must be signed in to change notification settings - Fork 2.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
hostmetrics receiver logs spurious errors when it races with process termination #30434
Comments
Pinging code owners for receiver/hostmetrics: @dmitryax @braydonk. See Adding Labels via Comments if you do not have permissions to add labels yourself. |
Thanks for the report @ringerc - would you like to try and offer a fix for this issue? |
@atoulme I'd be happy to give it a go though it'll be a while before I can queue it up. In the meantime hopefully others will see this and at least know what the errors are from / why they appear. |
This issue has been inactive for 60 days. It will be closed in 60 days if there is no activity. To ping code owners by adding a component label, see Adding Labels via Comments, or if you are unsure of which component this issue relates to, please ping Pinging code owners: See Adding Labels via Comments if you do not have permissions to add labels yourself. |
This issue has been closed as inactive because it has been stale for 120 days with no activity. |
Describe the bug
If a process terminates midway through a hostmetrics process scrape, the scraper will log a spurious error like
because
/proc/1965300
existed when it listed/proc
, but had vanished by the time it tried to read its contents.This error is expected and can be safely silenced. It'd make sense to
stat
the directory on I/O error, and if it'sENOENT
, suppress the error. Or just ignoreENOENT
for subdirs entirely since that's the only likely cause for this error.Steps to reproduce
Run a workload that creates and terminates lots of processes, while running hostmetrics
process
receiver.What did you expect to see?
No
error
level logs.What did you see instead?
error
level logs about failure to read procfs entries for processes that vanished during the scrape.What version did you use?
v0.91.0
What config did you use?
A generic sample config with the
hostmetrics
receiver enabled.Environment
Generic k8s (kind)
The text was updated successfully, but these errors were encountered: