Start node-problem-detector on deployed instances to collect memory stats #1523
Unfortunately this metadata flag doesn't seem to get picked up when using any of the
These look good, but I'm not sure if they actually work, and if they need another review. Also I think there is a conflict with #1529.
Moving to draft while I keep trying options that work with the
11b53f2 to faff2de
When deploying containers to containers I just cannot get these memory metrics out. Closing. :/
Motivation
The default metrics that gcloud provides for our zebrad nodes deployed on VMs do not include memory usage.
Solution
Add google-monitoring-enabled=true metadata to deployed instances
This enables the Node Problem Detector on Container-Optimized OS, which collects metrics on memory usage, open TCP connections, processes, CPU steal, and swap usage, in addition to the existing host-collected metrics.
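
For illustration only, here is a minimal Python sketch of how the metadata key from this PR could be applied to an instance that already exists, by building a `gcloud compute instances add-metadata` command line. The helper name, instance name, and zone are hypothetical placeholders, not values from this repository, and the sketch only prints the command rather than running it. Note that the conversation on this PR reports that the flag did not appear to be picked up in the author's deployment path, so treat this as a way to try the setting, not as a confirmed fix.

```python
import shlex


def add_monitoring_metadata_cmd(instance, zone):
    """Build the gcloud argv that sets google-monitoring-enabled=true.

    `instance` and `zone` are supplied by the caller; nothing here is
    specific to the zebrad deployment scripts.
    """
    return [
        "gcloud", "compute", "instances", "add-metadata", instance,
        "--zone", zone,
        "--metadata", "google-monitoring-enabled=true",
    ]


if __name__ == "__main__":
    # Hypothetical instance name and zone, used only for this example.
    cmd = add_monitoring_metadata_cmd("zebrad-example", "us-central1-a")
    # Print a shell-quoted command that can be reviewed before running it.
    print(shlex.join(cmd))
```

Building the argument list separately from running it makes it easy to review the exact command, or to pass it to `subprocess.run` once it looks right.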
Review
Not urgent.