-
Notifications
You must be signed in to change notification settings - Fork 93
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[ENH] - Setup uptime monitoring for nebari deployment #2557
Comments
Kuberhealthy https://github.com/kuberhealthy/kuberhealthy looks like it might be a promising tool for this. However, I think this should be a meta issue. There are different meanings to up for different services. I think we should create a task that is part of the meta issue for each service that we want to monitor |
I was thinking we can add something like the blackbox_exporter in prometheus, to export uptime metrics for all the services and then create a dashboard in Grafana for the same. Here is an example configuration file for the blackbox_exporter: https://github.com/prometheus/blackbox_exporter/blob/3dd5dfeaabc630ca0c2ec722a07f9755159ae0dd/example.yml |
@dcmcand does using the k8s python client instead of tf for kuberhealthy have any implications related to cross cloud compatibility. Any other pros/cons? |
closed by #2667 |
Feature description
Uptime monitoring for Nebari and all services.
Value and/or benefit
Anything else?
No response
The text was updated successfully, but these errors were encountered: