Prometheus Alerts Integration #14238
Comments
@joelddiez @shalomnaim1 Please review |
You have a typo on the setup section
I believe you meant to implemented, right? |
it was nice if you set a link to |
cc @joelddiaz |
It would be helpful to have some text explaining the various 'annotation' fields for the Prometheus alerts. Specifically, listing the acceptable values for miqTarget & severity. |
@joelddiaz I'm working now on implementing provider targeted alerts, will update docs afterwards |
@joelddiaz @ilackarms updated the document. Note that one should now set up two miq alerts, not one. |
@moolitayer unable to currently set up any alerts |
ManageIQ/manageiq-ui-classic#2714 should help |
@shalomnaim1 can you please paste the two alert definitions you are using in tests? |
@moolitayer, in the Debug section under step 2, you added an example of how to get the current firing alerts from Prometheus. It seems you used 2 different variables for accessing the route, the route saved to
In the examples here you're defining the severity as both an annotation and a label:
and
@moolitayer Is there a recommendation as to which is preferable? |
This issue has been automatically marked as stale because it has not been updated for at least 6 months. If you can still reproduce this issue on the current release or on Thank you for all your contributions! |
This is great documentation 👍 |
@cben unrecognized command 'close', ignoring... Accepted commands are: add_label, add_reviewer, assign, close_issue, move_issue, remove_label, rm_label, set_milestone |
@miq-bot close-issue |
The purpose of this issue is to document the status & usage of ManageIQ integration with Prometheus alerts for Kubernetes/OpenShift.
Table of Contents
Description
Status
Setup
Debugging
Alerts
Description
Prometheus is used as an external alerting component and ManageIQ collects alerts from it and attaches them to inventory objects.
It is then possible to view ongoing alerts in ManageIQ's Monitoring screen and manage their lifecycle (view data & related objects, assign, acknowledge, comment).
Status
Tech Preview for the Gaprindashvili release
Setup
The only supported way to install Prometheus on OpenShift is with openshift-ansible, first implemented in:
Create ansible role for deploying prometheus on openshift openshift/openshift-ansible#4509
(add the inventory flag openshift_hosted_prometheus_deploy=true)
Configure alerts in Prometheus:
See some common alerts
Note: to reload the configuration, delete the pod or send a HUP signal to the Prometheus process.
In ManageIQ, go to Control -> Explorer and create one container node alert based on "all datawarehouse alerts" and one provider alert based on "all datawarehouse alerts".
Add each new alert to an alert profile.
Debugging
Prometheus Side
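When checking the Prometheus side, it can help to list only the alerts that are currently firing. The following is a minimal sketch, not the procedure from the original issue. It assumes a Prometheus 2.x server exposing the standard HTTP API endpoint /api/v1/alerts; the deployment described here may serve alerts through a different route or component. The helper names (firing_alerts, fetch_alerts), the base URL, and the bearer token are hypothetical placeholders.

```python
import json
import ssl
import urllib.request


def firing_alerts(payload):
    """Return the alerts from a parsed /api/v1/alerts response whose state is 'firing'."""
    alerts = payload.get("data", {}).get("alerts", [])
    return [alert for alert in alerts if alert.get("state") == "firing"]


def fetch_alerts(base_url, token, verify_tls=True):
    """Fetch and parse /api/v1/alerts (hypothetical helper; URL and token are assumptions)."""
    request = urllib.request.Request(
        base_url.rstrip("/") + "/api/v1/alerts",
        headers={"Authorization": "Bearer " + token},
    )
    context = ssl.create_default_context()
    if not verify_tls:
        # Only for test clusters with self-signed certificates.
        context.check_hostname = False
        context.verify_mode = ssl.CERT_NONE
    with urllib.request.urlopen(request, context=context) as response:
        return json.load(response)


if __name__ == "__main__":
    # Illustrative response body; the alert names and annotation values are made up.
    sample = {
        "status": "success",
        "data": {
            "alerts": [
                {
                    "labels": {"alertname": "ExampleFiring"},
                    "annotations": {"severity": "example-severity"},
                    "state": "firing",
                },
                {
                    "labels": {"alertname": "ExamplePending"},
                    "annotations": {},
                    "state": "pending",
                },
            ]
        },
    }
    for alert in firing_alerts(sample):
        print(alert["labels"]["alertname"], alert["annotations"].get("severity"))
```

Against a live server, something like firing_alerts(fetch_alerts(route_url, token)) would return the same structure, where route_url and token are whatever the cluster provides. Keeping the filter separate from the HTTP call makes it possible to check the logic against a saved response.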
ManageIQ - Worker Management
ManageIQ - Collection and Alerting logic
Alerts
Here are some common usable alerts:
The pod count alert can be easily triggered using:
Triggering the "Too Many Authenticated Requests"