-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[COST-3758] retry failed operator queries up to 5 times #195
Conversation
Codecov Report
@@ Coverage Diff @@
## main #195 +/- ##
==========================================
+ Coverage 89.07% 89.17% +0.09%
==========================================
Files 11 11
Lines 2390 2457 +67
==========================================
+ Hits 2129 2191 +62
- Misses 183 187 +4
- Partials 78 79 +1
Flags with carried forward coverage won't be shown. Click here to find out more.
Continue to review full report in Codecov by Sentry.
|
d3b7484
to
0c1fdf5
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
A handful of comments/suggestions but nothing major.
Kudos, SonarCloud Quality Gate passed! |
With the changes to the script, I've been using this series of make commands in order to deploy the operator to a cluster:
In order to deploy an operator via a CSV like this, an OperatorGroup needs to be created:
I've built this little "proxy" server so that we can simulate slow queries:
https://github.com/maskarb/devfile-sample-go-basic
In the developer view of an Openshift cluster, that repo can be imported and deployed. Then KokuMetricsConfig can be updated like this:
With this service address, the prometheus traffic will flow thru the "proxy" server and will randomly sleep for longer than 10 seconds. This will cause a timeout to occur which is visible in the operator logs. Once that query timeout occurs, that individual query will be requeued and tried again.