Fix failing e2e test #7620
08f7728 to 0e6cb7f
Signed-off-by: 🌲 Harry 🌊 John 🏔 <johrry@amazon.com>
LGTM nice catch! But to prevent this, we should probably pin the image we use here too?
edit: Avalanche only has a main tag, so can't really replace. So merging!
Thank you!
Signed-off-by: 🌲 Harry 🌊 John 🏔 <johrry@amazon.com>
Signed-off-by: 🌲 Harry 🌊 John 🏔 <johrry@amazon.com> Signed-off-by: Saswata Mukherjee <saswataminsta@yahoo.com>
* Proxy: Query goroutine leak when `store.response-timeout` is set (#7618)

  time.AfterFunc() returns a time.Timer object whose C field is nil, according to the documentation. A goroutine blocks forever on reading from a `nil` channel, leading to a goroutine leak on random slow queries.

  Signed-off-by: Mikhail Nozdrachev <mikhail.nozdrachev@aiven.io>

* pkg/clientconfig: fix TLS configs with only CA (#7634)

  065e3dd introduced a regression: TLS configurations for Thanos Ruler query and alerting with only a CA file failed to load. For instance, the following snippet is a valid query configuration:

  ```
  - static_configs:
    - prometheus.example.com:9090
    scheme: https
    http_config:
      tls_config:
        ca_file: /etc/ssl/cert.pem
  ```

  The test fixtures (CA, certificate and key files) are copied from prometheus/common and are valid until 2072.

  Signed-off-by: Simon Pasquier <spasquie@redhat.com>

* Cut patch release v0.36.1

  Signed-off-by: Saswata Mukherjee <saswataminsta@yahoo.com>

* Fix failing e2e test (#7620)

  Signed-off-by: 🌲 Harry 🌊 John 🏔 <johrry@amazon.com>
  Signed-off-by: Saswata Mukherjee <saswataminsta@yahoo.com>

---------

Signed-off-by: Mikhail Nozdrachev <mikhail.nozdrachev@aiven.io>
Signed-off-by: Simon Pasquier <spasquie@redhat.com>
Signed-off-by: Saswata Mukherjee <saswataminsta@yahoo.com>
Signed-off-by: 🌲 Harry 🌊 John 🏔 <johrry@amazon.com>
Co-authored-by: Mikhail Nozdrachev <mikhail.nozdrachev@aiven.io>
Co-authored-by: Simon Pasquier <spasquie@redhat.com>
Co-authored-by: Harry John <johrry@amazon.com>
* CHANGELOG: Mark 0.36 as in progress

  Signed-off-by: Michael Hoffmann <mhoffm@posteo.de>

* Cut release candidate v0.36.0-rc.0 (#7490)

  Signed-off-by: Michael Hoffmann <mhoffm@posteo.de>

* Cut release candidate 0.36.0 rc.1 (#7510)

  * *: fix server grpc histograms (#7493)

    Signed-off-by: Michael Hoffmann <mhoffm@posteo.de>

  * Close endpoints after the gRPC server has terminated (#7509)

    Endpoints are currently closed as soon as we receive a SIGTERM or SIGINT. This causes in-flight queries to get cancelled since outgoing connections get closed instantly. This commit moves the endpoints.Close call after the grpc server shutdown to make sure connections are available as long as the server is running.

    Signed-off-by: Filip Petkovski <filip.petkovsky@gmail.com>

  * Cut release candidate v0.36.0-rc.1

    Signed-off-by: Michael Hoffmann <mhoffm@posteo.de>

  ---------

  Signed-off-by: Michael Hoffmann <mhoffm@posteo.de>
  Signed-off-by: Filip Petkovski <filip.petkovsky@gmail.com>
  Co-authored-by: Filip Petkovski <filip.petkovsky@gmail.com>

* Cut release v0.36.0 (#7578)

  Signed-off-by: Michael Hoffmann <mhoffm@posteo.de>

* Cut patch release `v0.36.1` (#7636)

  * Proxy: Query goroutine leak when `store.response-timeout` is set (#7618)

    time.AfterFunc() returns a time.Timer object whose C field is nil, according to the documentation. A goroutine blocks forever on reading from a `nil` channel, leading to a goroutine leak on random slow queries.

    Signed-off-by: Mikhail Nozdrachev <mikhail.nozdrachev@aiven.io>

  * pkg/clientconfig: fix TLS configs with only CA (#7634)

    065e3dd introduced a regression: TLS configurations for Thanos Ruler query and alerting with only a CA file failed to load. For instance, the following snippet is a valid query configuration:

    ```
    - static_configs:
      - prometheus.example.com:9090
      scheme: https
      http_config:
        tls_config:
          ca_file: /etc/ssl/cert.pem
    ```

    The test fixtures (CA, certificate and key files) are copied from prometheus/common and are valid until 2072.

    Signed-off-by: Simon Pasquier <spasquie@redhat.com>

  * Cut patch release v0.36.1

    Signed-off-by: Saswata Mukherjee <saswataminsta@yahoo.com>

  * Fix failing e2e test (#7620)

    Signed-off-by: 🌲 Harry 🌊 John 🏔 <johrry@amazon.com>
    Signed-off-by: Saswata Mukherjee <saswataminsta@yahoo.com>

  ---------

  Signed-off-by: Mikhail Nozdrachev <mikhail.nozdrachev@aiven.io>
  Signed-off-by: Simon Pasquier <spasquie@redhat.com>
  Signed-off-by: Saswata Mukherjee <saswataminsta@yahoo.com>
  Signed-off-by: 🌲 Harry 🌊 John 🏔 <johrry@amazon.com>
  Co-authored-by: Mikhail Nozdrachev <mikhail.nozdrachev@aiven.io>
  Co-authored-by: Simon Pasquier <spasquie@redhat.com>
  Co-authored-by: Harry John <johrry@amazon.com>

---------

Signed-off-by: Michael Hoffmann <mhoffm@posteo.de>
Signed-off-by: Filip Petkovski <filip.petkovsky@gmail.com>
Signed-off-by: Mikhail Nozdrachev <mikhail.nozdrachev@aiven.io>
Signed-off-by: Simon Pasquier <spasquie@redhat.com>
Signed-off-by: Saswata Mukherjee <saswataminsta@yahoo.com>
Signed-off-by: 🌲 Harry 🌊 John 🏔 <johrry@amazon.com>
Co-authored-by: Michael Hoffmann <mhoffm@posteo.de>
Co-authored-by: Filip Petkovski <filip.petkovsky@gmail.com>
Co-authored-by: Mikhail Nozdrachev <mikhail.nozdrachev@aiven.io>
Co-authored-by: Simon Pasquier <spasquie@redhat.com>
Co-authored-by: Harry John <johrry@amazon.com>
Changes
There seems to be a bug in avalanche. When `--metric-interval` alone is set, no timeseries are returned and no write requests are made by avalanche. See: write.go#L143

Using `--series-interval` and `--sample-interval` seems to fix the test.

I haven't had a chance to look at the avalanche bug in depth, but this PR should unblock Thanos e2e tests.
Verification