-
Notifications
You must be signed in to change notification settings - Fork 97
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
NFR Scale test doesn't count all reloads and batches #2009
Comments
@pleshakov did other tests run against this instance of NGF before this one? |
@kate-osborn |
That's good. Well, in that case, we may be able to simplify the Prometheus queries and remove the start and end times. |
the queries that migrated from test instructions to the nginx-gateway-fabric/tests/suite/scale_test.go Lines 382 to 390 in 5f9ec2d
|
Ok, so NGF is installed before each test, but the prometheus instance is shared among all the tests. I wonder if the start time is what's causing this issue. |
I see. we do use pod name though in each query, which should be unique per installation |
I'm thinking it has more to do with when prometheus has scraped the metrics from NGF. Prometheus is configured to scrape every 10 seconds, right? So maybe it's not the start time, but the time we send the query. Is it possible that we send the query before prometheus has scraped the latest metrics from NGF? |
quite possible. there is a 2 times scrapes interval timeout after the test finishes. after that, we also check that mertics exist at the end time
nginx-gateway-fabric/tests/suite/scale_test.go Lines 290 to 304 in 5f9ec2d
that should have prevented not having recent metrics in Prometheus. but it somehow doesn't work :( |
Problem: Scale test is not part of Github actions pipeline Solution: - Add NFR scale test to GitHub actions pipeline along other NFR tests. - Increase the size of the cluster used for NFR tests, as the scale test requires bigger size. Testing: - Successfully run with NGINX -- #2002 - Successfully run with NGINX Plus -- #2017 Some scale test issues were discovered: - #2023 - #2009 Closes #1927
Problem: Scale test is not part of Github actions pipeline Solution: - Add NFR scale test to GitHub actions pipeline along other NFR tests. - Increase the size of the cluster used for NFR tests, as the scale test requires bigger size. Testing: - Successfully run with NGINX -- #2002 - Successfully run with NGINX Plus -- #2017 Some scale test issues were discovered: - #2023 - #2009 Closes #1927
NFR Scale test doesn't count all reloads and batches.
Example of a run:
However, looking at the NGF logs, there are 109 reloads and the last batchID is 114:
Full log https://github.com/nginxinc/nginx-gateway-fabric/blob/d443d6a55988eae8091d230dbbd90ceef1de2c8f/tests/results/scale/edge/TestScale_UpstreamServers/ngf.log
Acceptance criteria:
The text was updated successfully, but these errors were encountered: