add deployment status check in test #58087

abrarsheikh · 2025-10-24T16:12:33Z

I suspect that when we deploy the app config, we dont wait long enough before sending traffic, so requests could go to the wrong version

Signed-off-by: abrar <abrar@anyscale.com>

gemini-code-assist

Code Review

This pull request improves test stability by adding checks to wait for deployments to become healthy before proceeding with the tests. This is a good practice to avoid flakiness. My review includes suggestions to consolidate the health checks for multiple deployments into a single, more efficient call to improve code readability and performance.

gemini-code-assist · 2025-10-24T16:14:53Z

python/ray/serve/tests/test_autoscaling_policy.py

+        wait_for_condition(
+            check_deployment_status, name="A", expected_status=DeploymentStatus.HEALTHY
+        )
+        wait_for_condition(
+            check_deployment_status, name="B", expected_status=DeploymentStatus.HEALTHY
+        )


For improved readability and efficiency, you can combine the health checks for multiple deployments into a single wait_for_condition call. This avoids the overhead of multiple wait_for_condition calls and can be more efficient by fetching the status of all deployments at once.

A helper function could be defined for this. For example:

def check_deployments_healthy(names: List[str], app_name: str = SERVE_DEFAULT_APP_NAME): statuses = serve.status().applications[app_name].deployments return all( statuses.get(name) and statuses[name].status == DeploymentStatus.HEALTHY for name in names )

And then call it like this:

wait_for_condition(check_deployments_healthy, names=["A", "B"])

This helper could be defined locally within the test method or, if it's generally useful, in ray.serve._private.test_utils.

gemini-code-assist · 2025-10-24T16:14:53Z

python/ray/serve/tests/test_autoscaling_policy.py

+        wait_for_condition(
+            check_deployment_status, name="A", expected_status=DeploymentStatus.HEALTHY
+        )
+        wait_for_condition(
+            check_deployment_status, name="B", expected_status=DeploymentStatus.HEALTHY
+        )


Similar to my other comment, these two wait_for_condition calls can be consolidated into a single call for better readability and efficiency. This can be done using a helper function that checks the status of multiple deployments at once.

akyang-anyscale · 2025-10-24T17:45:47Z

I suspect that when we deploy the app config, we dont wait long enough before sending traffic, so requests could go to the wrong version

isn't check_running supposed to do that waiting for us?

abrarsheikh · 2025-10-24T17:52:12Z

I suspect that when we deploy the app config, we dont wait long enough before sending traffic, so requests could go to the wrong version

isn't check_running supposed to do that waiting for us?

is app running imply deployment == healthy ? I am 90% sure that is true.

but also check_running was missing from test_autoscaling_policy_enable_disable

akyang-anyscale · 2025-10-24T18:24:29Z

is app running imply deployment == healthy ? I am 90% sure that is true

so is it redundant then?

python/ray/serve/tests/test_autoscaling_policy.py

Signed-off-by: abrar <abrar@anyscale.com>

I suspect that when we deploy the app config, we dont wait long enough before sending traffic, so requests could go to the wrong version --------- Signed-off-by: abrar <abrar@anyscale.com> Signed-off-by: xgui <xgui@anyscale.com>

I suspect that when we deploy the app config, we dont wait long enough before sending traffic, so requests could go to the wrong version --------- Signed-off-by: abrar <abrar@anyscale.com>

I suspect that when we deploy the app config, we dont wait long enough before sending traffic, so requests could go to the wrong version --------- Signed-off-by: abrar <abrar@anyscale.com> Signed-off-by: Aydin Abiar <aydin@anyscale.com>

I suspect that when we deploy the app config, we dont wait long enough before sending traffic, so requests could go to the wrong version --------- Signed-off-by: abrar <abrar@anyscale.com> Signed-off-by: Future-Outlier <eric901201@gmail.com>

add deployment status check in test

04340ca

Signed-off-by: abrar <abrar@anyscale.com>

abrarsheikh requested a review from a team as a code owner October 24, 2025 16:12

abrarsheikh added the go add ONLY when ready to merge, run all tests label Oct 24, 2025

abrarsheikh requested a review from akyang-anyscale October 24, 2025 16:13

gemini-code-assist bot reviewed Oct 24, 2025

View reviewed changes

akyang-anyscale reviewed Oct 24, 2025

View reviewed changes

python/ray/serve/tests/test_autoscaling_policy.py Outdated Show resolved Hide resolved

ray-gardener bot added the serve Ray Serve Related Issue label Oct 24, 2025

revert

714a4dd

Signed-off-by: abrar <abrar@anyscale.com>

akyang-anyscale approved these changes Oct 24, 2025

View reviewed changes

abrarsheikh merged commit 226a414 into master Oct 24, 2025
6 checks passed

abrarsheikh deleted the SERVE-1274-abrar-flaky branch October 24, 2025 22:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

add deployment status check in test #58087

add deployment status check in test #58087

Uh oh!

abrarsheikh commented Oct 24, 2025 •

edited

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Oct 24, 2025

Uh oh!

gemini-code-assist bot Oct 24, 2025

Uh oh!

akyang-anyscale commented Oct 24, 2025

Uh oh!

abrarsheikh commented Oct 24, 2025

Uh oh!

akyang-anyscale commented Oct 24, 2025

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

add deployment status check in test #58087

add deployment status check in test #58087

Uh oh!

Conversation

abrarsheikh commented Oct 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Oct 24, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Oct 24, 2025

Choose a reason for hiding this comment

Uh oh!

akyang-anyscale commented Oct 24, 2025

Uh oh!

abrarsheikh commented Oct 24, 2025

Uh oh!

akyang-anyscale commented Oct 24, 2025

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

abrarsheikh commented Oct 24, 2025 •

edited

Loading