Skip to content

Conversation

@abrarsheikh
Copy link
Contributor

@abrarsheikh abrarsheikh commented Oct 8, 2025

part 2 of #56149, a significant portion of the code is taken from the original PR.

This PR does not introduce any change in functionality. Autoscaling is still performed at the deployment level. This will help us make the transition towards application level autoscaling.

The only change in this PR

  1. is moving the autoscaling control loop from the deployment state to the application state.
  2. adding application autoscaling state class, in the new design autoscaling state manager will manage a list of application autoscaling states and each application autoscaling state will manage a list of deployment autoscaling states

@abrarsheikh abrarsheikh requested a review from a team as a code owner October 8, 2025 07:09
@abrarsheikh abrarsheikh added the go add ONLY when ready to merge, run all tests label Oct 8, 2025
cursor[bot]

This comment was marked as outdated.

Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request is a solid step in refactoring the autoscaling logic. The movement of the control loop to the application state is well-executed, and the introduction of ApplicationAutoscalingState provides a clear structure for future application-level autoscaling policies. The new tests are comprehensive and cover many edge cases, which is great to see.

I've identified a few areas for improvement:

  • A bug in a test helper method that could lead to incorrect test behavior.
  • An opportunity to simplify some logic using a dictionary comprehension for better readability.
  • A docstring that could be updated to more accurately reflect the current implementation.

Overall, this is a high-quality contribution. Addressing these points will further enhance the clarity and robustness of the code.

cursor[bot]

This comment was marked as outdated.

@ray-gardener ray-gardener bot added the serve Ray Serve Related Issue label Oct 8, 2025
cursor[bot]

This comment was marked as outdated.

cursor[bot]

This comment was marked as outdated.

cursor[bot]

This comment was marked as outdated.

cursor[bot]

This comment was marked as outdated.

Copy link
Contributor

@akyang-anyscale akyang-anyscale left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

the code lgtm, have not reviewed the tests yet

cursor[bot]

This comment was marked as outdated.

@Kishanthan
Copy link
Contributor

LGTM.

Copy link
Contributor

@zcin zcin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

overall lgtm

@abrarsheikh abrarsheikh force-pushed the SERVE-1216-abrar-auto branch from dbf3f53 to d7d25d2 Compare October 14, 2025 18:58
Signed-off-by: abrar <abrar@anyscale.com>
@abrarsheikh abrarsheikh force-pushed the SERVE-1216-abrar-auto branch from d7d25d2 to 9c3b266 Compare October 14, 2025 18:59
@zcin zcin enabled auto-merge (squash) October 14, 2025 20:25
@github-actions github-actions bot disabled auto-merge October 14, 2025 21:17
@zcin zcin merged commit 3fcd5ed into master Oct 14, 2025
6 checks passed
@zcin zcin deleted the SERVE-1216-abrar-auto branch October 14, 2025 23:05
justinyeh1995 pushed a commit to justinyeh1995/ray that referenced this pull request Oct 20, 2025
ray-project#57548)

part 2 of ray-project#56149, a significant
portion of the code is taken from the original PR.

This PR does not introduce any change in functionality. Autoscaling is
still performed at the deployment level. This will help us make the
transition towards application level autoscaling.

The only change in this PR
1. is moving the autoscaling control loop from the deployment state to
the application state.
2. adding application autoscaling state class, in the new design
autoscaling state manager will manage a list of application autoscaling
states and each application autoscaling state will manage a list of
deployment autoscaling states

Signed-off-by: abrar <abrar@anyscale.com>
xinyuangui2 pushed a commit to xinyuangui2/ray that referenced this pull request Oct 22, 2025
ray-project#57548)

part 2 of ray-project#56149, a significant
portion of the code is taken from the original PR.

This PR does not introduce any change in functionality. Autoscaling is
still performed at the deployment level. This will help us make the
transition towards application level autoscaling.

The only change in this PR
1. is moving the autoscaling control loop from the deployment state to
the application state.
2. adding application autoscaling state class, in the new design
autoscaling state manager will manage a list of application autoscaling
states and each application autoscaling state will manage a list of
deployment autoscaling states

Signed-off-by: abrar <abrar@anyscale.com>
Signed-off-by: xgui <xgui@anyscale.com>
elliot-barn pushed a commit that referenced this pull request Oct 23, 2025
#57548)

part 2 of #56149, a significant
portion of the code is taken from the original PR.

This PR does not introduce any change in functionality. Autoscaling is
still performed at the deployment level. This will help us make the
transition towards application level autoscaling.

The only change in this PR
1. is moving the autoscaling control loop from the deployment state to
the application state.
2. adding application autoscaling state class, in the new design
autoscaling state manager will manage a list of application autoscaling
states and each application autoscaling state will manage a list of
deployment autoscaling states

Signed-off-by: abrar <abrar@anyscale.com>
Signed-off-by: elliot-barn <elliot.barnwell@anyscale.com>
landscapepainter pushed a commit to landscapepainter/ray that referenced this pull request Nov 17, 2025
ray-project#57548)

part 2 of ray-project#56149, a significant
portion of the code is taken from the original PR.

This PR does not introduce any change in functionality. Autoscaling is
still performed at the deployment level. This will help us make the
transition towards application level autoscaling.

The only change in this PR
1. is moving the autoscaling control loop from the deployment state to
the application state.
2. adding application autoscaling state class, in the new design
autoscaling state manager will manage a list of application autoscaling
states and each application autoscaling state will manage a list of
deployment autoscaling states

Signed-off-by: abrar <abrar@anyscale.com>
Aydin-ab pushed a commit to Aydin-ab/ray-aydin that referenced this pull request Nov 19, 2025
ray-project#57548)

part 2 of ray-project#56149, a significant
portion of the code is taken from the original PR.

This PR does not introduce any change in functionality. Autoscaling is
still performed at the deployment level. This will help us make the
transition towards application level autoscaling.

The only change in this PR
1. is moving the autoscaling control loop from the deployment state to
the application state.
2. adding application autoscaling state class, in the new design
autoscaling state manager will manage a list of application autoscaling
states and each application autoscaling state will manage a list of
deployment autoscaling states

Signed-off-by: abrar <abrar@anyscale.com>
Signed-off-by: Aydin Abiar <aydin@anyscale.com>
Future-Outlier pushed a commit to Future-Outlier/ray that referenced this pull request Dec 7, 2025
ray-project#57548)

part 2 of ray-project#56149, a significant
portion of the code is taken from the original PR.

This PR does not introduce any change in functionality. Autoscaling is
still performed at the deployment level. This will help us make the
transition towards application level autoscaling.

The only change in this PR
1. is moving the autoscaling control loop from the deployment state to
the application state.
2. adding application autoscaling state class, in the new design
autoscaling state manager will manage a list of application autoscaling
states and each application autoscaling state will manage a list of
deployment autoscaling states

Signed-off-by: abrar <abrar@anyscale.com>
Signed-off-by: Future-Outlier <eric901201@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

go add ONLY when ready to merge, run all tests serve Ray Serve Related Issue

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants