[1/n] add application level autoscaling policy in schema #57535

abrarsheikh · 2025-10-08T02:53:58Z

Part 1 of #56149.

Move _serialized_policy_def from AutoscalingConfig into AutoscalingPolicy. This is needed in order to reuse AutoscalingPolicy for application-level autoscaling.
Make autoscaling_policy a top-level config in ServeApplicationSchema.

Signed-off-by: abrar <abrar@anyscale.com>

gemini-code-assist

Code Review

This pull request refactors the autoscaling policy configuration by moving the serialization logic from AutoscalingConfig into AutoscalingPolicy. It also introduces an application-level autoscaling_policy in ServeApplicationSchema. The changes are logical and improve encapsulation.

I've identified a critical issue in the protobuf schema update that breaks wire compatibility, and a medium-severity issue regarding schema definition in ServeApplicationSchema. Please see the detailed comments.

gemini-code-assist · 2025-10-08T02:55:19Z

src/ray/protobuf/serve.proto

  // The autoscaling policy definition.
-  AutoscalingPolicy policy = 12;
+  AutoscalingPolicy policy = 11;

  // Target number of in flight requests per replica. This is the primary configuration
  // knob for replica autoscaler. Lower the number, the more rapidly the replicas
  // scales up. Must be a non-negative integer.
-  double target_ongoing_requests = 13;
+  double target_ongoing_requests = 12;

  // The multiplicative "gain" factor to limit upscale.
-  optional double upscaling_factor = 14;
+  optional double upscaling_factor = 13;

  // The multiplicative "gain" factor to limit downscale.
-  optional double downscaling_factor = 15;
+  optional double downscaling_factor = 14;

  // How long to wait before scaling down replicas from 1 to 0
-  optional double downscale_to_zero_delay_s = 16;
+  optional double downscale_to_zero_delay_s = 15;


Renumbering protobuf fields is a wire-incompatible breaking change. An older component serializing an AutoscalingConfig message will use field numbers 12, 13, etc., for policy, target_ongoing_requests, etc. A newer component will interpret these as target_ongoing_requests, upscaling_factor, etc., leading to silent data corruption or deserialization errors.

To maintain compatibility, you should not renumber the existing fields. Please revert the field numbers to their original values. It's also a good practice to mark the removed field number as reserved to prevent accidental reuse, like this:

  ...
  optional double downscale_smoothing_factor = 10;

  reserved 11;

  // The autoscaling policy definition.
  AutoscalingPolicy policy = 12;
  ...

Suggested change

  // The autoscaling policy definition.
-  AutoscalingPolicy policy = 11;
+  AutoscalingPolicy policy = 12;

  // Target number of in flight requests per replica. This is the primary configuration
  // knob for replica autoscaler. Lower the number, the more rapidly the replicas
  // scales up. Must be a non-negative integer.
-  double target_ongoing_requests = 12;
+  double target_ongoing_requests = 13;

  // The multiplicative "gain" factor to limit upscale.
-  optional double upscaling_factor = 13;
+  optional double upscaling_factor = 14;

  // The multiplicative "gain" factor to limit downscale.
-  optional double downscaling_factor = 14;
+  optional double downscaling_factor = 15;

  // How long to wait before scaling down replicas from 1 to 0
-  optional double downscale_to_zero_delay_s = 15;
+  optional double downscale_to_zero_delay_s = 16;
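
To make the renumbering concern concrete, here is a minimal sketch of how a protobuf `double` field is laid out on the wire and what happens when the reader's field numbers differ from the writer's. It is a hand-rolled encoder using only the Python standard library, not the protobuf runtime; it handles only `double` fields with single-byte tags (field numbers 1 to 15), and the helper names `encode_double_field` and `decode_double_fields` are illustrative assumptions, not part of Ray or protobuf.

```python
import struct

WIRE_TYPE_FIXED64 = 1  # protobuf wire type used for `double`


def encode_double_field(field_number: int, value: float) -> bytes:
    """Encode one double field as tag byte + 8 little-endian bytes."""
    tag = (field_number << 3) | WIRE_TYPE_FIXED64
    # Sketch limitation: only tags that fit in a single varint byte.
    assert tag < 0x80, "this sketch supports field numbers 1..15 only"
    return bytes([tag]) + struct.pack("<d", value)


def decode_double_fields(payload: bytes, names_by_number: dict) -> dict:
    """Decode a payload of double fields using the reader's number-to-name map."""
    decoded = {}
    offset = 0
    while offset < len(payload):
        field_number = payload[offset] >> 3
        (value,) = struct.unpack_from("<d", payload, offset + 1)
        name = names_by_number.get(field_number, f"unknown_{field_number}")
        decoded[name] = value
        offset += 9  # 1 tag byte + 8 value bytes
    return decoded


# Field numbers before the PR (writer) and after the renumbering (reader).
OLD_NUMBERS = {13: "target_ongoing_requests", 14: "upscaling_factor"}
NEW_NUMBERS = {12: "target_ongoing_requests", 13: "upscaling_factor"}

# An "old" writer serializes target_ongoing_requests = 2.0 as field 13.
payload = encode_double_field(13, 2.0)

old_view = decode_double_fields(payload, OLD_NUMBERS)
new_view = decode_double_fields(payload, NEW_NUMBERS)
```

With these maps, `old_view` is `{"target_ongoing_requests": 2.0}` while `new_view` is `{"upscaling_factor": 2.0}`: the same bytes are silently attributed to a different field. This only matters when a writer and a reader built from different schema versions exchange messages, which is the point debated in the replies that follow.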

harshit-anyscale · 2025-10-09

+1, we shouldn't change the numbers assigned to existing fields; this can make the code backward incompatible.

abrarsheikh · 2025-10-09

This is not a problem because we don't store these proto records in a DB and reuse them in a different session.

gemini-code-assist · 2025-10-08T02:55:19Z

python/ray/serve/schema.py

+    autoscaling_policy: dict = Field(
+        default={},
+        description=(
+            "Autoscaling policy for the application. "
+            "If null, serve fallbacks to autoscaling policy in each deployment. "
+            "This option is under development and not yet supported."
+        ),
+    )


The type hint and default value for autoscaling_policy are inconsistent with its description. The description mentions "If null...", which implies the field should be optional and can be None.

The type hint should be Optional[dict] to allow None as a value.

The default value should be None to match the "if null" condition described.

Using default={} creates a mutable default value, which can lead to unexpected behavior. It's better to use default=None or default_factory=dict. In this case, default=None is the clearest and most correct choice based on the description.

Suggested change

-    autoscaling_policy: dict = Field(
-        default={},
+    autoscaling_policy: Optional[dict] = Field(
+        default=None,
         description=(
             "Autoscaling policy for the application. "
             "If null, serve fallbacks to autoscaling policy in each deployment. "
             "This option is under development and not yet supported."
         ),
     )
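
The difference between a `None` default and an empty-dict default can be sketched as follows. This uses a standard-library dataclass as a stand-in for the Pydantic model, so `ApplicationSketch`, `resolve_policy`, and the dict contents are hypothetical names for illustration only. The field description says the option is not yet supported, so the fallback logic shown here is an assumption about the intended semantics, not Ray Serve's implementation.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ApplicationSketch:
    """Hypothetical stand-in for ServeApplicationSchema (not the real class)."""

    name: str
    # None means "no application-level policy was configured".
    autoscaling_policy: Optional[dict] = None


def resolve_policy(app: ApplicationSketch, deployment_policy: dict) -> dict:
    """Prefer the application-level policy; otherwise use the deployment's."""
    if app.autoscaling_policy is None:
        return deployment_policy
    return app.autoscaling_policy


deployment_policy = {"name": "deployment-level"}

unset_app = ApplicationSketch("app")
configured_app = ApplicationSketch("app", {"name": "app-level"})
empty_app = ApplicationSketch("app", {})

fallback = resolve_policy(unset_app, deployment_policy)
override = resolve_policy(configured_app, deployment_policy)
empty = resolve_policy(empty_app, deployment_policy)
```

Here `fallback` is the deployment-level dict and `override` is the application-level one. Note that `empty` is `{}`, not the deployment policy: an empty dict is a value, not "null", which is why a `{}` default does not match an "If null, ..." description.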

Signed-off-by: abrar <abrar@anyscale.com>

zcin · 2025-10-09T23:25:11Z

@abrarsheikh merge conflicts

…brar-schema

Signed-off-by: abrar <abrar@anyscale.com>

Kishanthan · 2025-10-10T15:11:54Z

LGTM

add application level autoscaling policy in schema

9e09e1a

Signed-off-by: abrar <abrar@anyscale.com>

abrarsheikh requested a review from a team as a code owner October 8, 2025 02:53

gemini-code-assist bot reviewed Oct 8, 2025

fix type

a51e63a

Signed-off-by: abrar <abrar@anyscale.com>

abrarsheikh added the "go" label (add ONLY when ready to merge, run all tests) Oct 8, 2025

ray-gardener bot added the "serve" label (Ray Serve Related Issue) Oct 8, 2025

abrarsheikh requested review from akyang-anyscale, harshit-anyscale and zcin and removed request for akyang-anyscale October 8, 2025 17:56

zcin approved these changes Oct 9, 2025

python/ray/serve/schema.py (outdated)

python/ray/serve/config.py (outdated)

abrarsheikh added 2 commits October 9, 2025 23:38

Merge branch 'master' of github.com:ray-project/ray into SERVE-1215-a…

d17d1a1

…brar-schema

change name to policy function

2ce7498

Signed-off-by: abrar <abrar@anyscale.com>

zcin enabled auto-merge (squash) October 10, 2025 00:26

fix name

292cbcd

Signed-off-by: abrar <abrar@anyscale.com>

github-actions bot disabled auto-merge October 10, 2025 01:56

fix test

84b4382

Signed-off-by: abrar <abrar@anyscale.com>

zcin merged commit 023e470 into master Oct 10, 2025
6 checks passed

zcin deleted the SERVE-1215-abrar-schema branch October 10, 2025 17:15
