aws_applicationautoscaling: Enable SageMaker serverless provisioned concurrency utilization predefined metric #29065
Labels
@aws-cdk/aws-applicationautoscaling, effort/medium, feature-request, p2
Describe the feature
It currently appears to be impossible to use the provisioned concurrency utilization predefined metric for a SageMaker serverless endpoint. I know how to do this for a Lambda function and have even tried using `aws_cdk.aws_applicationautoscaling.PredefinedMetric.LAMBDA_PROVISIONED_CONCURRENCY_UTILIZATION`, but I get the following error:
Use Case
I would like to use autoscaling along with provisioned concurrency for serverless SageMaker endpoints.
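To make the use case concrete, here is a minimal sketch of the CloudFormation-level resources such a setup would probably need, written as plain Python dictionaries so it does not depend on the CDK. The function name `build_scaling_resources` and its parameters are hypothetical. The scalable dimension `sagemaker:variant:DesiredProvisionedConcurrency` and the metric type `SageMakerVariantProvisionedConcurrencyUtilization` are what I understand Application Auto Scaling to expect for serverless endpoints, so please check them against the current AWS documentation.

```python
# Assumed Application Auto Scaling identifiers for SageMaker serverless
# endpoints; verify these against the current AWS documentation.
SAGEMAKER_NAMESPACE = "sagemaker"
PROVISIONED_CONCURRENCY_DIMENSION = "sagemaker:variant:DesiredProvisionedConcurrency"
PROVISIONED_CONCURRENCY_METRIC = "SageMakerVariantProvisionedConcurrencyUtilization"


def build_scaling_resources(endpoint_name, variant_name,
                            min_capacity, max_capacity, target_utilization):
    """Hypothetical helper: build CloudFormation resource definitions for
    target tracking on provisioned concurrency utilization."""
    if min_capacity > max_capacity:
        raise ValueError("min_capacity must not exceed max_capacity")
    if target_utilization <= 0:
        raise ValueError("target_utilization must be positive")

    # Application Auto Scaling addresses a SageMaker variant by this resource ID.
    resource_id = f"endpoint/{endpoint_name}/variant/{variant_name}"
    target_ref = {
        "ResourceId": resource_id,
        "ScalableDimension": PROVISIONED_CONCURRENCY_DIMENSION,
        "ServiceNamespace": SAGEMAKER_NAMESPACE,
    }

    return {
        "ScalableTarget": {
            "Type": "AWS::ApplicationAutoScaling::ScalableTarget",
            "Properties": {
                **target_ref,
                "MinCapacity": min_capacity,
                "MaxCapacity": max_capacity,
            },
        },
        "ScalingPolicy": {
            "Type": "AWS::ApplicationAutoScaling::ScalingPolicy",
            # The policy can only be created once the target is registered.
            "DependsOn": "ScalableTarget",
            "Properties": {
                **target_ref,
                "PolicyName": f"{endpoint_name}-{variant_name}-pc-utilization",
                "PolicyType": "TargetTrackingScaling",
                "TargetTrackingScalingPolicyConfiguration": {
                    "TargetValue": target_utilization,
                    "PredefinedMetricSpecification": {
                        "PredefinedMetricType": PROVISIONED_CONCURRENCY_METRIC,
                    },
                },
            },
        },
    }


if __name__ == "__main__":
    import json

    # Example values only; the endpoint and variant names are placeholders.
    resources = build_scaling_resources("my-endpoint", "AllTraffic", 1, 10, 0.5)
    print(json.dumps(resources, indent=2))
```

As a possible workaround until an enum value exists, the same properties could presumably be passed to the L1 constructs `CfnScalableTarget` and `CfnScalingPolicy` in `aws_cdk.aws_applicationautoscaling`, though I have not verified that end to end.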
Proposed Solution
I think an `aws_cdk.aws_applicationautoscaling.PredefinedMetric.SAGEMAKER_VARIANT_PROVISIONED_CONCURRENCY_UTILIZATION` enum value could be added to the `PredefinedMetric` enum.
Full code example:
Other Information
No response
Acknowledgements
CDK version used
2.127.0
Environment details (OS name and version, etc.)
Darwin M-AI813838 23.3.0 Darwin Kernel Version 23.3.0: Wed Dec 20 21:30:44 PST 2023; root:xnu-10002.81.5~7/RELEASE_ARM64_T6000 arm64