[Bug] Batch-Inference Template - Model name in Pipeline Not being passed #8

georschi · 2022-01-26T10:50:01Z

When deploying this template, using a model registry with an approved model, the deployed SageMaker Pipeline does not contain the name of the SageMaker Model to use for the Batch inference.

The problem can be pinpointed in the cloudformation template in this line:
https://github.com/aws-samples/sagemaker-custom-project-templates/blob/main/batch-inference/seedcode/endpoint-config-template.yml#L70

      PipelineDefinition: 
        PipelineDefinitionBody: !Sub
          - ${PipelineDefinitionBody}
          - { ModelName: !GetAtt ModelToDeploy.ModelName }

Which doesn't work as expected since cloudformation cannot essentially perform an nested substitution.
Here the expectation is that the ${PipelineDefinitionBody} is substituted with the json value of the variable and then ModelName is replaces in that json. However, this "double" substitution is not possible with cloudFormation.

georschi · 2022-01-26T10:50:11Z

I can think of 2 solutions to this.

1st. - quick and dirty (tested and working but still not recommended for a template solution)
Do the following changes:

in seedcode/endpoint-config-template.yml change Line 70-72 with

        PipelineDefinitionBody:
          !Join
          - ''
          - - '{'
            - '"Parameters": [{"Name": "ModelName",    "Type": "String",   "DefaultValue": "'
            - !GetAtt ModelToDeploy.ModelName
            - '"},  {"Name": "BatchInstanceCount", "Type": "Integer", "DefaultValue": 1},  {"Name": "BatchInstanceType",   "Type": "String",   "DefaultValue": "ml.m5.xlarge"},  {"Name": "InputPath",   "Type": "String",   "DefaultValue": "s3://sagemaker-servicecatalog-seedcode-eu-west-1/dataset/abalone-dataset.csv"},  {"Name": "OutputPath", "Type": "String"}],'
            - !Ref PipelineDefinitionBody
            - '}'

in seedcode/build.py add in line 175

    pipeline_definition = pipeline_definition.replace("${ModelName}", "ModelToDeploy-HHu6ShVCAbPJ")
    import json
    temp = json.loads(pipeline_definition)
    temp.pop('Parameters')
    temp = json.dumps(temp)
    pipeline_definition = temp[1:-1]

Essentially what I'm proposing here is to break apart the Pipeline definition, remove the parameters, and re insert them in the CloudFormation template while "injecting" the Model name.
happy to explain in more detail if this doesn't seem clear.

2nd. more of a reachitecture
With the limitation of CloudFormation explained above, this template could be re-achitected so that the inference pipeline instead of having one step, the batch transform has the following 3 steps.
LambdaStep to read the latest model (or latest model package arn can be passed as input)
CreateModelStep to create a model
TransformStep for the actual transformation.

With this change, the SageMaker model is created within the pipeline execution so it doesn't need to be created at deployment time. Creating a model adds minimal overhead to the whole process so it wouldn't affect the performance of the execution.

An example of this can be seen here: https://github.com/aws-samples/sagemaker-pipelines-callback-step-for-batch-transform/blob/main/batch_transform_with_lambdastep.ipynb (scroll to bottom to see the pipeline diagram)

dgallitelli · 2022-08-23T13:52:00Z

@giuseppe-zappia I've seen you've opened PR#25 updating batch inference. Have you solved this as well in said update?

…-readmes improved readmes and added dev guide

dgallitelli mentioned this issue Jan 26, 2022

[BUG] Batch Inference: the Batch Transform definition is not updated with the Model Group name #7

Closed

This was referenced Sep 15, 2022

[Bug fix] Create the model object in a pipeline #45

Closed

added model name #46

Merged

acere pushed a commit to apac-ml-tfc/sagemaker-custom-project-templates that referenced this issue Jun 13, 2024

Merge pull request aws-samples#8 from ViktorMalesevic/feature/improve…

f4a9e5f

…-readmes improved readmes and added dev guide

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bug] Batch-Inference Template - Model name in Pipeline Not being passed #8

[Bug] Batch-Inference Template - Model name in Pipeline Not being passed #8

georschi commented Jan 26, 2022

georschi commented Jan 26, 2022

dgallitelli commented Aug 23, 2022

[Bug] Batch-Inference Template - Model name in Pipeline Not being passed #8

[Bug] Batch-Inference Template - Model name in Pipeline Not being passed #8

Comments

georschi commented Jan 26, 2022

georschi commented Jan 26, 2022

dgallitelli commented Aug 23, 2022