ai-dynamo · nnshah1 · Jul 8, 2025 · Jul 2, 2025 · Jul 2, 2025 · Jul 2, 2025
diff --git a/components/planner/README.md b/components/planner/README.md
@@ -124,6 +124,4 @@ For manual testing, you can use the controller_test.py file to add/remove compon
 
 The Kubernetes backend works by updating the replicas count of the DynamoGraphDeployment custom resource. When the planner determines that workers need to be scaled up or down based on workload metrics, it uses the Kubernetes API to patch the DynamoGraphDeployment resource specification, changing the replicas count for the appropriate worker component. The Kubernetes operator then reconciles this change by creating or terminating the necessary pods. This provides a seamless autoscaling experience in Kubernetes environments without requiring manual intervention.
 
-The Kubernetes backend will automatically be used by Planner when your pipeline is deployed with `dynamo deployment create`. By default, the planner will run in no-op mode, which means it will monitor metrics but not take scaling actions. To enable actual scaling, you should also specify `--Planner.no-operation=false`.
-
-
+The Kubernetes backend will automatically be used by Planner when your pipeline is deployed using a DynamoGraphDeployment CR. By default, the planner will run in no-op mode, which means it will monitor metrics but not take scaling actions. To enable actual scaling, you should also specify `--Planner.no-operation=false`.
@@ -32,11 +32,11 @@ export DYNAMO_TAG=$(dynamo build graphs.agg:Frontend | grep "Successfully built"
 ```bash
 # Deploy first graph
 export DEPLOYMENT_NAME=llm-agg1
-dynamo deployment create $DYNAMO_TAG -n $DEPLOYMENT_NAME -f ./configs/agg.yaml
+# TODO: Deploy your service using a DynamoGraphDeployment CR.
 
 # Deploy second graph
 export DEPLOYMENT_NAME=llm-agg2
-dynamo deployment create $DYNAMO_TAG -n $DEPLOYMENT_NAME -f ./configs/agg.yaml
+# TODO: Deploy your service using a DynamoGraphDeployment CR.
 ```
 
 3. **Deploy Inference Gateway**

@@ -23,8 +23,6 @@
 from rich.console import Console
 
 from dynamo.sdk.cli.build import build
-from dynamo.sdk.cli.deployment import app as deployment_app
-from dynamo.sdk.cli.deployment import deploy
 from dynamo.sdk.cli.env import env
 from dynamo.sdk.cli.run import run
 from dynamo.sdk.cli.serve import serve
@@ -76,8 +74,6 @@ def main(
     context_settings={"allow_extra_args": True, "ignore_unknown_options": True},
     add_help_option=False,
 )(run)
-cli.add_typer(deployment_app, name="deployment")
-cli.command()(deploy)
 cli.command()(build)
 
 if __name__ == "__main__":
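
Reviewer note on the planner README hunk: the paragraph describing the Kubernetes backend says the planner patches the replicas count on the DynamoGraphDeployment resource and lets the operator reconcile. A minimal sketch of what such a patch could look like is below. It is not the planner's actual code. The API group, version, plural, the `spec.services.<component>.replicas` path, and the helper names `build_replicas_patch` and `scale_component` are all assumptions made for illustration; check them against the installed CRD before relying on them.

```python
from typing import Any, Dict

# Assumed CRD coordinates; not confirmed by this diff.
GROUP = "nvidia.com"
VERSION = "v1alpha1"
PLURAL = "dynamographdeployments"


def build_replicas_patch(component: str, replicas: int) -> Dict[str, Any]:
    """Build a merge-patch body that sets the replicas of one component.

    The spec.services.<component>.replicas layout is an assumption.
    """
    if replicas < 0:
        raise ValueError("replicas must be non-negative")
    return {"spec": {"services": {component: {"replicas": replicas}}}}


def scale_component(name: str, namespace: str, component: str, replicas: int) -> Any:
    """Patch the custom resource; the operator is expected to reconcile the pods."""
    # Imported lazily so the patch builder can be used without the client installed.
    from kubernetes import client, config

    config.load_incluster_config()  # use config.load_kube_config() outside a cluster
    api = client.CustomObjectsApi()
    # Some client versions may need an explicit merge-patch content type here.
    return api.patch_namespaced_custom_object(
        group=GROUP,
        version=VERSION,
        namespace=namespace,
        plural=PLURAL,
        name=name,
        body=build_replicas_patch(component, replicas),
    )
```

For example, `scale_component("llm-agg1", "default", "VllmWorker", 3)` would request three replicas of a hypothetical `VllmWorker` component. Only the patch is sent; pod creation and termination are left to the operator, as the README describes.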