@@ -49,22 +49,22 @@ The examples below assume you build the latest image yourself from source. If us
4949 :link: /examples/hello_world
5050 :link-type: doc
5151
52- Demonstrates the basic concepts of Dynamo by creating a simple multi-service pipeline.
52+ Demonstrates the basic concepts of Dynamo by creating a simple GPU-unaware graph
5353
54- .. grid-item-card :: :doc:`LLM Deployment </examples/llm_deployment >`
55- :link: /examples/llm_deployment
54+ .. grid-item-card :: :doc:`LLM Serving with VLLM </components/backends/vllm >`
55+ :link: /components/backends/vllm
5656 :link-type: doc
5757
58- Presents examples and reference implementations for deploying Large Language Models (LLMs) in various configurations.
58+ Presents examples and reference implementations for deploying Large Language Models (LLMs) in various configurations with VLLM .
5959
60- .. grid-item-card :: :doc:`Multinode </examples/ multinode>`
61- :link: /examples/ multinode
60+ .. grid-item-card :: :doc:`Multinode with SGLang </components/backends/sglang/docs/ multinode-examples >`
61+ :link: /components/backends/sglang/docs/ multinode-examples
6262 :link-type: doc
6363
64- Demonstrates deployment for disaggregated serving on 3 nodes using ` nvidia/Llama-3.1-405B-Instruct-FP8 ` .
64+ Demonstrates disaggregated serving on several nodes.
6565
66- .. grid-item-card :: :doc:`TensorRT-LLM </examples /trtllm>`
67- :link: /examples /trtllm
66+ .. grid-item-card :: :doc:`TensorRT-LLM </components/backends /trtllm>`
67+ :link: /components/backends /trtllm
6868 :link-type: doc
6969
7070 Presents TensorRT-LLM examples and reference implementations for deploying Large Language Models (LLMs) in various configurations.
@@ -110,7 +110,7 @@ The examples below assume you build the latest image yourself from source. If us
110110
111111 Dynamo Deploy Quickstart <guides/dynamo_deploy/quickstart.md >
112112 Dynamo Cloud Kubernetes Platform <guides/dynamo_deploy/dynamo_cloud.md >
113- Manual Helm Deployment <guides/dynamo_deploy/manual_helm_deployment .md >
113+ Manual Helm Deployment <deploy/helm/README .md >
114114 GKE Setup Guide <guides/dynamo_deploy/gke_setup.md >
115115 Minikube Setup Guide <guides/dynamo_deploy/minikube.md >
116116 Model Caching with Fluid <guides/dynamo_deploy/model_caching_with_fluid.md >
@@ -126,17 +126,14 @@ The examples below assume you build the latest image yourself from source. If us
126126 :hidden:
127127 :caption: API
128128
129- Python API <API/python_bindings.md >
130129 NIXL Connect API <API/nixl_connect/README.md >
131130
132131.. toctree ::
133132 :hidden:
134133 :caption: Examples
135134
136- Aggregated and Disaggregated Deployment <examples/disagg_skeleton.md >
137- LLM Deployment Examples <examples/llm_deployment.md >
138- Multinode Examples <examples/multinode.md >
139- LLM Deployment Examples using TensorRT-LLM <examples/trtllm.md >
135+ Multinode Examples </components/backends/sglang/docs/multinode-examples.md >
136+ LLM Deployment Examples using TensorRT-LLM <components/backends/trtllm/README.md >
140137
141138.. toctree ::
142139 :hidden:
0 commit comments