@@ -45,26 +45,26 @@ The examples below assume you build the latest image yourself from source. If us
4545 :margin: 0
4646 :padding: 3 4 0 0
4747
48- .. grid-item-card :: :doc:`Hello World </ examples/hello_world>`
49- :link: / examples/hello_world
48+ .. grid-item-card :: :doc:`Hello World <examples/runtime/ hello_world/README >`
49+ :link: examples/runtime/ hello_world/README
5050 :link-type: doc
5151
52- Demonstrates the basic concepts of Dynamo by creating a simple multi-service pipeline.
52+ Demonstrates the basic concepts of Dynamo by creating a simple GPU-unaware graph
5353
54- .. grid-item-card :: :doc:`LLM Deployment </examples/llm_deployment >`
55- :link: /examples/llm_deployment
54+ .. grid-item-card :: :doc:`LLM Serving with VLLM <components/backends/vllm/README >`
55+ :link: components/backends/vllm/README
5656 :link-type: doc
5757
58- Presents examples and reference implementations for deploying Large Language Models (LLMs) in various configurations.
58+ Presents examples and reference implementations for deploying Large Language Models (LLMs) in various configurations with VLLM .
5959
60- .. grid-item-card :: :doc:`Multinode </examples/ multinode>`
61- :link: /examples/ multinode
60+ .. grid-item-card :: :doc:`Multinode with SGLang <components/backends/sglang/docs/ multinode-examples >`
61+ :link: components/backends/sglang/docs/ multinode-examples
6262 :link-type: doc
6363
64- Demonstrates deployment for disaggregated serving on 3 nodes using ` nvidia/Llama-3.1-405B-Instruct-FP8 ` .
64+ Demonstrates disaggregated serving on several nodes.
6565
66- .. grid-item-card :: :doc:`TensorRT-LLM </examples /trtllm>`
67- :link: /examples /trtllm
66+ .. grid-item-card :: :doc:`TensorRT-LLM <components/backends /trtllm/README >`
67+ :link: components/backends /trtllm/README
6868 :link-type: doc
6969
7070 Presents TensorRT-LLM examples and reference implementations for deploying Large Language Models (LLMs) in various configurations.
@@ -110,7 +110,7 @@ The examples below assume you build the latest image yourself from source. If us
110110
111111 Dynamo Deploy Quickstart <guides/dynamo_deploy/quickstart.md >
112112 Dynamo Cloud Kubernetes Platform <guides/dynamo_deploy/dynamo_cloud.md >
113- Manual Helm Deployment <guides/dynamo_deploy/manual_helm_deployment .md >
113+ Manual Helm Deployment <deploy/helm/README .md >
114114 GKE Setup Guide <guides/dynamo_deploy/gke_setup.md >
115115 Minikube Setup Guide <guides/dynamo_deploy/minikube.md >
116116 Model Caching with Fluid <guides/dynamo_deploy/model_caching_with_fluid.md >
@@ -126,22 +126,22 @@ The examples below assume you build the latest image yourself from source. If us
126126 :hidden:
127127 :caption: API
128128
129- Python API <API/python_bindings.md >
130129 NIXL Connect API <API/nixl_connect/README.md >
131130
132131.. toctree ::
133132 :hidden:
134133 :caption: Examples
135134
136- Aggregated and Disaggregated Deployment <examples/disagg_skeleton .md >
137- LLM Deployment Examples < examples/llm_deployment .md >
138- Multinode Examples < examples/ multinode.md >
139- LLM Deployment Examples using TensorRT-LLM <examples/ trtllm.md >
135+ Hello World <examples/runtime/hello_world/README .md >
136+ LLM Deployment Examples using VLLM < components/backends/vllm/README .md >
137+ Multinode Examples using SGLang < components/backends/sglang/docs/ multinode-examples .md >
138+ LLM Deployment Examples using TensorRT-LLM <components/backends/ trtllm/README .md >
140139
141140.. toctree ::
142141 :hidden:
143142 :caption: Reference
144143
144+
145145 Glossary <dynamo_glossary.md >
146146 KVBM Reading <architecture/kvbm_reading.md >
147147
0 commit comments