Commit 7fbd43a

docs: Update dynamo_glossary.md (#2082)
Signed-off-by: Anish <80174047+athreesh@users.noreply.github.com>
1 parent 3175b10 commit 7fbd43a

1 file changed, +3 -4 lines changed

docs/dynamo_glossary.md

Lines changed: 3 additions & 4 deletions
@@ -11,16 +11,12 @@
 ## D
 **Decode Phase** - The second phase of LLM inference that generates output tokens one at a time.

-**depends()** - A Dynamo function that creates dependencies between services, enabling automatic client generation and service discovery.
-
 **Disaggregated Serving** - Dynamo's core architecture that separates prefill and decode phases into specialized engines to maximize GPU throughput and improve performance.

 **Distributed Runtime** - Dynamo's Rust-based core system that manages service discovery, communication, and component lifecycle across distributed clusters.

 **Dynamo** - NVIDIA's high-performance distributed inference framework for Large Language Models (LLMs) and generative AI models, designed for multinode environments with disaggregated serving and cache-aware routing.

-**Dynamo Artifact** - A packaged archive containing an inference graph and its dependencies, created using `dynamo build`. It's the containerized, deployable version of a Graph.
-
 **Dynamo Cloud** - A Kubernetes platform providing managed deployment experience for Dynamo inference graphs.

 ## E
@@ -80,5 +76,8 @@
 ## V
 **vLLM** - High-throughput LLM serving engine with Ray distributed support and PagedAttention.

+## W
+**Wide Expert Parallelism (WideEP)** - Mixture-of-Experts deployment strategy that spreads experts across many GPUs (e.g., 64-way EP) so each GPU hosts only a few experts.
+
 ## X
 **xPyD (x Prefill y Decode)** - Dynamo notation describing disaggregated serving configurations where x prefill workers serve y decode workers. Dynamo supports runtime-reconfigurable xPyD.
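
The WideEP entry added in this commit describes spreading experts across many GPUs so that each GPU hosts only a few. A minimal sketch of that idea follows. It is not Dynamo's implementation: the function names `place_experts` and `rank_of`, the contiguous-block layout, and the figure of 256 experts are all assumptions chosen for illustration. Only the 64-way EP figure comes from the glossary text.

```python
def place_experts(num_experts: int, ep_size: int) -> dict[int, list[int]]:
    """Assign expert IDs to expert-parallel (EP) ranks in contiguous blocks.

    Hypothetical helper: real systems may use other layouts or rebalance
    experts at runtime.
    """
    if ep_size <= 0:
        raise ValueError("ep_size must be positive")
    if num_experts % ep_size != 0:
        raise ValueError("this sketch assumes experts divide evenly across ranks")
    per_rank = num_experts // ep_size
    return {
        rank: list(range(rank * per_rank, (rank + 1) * per_rank))
        for rank in range(ep_size)
    }


def rank_of(expert_id: int, num_experts: int, ep_size: int) -> int:
    """Return the EP rank hosting expert_id under the block layout above."""
    return expert_id // (num_experts // ep_size)


# Assumed model size: 256 experts, deployed with 64-way EP (one rank per GPU).
placement = place_experts(num_experts=256, ep_size=64)
print(len(placement), "ranks,", len(placement[0]), "experts per rank")
```

With a wider EP group the per-GPU expert count shrinks, which is the property the glossary entry highlights. The trade-off is that tokens routed to an expert on another rank must be sent across GPUs.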
