docs: Update dynamo_glossary.md (#2082)

athreesh · web-flow · commit 7fbd43ae77b3 · 2025-07-29T08:18:13.000-07:00
Signed-off-by: Anish &lt;80174047+athreesh@users.noreply.github.com&gt;
diff --git a/docs/dynamo_glossary.md b/docs/dynamo_glossary.md
@@ -11,16 +11,12 @@
 ## D
 **Decode Phase** - The second phase of LLM inference that generates output tokens one at a time.
 
-**depends()** - A Dynamo function that creates dependencies between services, enabling automatic client generation and service discovery.
-
 **Disaggregated Serving** - Dynamo's core architecture that separates prefill and decode phases into specialized engines to maximize GPU throughput and improve performance.
 
 **Distributed Runtime** - Dynamo's Rust-based core system that manages service discovery, communication, and component lifecycle across distributed clusters.
 
 **Dynamo** - NVIDIA's high-performance distributed inference framework for Large Language Models (LLMs) and generative AI models, designed for multinode environments with disaggregated serving and cache-aware routing.
 
-**Dynamo Artifact** - A packaged archive containing an inference graph and its dependencies, created using `dynamo build`. It's the containerized, deployable version of a Graph.
-
 **Dynamo Cloud** - A Kubernetes platform providing managed deployment experience for Dynamo inference graphs.
 
 ## E
@@ -80,5 +76,8 @@
 ## V
 **vLLM** - High-throughput LLM serving engine with Ray distributed support and PagedAttention.
 
+## W
+**Wide Expert Parallelism (WideEP)** - Mixture-of-Experts deployment strategy that spreads experts across many GPUs (e.g., 64-way EP) so each GPU hosts only a few experts.
+
 ## X
 **xPyD (x Prefill y Decode)** - Dynamo notation describing disaggregated serving configurations where x prefill workers serve y decode workers. Dynamo supports runtime-reconfigurable xPyD.