components/backends/sglang/README.md
Lines changed: 0 additions & 6 deletions
@@ -94,12 +94,6 @@ cd $DYNAMO_ROOT/components/backends/sglang
 
 ### Aggregated Serving with KV Routing
 
-> [!NOTE]
-> The current implementation of `components/backends/sglang/src/dynamo/sglang/worker/main.py` publishes _placeholder_ engine metrics to keep the Dynamo KV-router happy. Real-time metrics will be surfaced directly from the SGLang engine once the following pull requests are merged:
components/backends/trtllm/kv-cache-tranfer.md
Lines changed: 1 addition & 7 deletions
@@ -61,10 +61,4 @@ To enable NIXL for KV cache transfer in disaggregated serving:
 4. **Send the request:**
 See [client](./README.md#client) section to learn how to send the request to deployment.
 
-**Important:** Ensure that ETCD and NATS services are running before starting the service.
-
-The container will automatically configure the appropriate environment variables (`TRTLLM_USE_NIXL_KVCACHE=1`) when built with the NIXL flag. The same container image can be used to use UCX for KV cache transfer.
-```bash
-unset TRTLLM_USE_NIXL_KVCACHE
-export TRTLLM_USE_UCX_KVCACHE=1
-```
+**Important:** Ensure that ETCD and NATS services are running before starting the service.
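
The note kept by this change asks that ETCD and NATS be running before the service starts. One way to verify that is a small pre-flight check. This is a minimal sketch, assuming both services listen on localhost at their usual default client ports (2379 for etcd, 4222 for NATS). The names `ETCD_ADDR`, `NATS_ADDR`, `port_open`, and `check_services` are hypothetical and not part of Dynamo:

```shell
#!/usr/bin/env bash
# Hypothetical pre-flight check. Addresses are assumptions: override them
# if etcd or NATS run on a different host or port in your deployment.
ETCD_ADDR="${ETCD_ADDR:-localhost:2379}"
NATS_ADDR="${NATS_ADDR:-localhost:4222}"

# Return 0 if a TCP connection to host:port opens within 2 seconds.
# Relies on bash's /dev/tcp redirection and coreutils `timeout`.
port_open() {
  local host="${1%:*}" port="${1##*:}"
  timeout 2 bash -c "exec 3<>/dev/tcp/${host}/${port}" 2>/dev/null
}

# Print one status line per service; return 1 if any is unreachable.
check_services() {
  local missing=0 entry name addr
  for entry in "etcd=${ETCD_ADDR}" "nats=${NATS_ADDR}"; do
    name="${entry%%=*}"
    addr="${entry#*=}"
    if port_open "$addr"; then
      echo "ok: ${name} reachable at ${addr}"
    else
      echo "missing: ${name} not reachable at ${addr}"
      missing=1
    fi
  done
  return "$missing"
}
```

Calling `check_services || exit 1` before launching the deployment would stop early when either service is unreachable. An open port only shows that something is listening, not that the service is healthy.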