We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent e5b6a05 commit b9cbee0Copy full SHA for b9cbee0
components/backends/vllm/LMCache_Integration.md
@@ -11,6 +11,10 @@ This document describes how LMCache is integrated into Dynamo's vLLM backend to
11
- **Memory Offloading**: Intelligent KV cache placement across CPU/GPU/storage tiers
12
- **Improved Throughput**: Reduced GPU memory pressure enables higher batch sizes
13
14
+## Platform Support
15
+
16
+**Important Note**: LMCache integration currently only supports x86 architecture. ARM64 is not supported at this time.
17
18
## Aggregated Serving
19
20
0 commit comments