Skip to content

Commit b9cbee0

Browse files
docs: Add note for LMCache ARM support (#2535)
Co-authored-by: Dmitry Tokarev <dtokarev@nvidia.com>
1 parent e5b6a05 commit b9cbee0

File tree

1 file changed

+4
-0
lines changed

1 file changed

+4
-0
lines changed

components/backends/vllm/LMCache_Integration.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -11,6 +11,10 @@ This document describes how LMCache is integrated into Dynamo's vLLM backend to
1111
- **Memory Offloading**: Intelligent KV cache placement across CPU/GPU/storage tiers
1212
- **Improved Throughput**: Reduced GPU memory pressure enables higher batch sizes
1313

14+
## Platform Support
15+
16+
**Important Note**: LMCache integration currently only supports x86 architecture. ARM64 is not supported at this time.
17+
1418
## Aggregated Serving
1519

1620

0 commit comments

Comments
 (0)