diff --git a/README_GAUDI.md b/README_GAUDI.md
index 5109f7ddf9927..644829210125c 100644
--- a/README_GAUDI.md
+++ b/README_GAUDI.md
@@ -81,13 +81,13 @@ Supported Features
 - Inference with [HPU Graphs](https://docs.habana.ai/en/latest/PyTorch/Inference_on_PyTorch/Inference_Using_HPU_Graphs.html) for accelerating low-batch latency and throughput
+- Attention with Linear Biases (ALiBi)
 Unsupported Features
 ====================
 - Beam search
 - LoRA adapters
-- Attention with Linear Biases (ALiBi)
 - Quantization (AWQ, FP8 E5M2, FP8 E4M3)
 - Prefill chunking (mixed-batch inferencing)
diff --git a/docs/source/getting_started/gaudi-installation.rst b/docs/source/getting_started/gaudi-installation.rst
index 27bc0186675f1..328f9e723ec71 100644
--- a/docs/source/getting_started/gaudi-installation.rst
+++ b/docs/source/getting_started/gaudi-installation.rst
@@ -76,13 +76,13 @@ Supported Features
 - Tensor parallelism support for multi-card inference
 - Inference with `HPU Graphs `__ for accelerating low-batch latency and throughput
+- Attention with Linear Biases (ALiBi)
 Unsupported Features
 ====================
 - Beam search
 - LoRA adapters
-- Attention with Linear Biases (ALiBi)
 - Quantization (AWQ, FP8 E5M2, FP8 E4M3)
 - Prefill chunking (mixed-batch inferencing)