Commit 2945178

rebase
Signed-off-by: qizixi <qizixi@meta.com>
1 parent 0e1d9af commit 2945178

1 file changed: +2 -3 lines changed

vllm/v1/worker/gpu_model_runner.py

Lines changed: 2 additions & 3 deletions
@@ -34,9 +34,8 @@
 from vllm.sampling_params import SamplingType
 from vllm.sequence import IntermediateTensors
 from vllm.utils import (STR_DTYPE_TO_TORCH_DTYPE, DeviceMemoryProfiler,
-                        GiB_bytes, LayerBlockType, LazyLoader,
-                        async_tensor_h2d, cdiv, check_use_alibi,
-                        is_pin_memory_available)
+                        GiB_bytes, LazyLoader, async_tensor_h2d, cdiv,
+                        check_use_alibi, is_pin_memory_available)
 from vllm.v1.attention.backends.flash_attn import FlashAttentionMetadata
 from vllm.v1.attention.backends.utils import CommonAttentionMetadata
 from vllm.v1.core.encoder_cache_manager import compute_encoder_budget
