Skip to content

Commit a070423

Browse files
ywang96Roger Wang
authored andcommitted
[Multimodal] Improve max video embedding length estimation in V1 (vllm-project#24312)
Signed-off-by: Roger Wang <hey@rogerw.me> Co-authored-by: Roger Wang <hey@rogerw.me> Signed-off-by: xuebwang-amd <xuebwang@amd.com>
1 parent a253a61 commit a070423

File tree

2 files changed

+2
-8
lines changed

2 files changed

+2
-8
lines changed

vllm/model_executor/models/llava_onevision.py

Lines changed: 1 addition & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -216,12 +216,9 @@ def get_num_frames_with_most_features(
216216
seq_len: int,
217217
mm_counts: Mapping[str, int],
218218
) -> int:
219-
max_images = mm_counts.get("image", 0)
220219
max_videos = mm_counts.get("video", 0)
221220

222-
max_image_tokens = self.get_max_image_tokens() * max_images
223-
max_total_frames = self._get_max_video_frames(seq_len -
224-
max_image_tokens)
221+
max_total_frames = self._get_max_video_frames(seq_len)
225222
max_frames_per_video = min(max_total_frames // max(max_videos, 1),
226223
_MAX_FRAMES_PER_VIDEO)
227224

vllm/model_executor/models/qwen2_vl.py

Lines changed: 1 addition & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -915,12 +915,9 @@ def get_num_frames_with_most_features(
915915
seq_len: int,
916916
mm_counts: Mapping[str, int],
917917
) -> int:
918-
max_images = mm_counts.get("image", 0)
919918
max_videos = mm_counts.get("video", 0)
920919

921-
max_image_tokens = self.get_max_image_tokens() * max_images
922-
max_total_frames = self._get_max_video_frames(seq_len -
923-
max_image_tokens)
920+
max_total_frames = self._get_max_video_frames(seq_len)
924921
max_frames_per_video = min(max_total_frames // max(max_videos, 1),
925922
_MAX_FRAMES_PER_VIDEO)
926923

0 commit comments

Comments
 (0)