[feat] Enable chunked prefill for llava-onevision #2412

Ying1123 · 2024-12-08T23:25:45Z

Need to update for qwen2-vl and mllama.

vchzls · 2024-12-26T07:34:17Z

Hello, if the input image is very large, is it likely that the image will be encoded multiple times?

Ying1123 · 2024-12-31T22:21:19Z

Hello, if the input image is very large, is it likely that the image will be encoded multiple times?

Hi @vchzls, yes, this part needs to be optimized. @JamesSand

Ying1123 requested review from merrymercy, zhyncs, hnyls2002, ispobock and ByronHsu as code owners December 8, 2024 23:25

merrymercy force-pushed the main branch from 1ad76cd to 835f8af Compare December 9, 2024 07:31

Ying1123 added 2 commits December 9, 2024 04:03

enable chunked prefill for llava-onevision

bc412a9

fix

f1757f7

Ying1123 force-pushed the ying-image-chunk branch from 6b87ee9 to f1757f7 Compare December 9, 2024 12:03

Ying1123 merged commit 8586b72 into main Dec 9, 2024
17 checks passed

Ying1123 deleted the ying-image-chunk branch December 9, 2024 17:52

Provide feedback