Updated KV_pytorch and ORT inference for VLMs to incorporate image_idx #557

quic-dhirajku · 2025-09-10T08:18:10Z

Updated the run_vlm_kv_model_on_pytorch and run_vlm_kv_model_on_ort methods to run with the latest dual QPC setup, along with the required changes to the Input Handler of VLMs.

Also updated how head_dim is calculated for past_key_value creation, since certain models now provide a specific head_dim in their config. We fall back to the previous method if the parameter isn't found in the config.
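For reviewers, the head_dim fallback described above can be sketched roughly as follows. This is an illustration only, not the code in this PR: the helper names `resolve_head_dim` and `past_key_value_shape` are hypothetical, the "previous method" is assumed to be `hidden_size // num_attention_heads`, and the attribute names follow common Hugging Face config conventions (for a VLM they may live on a nested text config instead).

```python
from types import SimpleNamespace


def resolve_head_dim(config):
    """Hypothetical helper: prefer an explicit head_dim, else derive it."""
    head_dim = getattr(config, "head_dim", None)
    if head_dim is None:
        # Assumed fallback: split the hidden size evenly across attention heads.
        head_dim = config.hidden_size // config.num_attention_heads
    return head_dim


def past_key_value_shape(config, batch_size, ctx_len):
    """Hypothetical helper: shape of one past key (or value) tensor."""
    # Assumed: use num_key_value_heads when the config defines it (GQA models).
    n_kv_heads = getattr(config, "num_key_value_heads", None) or config.num_attention_heads
    return (batch_size, n_kv_heads, ctx_len, resolve_head_dim(config))


# Stand-in configs for illustration; the values are made up.
explicit = SimpleNamespace(hidden_size=2560, num_attention_heads=8,
                           num_key_value_heads=4, head_dim=256)
derived = SimpleNamespace(hidden_size=4096, num_attention_heads=32)

print(past_key_value_shape(explicit, batch_size=1, ctx_len=64))
print(past_key_value_shape(derived, batch_size=1, ctx_len=128))
```

The first config shows why the explicit value matters: dividing its hidden size by its head count would give 320, not the 256 the config declares, so deriving head_dim would produce past_key_value tensors of the wrong shape.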

Signed-off-by: Dhiraj Kumar Sah <dhirajku@qti.qualcomm.com>

quic-dhirajku requested review from quic-rishinr, ochougul, quic-hemagnih and quic-amitraj as code owners September 10, 2025 08:18
