Check if kv_cache is tuple before calling split_kv_cache #697

kdamaszk · 2025-01-17T08:13:11Z

After the last rebase, mllama is broken. During the prompt phase, kv_cache is a torch.Tensor([0]) instead of None as it was before. This is probably caused by the new bind_kv_cache function, which is called during input preparation.
This fix also checks that kv_cache is a tuple before calling the split_kv_cache function.
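
The check described above can be sketched in a few lines. This is a minimal illustration, not the actual diff: split_kv_cache here is a hypothetical stand-in that only unpacks a (key, value) pair, maybe_split_kv_cache and DummyTensor are names invented for the example, and DummyTensor stands in for the torch.Tensor([0]) placeholder so the snippet runs without torch.

```python
from typing import Any, Optional, Tuple


def split_kv_cache(kv_cache: Tuple[Any, Any]) -> Tuple[Any, Any]:
    """Hypothetical stand-in: unpack a (key_cache, value_cache) pair."""
    key_cache, value_cache = kv_cache
    return key_cache, value_cache


def maybe_split_kv_cache(kv_cache: Any) -> Tuple[Optional[Any], Optional[Any]]:
    """Split only when kv_cache is a real (key, value) tuple.

    None and non-tuple placeholders (such as a dummy tensor bound
    during the prompt phase) are treated as "no cache available".
    """
    if kv_cache is not None and isinstance(kv_cache, tuple):
        return split_kv_cache(kv_cache)
    return None, None


class DummyTensor:
    """Assumed stand-in for a placeholder like torch.Tensor([0])."""


# A real (key, value) tuple is split; everything else yields (None, None).
print(maybe_split_kv_cache((["k"], ["v"])))
print(maybe_split_kv_cache(None))
print(maybe_split_kv_cache(DummyTensor()))
```

Checking only for None would let the placeholder through to split_kv_cache, which is the failure mode the description reports; the isinstance check covers both cases.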

madamczykhabana

lgtm

Check if kv_cache is tuple before calling split_kv_cache

27d7150

kdamaszk requested review from kzawora-intel, madamczykhabana, michalkuligowski, mgawarkiewicz, vivekgoe and afierka-intel as code owners January 17, 2025 08:13

kdamaszk added the habana label (Issues or PRs submitted by Habana Labs) Jan 17, 2025

madamczykhabana approved these changes Jan 17, 2025

kdamaszk merged commit a685225 into habana_main Jan 17, 2025
33 checks passed
