Fix kv cache data pointers #1104

xaedes · 2023-04-21T15:13:24Z

Currently, the functions that set the kv_cache overwrite the data pointers of the k and v tensors: the pointer addresses are stored in the memory block (kv_self.buf) itself, so the memcpy that restores the cache overwrites them as well.

Restoring the cache only works correctly within the same runtime session, because the data pointers will not have changed.
I saw folks testing kv_cache get and set by freeing the kv_cache ggml context, creating a new context, and restoring into that. The second context was probably allocated the same memory block, which is why it did not segfault.

When the cache is stored to a file, the program is restarted, and the cache is loaded again, the pointers will be wrong and llama_eval will segfault.

To fix the problem, I remember the data pointers before memcpy overwrites kv_self.buf and then restore them afterwards.
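The remember-and-restore idea can be sketched in isolation. The snippet below is a minimal illustration, not the code from this PR: `MockTensor`, `MockKvCache`, `set_cache_naive`, `set_cache_fixed`, and `points_into` are hypothetical names invented for the example, and the layout (two small headers followed by their payloads in a single block) is only an assumption standing in for how the k and v tensors live inside kv_self.buf.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <new>
#include <vector>

// Hypothetical stand-in for a tensor header stored inside the cache block.
struct MockTensor {
    void * data;    // absolute address: only meaningful for the block it was created in
    size_t nbytes;
};

// Hypothetical stand-in for the kv cache: one block holding both headers
// followed by both payloads (assumed layout, for illustration only).
struct MockKvCache {
    std::vector<uint8_t> buf;
    MockTensor * k = nullptr;
    MockTensor * v = nullptr;

    explicit MockKvCache(size_t payload_bytes)
        : buf(2 * sizeof(MockTensor) + 2 * payload_bytes) {
        uint8_t * payload = buf.data() + 2 * sizeof(MockTensor);
        k = new (buf.data()) MockTensor{payload, payload_bytes};
        v = new (buf.data() + sizeof(MockTensor)) MockTensor{payload + payload_bytes, payload_bytes};
    }
    MockKvCache(const MockKvCache &) = delete;             // copying would leave k and v dangling
    MockKvCache & operator=(const MockKvCache &) = delete;
};

// Naive restore: the copy also overwrites the headers, so k->data and v->data
// end up holding whatever addresses were recorded in the snapshot.
void set_cache_naive(MockKvCache & cache, const std::vector<uint8_t> & snapshot) {
    assert(snapshot.size() == cache.buf.size());
    std::memcpy(cache.buf.data(), snapshot.data(), snapshot.size());
}

// Restore with the fix: remember the current data pointers, copy, put them back.
void set_cache_fixed(MockKvCache & cache, const std::vector<uint8_t> & snapshot) {
    assert(snapshot.size() == cache.buf.size());
    void * k_data = cache.k->data;  // remember before memcpy clobbers the headers
    void * v_data = cache.v->data;
    std::memcpy(cache.buf.data(), snapshot.data(), snapshot.size());
    cache.k->data = k_data;         // restore addresses that are valid for this block
    cache.v->data = v_data;
}

// True if p lies inside the cache's own memory block.
bool points_into(const MockKvCache & cache, const void * p) {
    const auto addr  = reinterpret_cast<uintptr_t>(p);
    const auto begin = reinterpret_cast<uintptr_t>(cache.buf.data());
    return addr >= begin && addr < begin + cache.buf.size();
}
```

Taking a snapshot of one `MockKvCache` and applying it to a second, separately allocated one with `set_cache_naive` leaves `k->data` pointing into the first cache's block. `set_cache_fixed` keeps the pointers inside the destination block while the payload bytes are still copied over.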


ggerganov

Smart solution!


remember and restore kv cache data pointers

283156c

because their value is stored in buf and overwritten by memcpy

ggerganov approved these changes Apr 21, 2023


ggerganov merged commit 8687c1f into ggml-org:master Apr 21, 2023

CRD716 mentioned this pull request Apr 21, 2023

Returning control to vicuna causes garbage results #1110

Closed

4 tasks

jeroen-mostert added a commit to jeroen-mostert/llama.cpp that referenced this pull request Aug 30, 2024

RocM: filter VRAM fetch by HIP_VISIBLE_DEVICES / CUDA_VISIBLE_DEVICES (…

2282fb5

…fixes ggml-org#1104) Signed-off-by: Jeroen Mostert <jeroen.mostert@cm.com>

Bearsaerker mentioned this pull request Mar 12, 2025

Eval bug: Gemma 3 extremly slow prompt processing when using quantized kv cache. #12352

Closed
