Hi,

Thanks for your hard work on this project.

I've been playing with the code for a few days. I'm trying to find a way to save the internal state of the model (its context) so that it can be reused later, but so far it doesn't work. I don't know if I'm missing something (I'm not experienced with machine learning; I work mostly on systems development).

Here is what I tried (inspired by the code in `main.cpp`):

1. Load the model using `llama_init_from_file`.
2. Call `llama_eval` on a prompt, for example `" Below is an instruction that describes a task. Write a response that appropriately completes the request.\n\n"`.
3. Call `llama_eval` on an instruction, for example `"### Instruction: Hi, I'm Xuan Son."`.
4. Save the `kv_self` buffer using `llama_get_kv_cache`. Expected: the saved data contains information about my name.
5. Ask `what is my name?`; the model correctly responds that my name is Xuan Son.
6. Exit the program.
7. Re-run the program.
8. Reload the model with `llama_init_from_file`, then restore the `kv_cache` using `llama_set_kv_cache`. Expected: the loaded data contains information about my name.
9. Ask `what is my name?`; the model responds with nonsense words: `just love with 12 want some you, he saids`.