-
Notifications
You must be signed in to change notification settings - Fork 14.7k
Closed as not planned
Closed as not planned
Copy link
Labels
Description
Name and Version
version: 7122 (21d31e0)
built with cc (Debian 14.2.0-19) 14.2.0 for x86_64-linux-gnu
Operating systems
Linux
Which llama.cpp modules do you know to be affected?
llama-server
Command line
llama-server -m model.gguf -c 32768 -ngl 999 --reasoning-budget 0Problem description & steps to reproduce
When using the new Continue button in the WebUI in combination with a grammar, continuing the generation restarts the grammar parsing from the point of continuation. What should happen is that the grammar parsing should start from the beginning of the original input.
For example, with the JSON grammar
{
"json_schema": {
"type": "object",
"properties": {
"sentiment": {
"type": "string",
"enum": ["positive", "neutral", "negative"]
},
"confidence": {
"type": "number",
"minimum": 0,
"maximum": 1
},
"summary": {
"type": "string",
"maxLength": 200
}
},
"required": ["sentiment", "confidence", "summary"]
}
}
I say
How are you?
and get
{"sentiment": "positive", "confidence": 0.98, "summary": "The speaker seems to be in a good mood or is asking about the listener's well-being in a polite manner."}
If I delete after The speaker seems to and hit Continue, I get
{"sentiment": "positive", "confidence": 0.98, "summary": "The speaker seems to {"sentiment": "positive" , "confidence": 0.98, "summary": "The speaker seems to be asking a polite and casual greeting question to inquire about the listener's well-being." }