
fix #1224 reverse prompt and multi line #1297

Merged — 2 commits merged into ggml-org:master on May 4, 2023

Conversation

newTomas (Contributor) commented May 3, 2023

Fixes #1224
In the original code, encountering an empty line ends the user's input. In reality, an empty line can be a side effect of the previous input and should simply be ignored.
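For illustration, here is a minimal sketch of that idea (hypothetical code, not the exact PR diff), assuming the examples/main convention that a trailing backslash continues input on the next line:

```cpp
// Hypothetical sketch of the fix's idea (not the exact PR diff):
// an empty line read while nothing has been typed yet is a leftover
// newline from the previous input and is skipped, instead of being
// treated as the end of the user's reply.
#include <iostream>
#include <string>

std::string read_user_input() {
    std::string buffer;
    std::string line;
    bool another_line = true;
    while (another_line) {
        if (!std::getline(std::cin, line)) {
            break;                    // EOF or bad stream: stop reading
        }
        if (buffer.empty() && line.empty()) {
            continue;                 // ignore a stray leading empty line
        }
        if (!line.empty() && line.back() == '\\') {
            line.pop_back();          // strip the continuation character
        } else {
            another_line = false;     // no trailing '\': this is the last line
        }
        buffer += line + '\n';
    }
    return buffer;
}

int main() {
    std::cout << "> ";
    std::cout << read_user_input();
    return 0;
}
```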

DeveloperOl commented

works great for me :)

@newTomas changed the title from "fix reverse prompt and multi line" to "fix #1224 reverse prompt and multi line" on May 3, 2023
newTomas (Contributor, Author) commented May 3, 2023

I tested with a wider set of models and found one more bug in another part of the code. If the antiprompt is "User: " and the model generates the tokens "User", ":", " How", the program takes the last n characters of the output, where n is the length of the antiprompt. It therefore ends up comparing "User: " against "r: How" and concludes that the antiprompt was not met.
I'll try to make a fix tomorrow, or someone else can do it; it's a bit more difficult than the previous bug.
https://github.com/ggerganov/llama.cpp/blob/master/examples/main/main.cpp#L514
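To make the failure concrete, here is a hypothetical standalone demo (illustrative names, not the actual main.cpp code) of a suffix check over a window of only antiprompt.size() characters:

```cpp
// The suffix check itself is sound, but the comparison window is only
// antiprompt.size() characters wide, so one extra generated token
// (" How") pushes "User: " out of the window and the match is missed.
#include <iostream>
#include <string>

static bool ends_with_antiprompt(const std::string & output, const std::string & antiprompt) {
    if (output.size() < antiprompt.size()) {
        return false;
    }
    return output.compare(output.size() - antiprompt.size(), antiprompt.size(), antiprompt) == 0;
}

int main() {
    const std::string antiprompt = "User: ";   // note the trailing space
    std::string output = "User:";              // model emitted "User" then ":"

    std::cout << std::boolalpha;
    // false: the trailing space has not been generated yet
    std::cout << ends_with_antiprompt(output, antiprompt) << '\n';

    output += " How";                          // the next token " How" carries the space
    // false: the last 6 characters are "r: How", not "User: "
    std::cout << ends_with_antiprompt(output, antiprompt) << '\n';
    return 0;
}
```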

ggerganov (Member) commented

@newTomas

The fix for this issue is not trivial. One has to check for the described situation and decrease n_past by 1, and also somehow erase the extra generated token (i.e. " How" in your example).

To resolve this, we introduced the --input-prefix argument, which can be used to inject the extra space.
Let's merge this PR as it is after the indentation fixes.
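For context, an invocation using that workaround might look like `./main -m models/7B/ggml-model.bin -r "User:" --input-prefix " "` (the model path is illustrative): the reverse prompt is matched without the trailing space, and --input-prefix injects the space in front of the user's text instead.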

newTomas (Contributor, Author) commented May 4, 2023

@ggerganov
I approved your commit. Let's merge.

DannyDaemonic merged commit f647ce0 into ggml-org:master on May 4, 2023
DannyDaemonic pushed a commit to DannyDaemonic/llama.cpp that referenced this pull request on May 4, 2023:

* fix reverse prompt and multi line
* Code Formatting

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
agronholm commented

Unfortunately, I discovered that not only did this not fix the problem, it made it a whole lot worse. Previously, I was able to at least get it to produce more tokens by pressing enter. Now even that won't work.

Linked issue: Llama Ignoring Reverse Prompt Every Other Time (#1224)