server : fix crash when prompt exceeds context size #3996

z80maniac · 2023-11-08T18:42:24Z

This fixes #3817.

In the current version, a prompt is properly truncated only when cache_prompt is set. This PR moves the entire truncation logic up one level, so the prompt is now always truncated if it exceeds the context size, regardless of cache_prompt.
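For readers who do not have the diff at hand, here is a minimal sketch of the kind of truncation described above, written so that it does not depend on cache_prompt at all. It is not the code from this PR: the helper name `truncate_prompt`, the `token` alias, and the block-wise policy (keep the first `n_keep` tokens, drop whole blocks from the middle, keep the tail) are assumptions made for illustration.

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

using token = int; // stand-in for a token id type (assumption)

// Hypothetical helper, not the server's actual function.
// Keeps the first n_keep tokens, erases whole blocks right after them,
// and keeps the tail, so that the result is shorter than n_ctx.
std::vector<token> truncate_prompt(const std::vector<token> & prompt, int n_ctx, int n_keep) {
    assert(n_ctx >= 4); // sketch precondition: leaves room for at least one block

    const int n_prompt = static_cast<int>(prompt.size());
    if (n_prompt < n_ctx) {
        return prompt; // already fits, nothing to do
    }

    // Clamp n_keep so that some space is always left for the tail.
    n_keep = std::max(0, std::min(n_keep, n_ctx - 4));

    const int n_left        = n_ctx - n_keep; // room after the kept prefix (>= 4)
    const int n_block       = n_left / 2;     // size of one erasable block (>= 2)
    const int erased_blocks = (n_prompt - n_keep - n_block) / n_block;

    // Prefix: the first n_keep tokens.
    std::vector<token> out(prompt.begin(), prompt.begin() + n_keep);
    // Tail: everything after the erased blocks.
    out.insert(out.end(),
               prompt.begin() + n_keep + erased_blocks * n_block,
               prompt.end());
    return out;
}
```

With a 20-token prompt, `n_ctx = 16` and `n_keep = 4`, this sketch erases one block of 6 tokens and returns 14 tokens: the first 4 followed by the last 10. The point relevant to this PR is only the call site: a caller would run such a step before branching on cache_prompt, so an oversized prompt never reaches evaluation untruncated.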

server : fix crash when prompt exceeds context size

cba6180

jhen0409 approved these changes Nov 8, 2023

jhen0409 mentioned this pull request Nov 10, 2023

server: fix core dump when input prompt larger than prompt context #4022

Closed

jhen0409 merged commit d96ca7d into ggml-org:master Nov 11, 2023

olexiyb pushed a commit to Sanctum-AI/llama.cpp that referenced this pull request Nov 23, 2023

server : fix crash when prompt exceeds context size (ggml-org#3996)

ab63c7c
