server: Completion of pre-tokenized prompt is broken #4476
Comments
It took me some time to realize what was wrong with my server. I originally added the prompt-array support for testing prompts with specifically selected tokens, which has been quite useful, since such prompts are not subject to the constraints of any tokenizer. To support both usages, how about allowing 2-level nested prompts? For example:
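A minimal sketch of what such a request could look like, assuming the existing /completion endpoint and hypothetical token IDs (the payload shape is illustrative, not a final proposal):

```python
import requests

payload = {
    # Outer list: one entry per prompt (the existing multi-prompt behavior).
    # Inner list: a pre-tokenized prompt given as token IDs; strings stay as-is.
    "prompt": [
        [1, 15043, 3186],  # hypothetical token IDs for one prompt
        "Hello world",     # an ordinary string prompt
    ],
    "n_predict": 16,
}

# Assumes a llama.cpp server running locally on the default port.
resp = requests.post("http://127.0.0.1:8080/completion", json=payload)
print(resp.json())
```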
This would be compatible with the multi-prompt change already introduced, and would allow for array prompts.
I'm baffled. I can't get the current multi-prompt to work (#4583). I'll wait for that to be fixed before introducing new behaviors. For now, I'm using #4232 (comment) to get back the previous behavior.
I would like it better if the multi-prompt field were called "prompts". It could then have sub-arrays as in your example, @jxy. The format of the "prompt" field could be made to match the current documentation, i.e. a single prompt.
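A sketch of the request shapes this suggestion seems to imply (hypothetical fields and token IDs; neither shape is an existing llama.cpp server API):

```python
# "prompt": a single prompt, as a string or one array of token IDs,
# matching the current documentation.
single_prompt_request = {
    "prompt": [1, 15043, 3186],  # hypothetical token IDs
    "n_predict": 16,
}

# "prompts": an explicit multi-prompt field whose entries may themselves
# be strings or token-ID arrays.
multi_prompt_request = {
    "prompts": [
        "Hello world",
        [1, 15043, 3186],  # hypothetical token IDs
    ],
    "n_predict": 16,
}
```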
This issue is stale because it has been open for 30 days with no activity.
This issue was closed because it has been inactive for 14 days since being marked as stale.
Prerequisites
Please answer the following questions for yourself before submitting an issue.
Expected Behavior
According to the documentation (llama.cpp/examples/server/README.md, line 117 in cafcd4f), the prompt may be supplied as an array of tokens, and the server should run completion on that pre-tokenized prompt.
Current Behavior
When supplying the prompt as an array of token identifiers, the server instead calls split_multiprompt_task and the request hangs.

Steps to Reproduce

1. Tokenize some content (e.g. via the /tokenize endpoint) to obtain an array of token IDs.
2. Send a completion request with that token array as the prompt; see the sketch below.
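A minimal reproduction sketch along those lines, assuming a locally running llama.cpp server on the default port (the exact content string and n_predict value are illustrative):

```python
import requests

BASE = "http://127.0.0.1:8080"  # assumes a local llama.cpp server

# 1. Tokenize some content to get an array of token IDs.
tokens = requests.post(f"{BASE}/tokenize",
                       json={"content": "Hello world"}).json()["tokens"]

# 2. Request a completion with the pre-tokenized prompt.
#    Expected: the token array is treated as a single prompt.
#    Observed: the server treats it as a multi-prompt request
#    (split_multiprompt_task) and the request hangs.
resp = requests.post(f"{BASE}/completion",
                     json={"prompt": tokens, "n_predict": 16})
print(resp.json())
```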
Failure Logs
slot 0 is processing [task id: 2]
slot unavailable
print_timings: prompt eval time = 0.00 ms / 0 tokens ( -nan ms per token, -nan tokens per second)
print_timings: eval time = -94366367288.92 ms / 0 runs ( -inf ms per token, -0.00 tokens per second)
print_timings: total time = -94366367288.92 ms
slot unavailable