server: multimodal - fix misreported prompt and num prompt tokens #392

cjpais · 2024-05-02T21:54:46Z

Should address llama.cpp 5852 and llama.cpp 5863

To fix, we set the number of tokens processed to it's correct value in ingest_images where the prompt is tokenized for multimodal.

Additionally a fix for the prompt being set to the empty string for multimodal responses. Basically we iteratively rebuild the initial prompt since it was cleared.

jart

Thank you! I was wondering about that.

server: multimodal - fix misreported prompt and num prompt tokens

90f7203

jart approved these changes May 7, 2024

View reviewed changes

jart merged commit a2d159e into Mozilla-Ocho:main May 7, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

server: multimodal - fix misreported prompt and num prompt tokens #392

server: multimodal - fix misreported prompt and num prompt tokens #392

cjpais commented May 2, 2024

jart left a comment

server: multimodal - fix misreported prompt and num prompt tokens #392

server: multimodal - fix misreported prompt and num prompt tokens #392

Conversation

cjpais commented May 2, 2024

jart left a comment

Choose a reason for hiding this comment