[Feature Request] Add batch processing for input prompt data in embedding mode. #893

Closed
AL-Kost opened this issue Apr 11, 2023 · 2 comments
AL-Kost commented Apr 11, 2023

It would be nice to add batch processing for input prompt data in embedding mode.
I.e., read prompts from a file and output a map of prompts and their embeddings.
It seems that this can be done by modifying the logic of the -f flag for embedding mode.
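Roughly the workflow I have in mind, as a sketch in Python pseudocode (`load_model` and `embed` below are hypothetical stand-ins, not existing llama.cpp API; the point is loading the model once, reading prompts from a file, and writing out a prompt → embedding map):

```python
import json
import sys

# Hypothetical stand-ins for whatever the embedding example would expose;
# the point is that the model is loaded a single time for the whole batch.
from embedding_bindings import load_model, embed  # hypothetical module

def main(model_path: str, prompts_path: str, out_path: str) -> None:
    model = load_model(model_path)  # load once, reuse for every prompt

    # One prompt per line in the input file.
    with open(prompts_path, encoding="utf-8") as f:
        prompts = [line.strip() for line in f if line.strip()]

    # Map each prompt to its embedding vector.
    result = {prompt: embed(model, prompt) for prompt in prompts}

    with open(out_path, "w", encoding="utf-8") as f:
        json.dump(result, f, ensure_ascii=False)

if __name__ == "__main__":
    main(sys.argv[1], sys.argv[2], sys.argv[3])
```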

AL-Kost changed the title from "[Feature Request] Add the ability to work with batches of input prompt data in embedding mode." to "[Feature Request] Add batch processing for input prompt data in embedding mode." on Apr 11, 2023
rjadr (Contributor) commented Apr 13, 2023

I second this. I've been working on integrating llama.cpp into LangChain, but retrieving embeddings is terribly slow since we can only pass single strings, and the model is loaded anew for each one. Batch processing of embeddings would be very helpful here, preferably by being able to pass a list of strings via the CLI.
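In the meantime, a stopgap that avoids reloading the model per string is to keep it resident in one process via the llama-cpp-python bindings. A sketch, assuming the bindings expose `Llama(..., embedding=True)` and `Llama.embed()` (exact names may differ between binding versions):

```python
from llama_cpp import Llama  # llama-cpp-python bindings

# Load the model once and reuse it for every string; this assumes the
# bindings expose Llama(..., embedding=True) and Llama.embed().
llm = Llama(model_path="./models/7B/ggml-model-q4_0.bin", embedding=True)

texts = ["first document", "second document", "third document"]
embeddings = [llm.embed(t) for t in texts]  # one vector per input string
```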

The github-actions bot added the stale label on Mar 25, 2024
This issue was closed because it has been inactive for 14 days since being marked as stale.
