new: add local inference batch size #894

joein · 2025-01-29T14:50:00Z

No description provided.

* wip: add draft implementation of batch processing * fix: embed dict and list of docs, remove redundant code * new: regen async, small refactor * refactor: add docstrings, rename methods * Upload points local inference (#881) * new: separate single and plural model embeddings * fix: fix lazy embed models * new: add inference object inspections to upload methods * wip: local inference upload parallel * new: add local inference to upload points and upload collection, refactor mixin * fix: remove redundant code * redundant import * tests: check is query for query points batch * refactor: refactor semi ordered map * tests: add test for local inference with batches with docs and vectors * tests: check the order of dict processing * new: distinguish models by options * fix: fix typing * fix: fix types * new: embed batches with different options * tests: add tests for batch with different options * fix: ignore ide incorrect type inspection * tests: wait for points to be inserted * fix: set threads to 1 in parallel inference * new: adjust max internal batch size * fix: fix type hints * function to get embeddings size (#892) * function to get embeddings size * async client * keep sync * new: extend embedding size to support image and late interaction models --------- Co-authored-by: George Panchuk <george.panchuk@qdrant.tech> * new: add local inference batch size (#894) --------- Co-authored-by: Andrey Vasnetsov <andrey@vasnetsov.com>

new: add local inference batch size

1ba8fa5

generall approved these changes Jan 29, 2025

View reviewed changes

joein merged commit 0e3ae86 into local-inference-batch Jan 29, 2025
10 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

new: add local inference batch size #894

new: add local inference batch size #894

joein commented Jan 29, 2025

new: add local inference batch size #894

new: add local inference batch size #894

Conversation

joein commented Jan 29, 2025