feat: better batching in ingest method #348

mhordynski · 2025-02-11T09:55:04Z

Feature description

Right now ingest method flow looks like this:

document processing(chunking) is controlled by ProcessingExecutionStrategy
All elements are grouped together
All embeddings are calculated at once

We should also batch requests to embedding model and vector database. One way is to move embedding&inserting into ProcessingExecutionStrategy.

For errors we should decide how to handle failed documents, for example create a default implementation gathering all failed documents into a list. User should be able to create custom error handling strategy.

Motivation

Current behaviour causes problems with rate limits / memory usage / too big requests to embedding models.

Additional context

No response

mhordynski added the feature New feature or request label Feb 11, 2025

mhordynski self-assigned this Feb 11, 2025

mhordynski added this to ragbits Feb 11, 2025

mhordynski moved this to Backlog in ragbits Feb 11, 2025

mhordynski mentioned this issue Feb 11, 2025

enabler: propose concept how to batch ingest method #349

Open

mhordynski added stable-release and removed stable-release labels Feb 17, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: better batching in ingest method #348

feat: better batching in ingest method #348

mhordynski commented Feb 11, 2025 •

edited

Loading

feat: better batching in ingest method #348

feat: better batching in ingest method #348

Comments

mhordynski commented Feb 11, 2025 • edited Loading

Feature description

Motivation

Additional context

mhordynski commented Feb 11, 2025 •

edited

Loading