feat: support multiple text embedding providers #76

peter-gy · 2025-10-22T06:41:41Z

Summary

Adds support for multiple text embedding providers through LiteLLM integration, enabling users to leverage API-based models (OpenAI, Cohere, Azure, Ollama, etc.) alongside the existing local SentenceTransformers approach. compute_text_projection() API remains 100% backward compatible; new parameters are optional.

Changes

New CLI Options: Added --text-projector to route between litellm and sentence_transformers, and exposed --api-key, --api-base, --dimensions, and --sync flags for LiteLLM-specific configuration
Provider Abstraction: Introduced TextProjectorCallback type and provider-specific implementations (_project_text_with_sentence_transformers, _project_text_with_litellm) so that we can keep benefitting from the existing caching approach regardless of the model used to compute the projections
Examples: Added notebook cells demonstrating Ollama (locally-served API) and OpenAI (remote API) embedding workflows

Testing

Verified in packages/backend/examples/notebook.ipynb with:

SentenceTransformers (default, backward compatibility)
Ollama API (nomic-embed-text)
OpenAI API (text-embedding-3-small)

donghaoren

Thanks for the new addition! A couple of comments.

donghaoren · 2025-10-28T18:55:04Z

packages/backend/pyproject.toml

  "llvmlite >= 0.43.0",
  "accelerate >= 1.5.0",
  "tqdm >= 4.60.0",
+  "litellm>=1.78.5",


Suggested change

"litellm>=1.78.5",

"litellm >= 1.78.5",

donghaoren · 2025-10-28T20:16:54Z

packages/backend/embedding_atlas/projection.py

+        text_projector = _project_text_with_sentence_transformers
+
    hasher = Hasher()
    hasher.update(


In _projection_for_texts, we have a caching mechanism that computes a hash from the parameters and use the hash as a cache filename. It seems like we are not taking into account all text projector parameters (e.g., the text projector type, dimensions). Could you update the code to include the text projector type (as string) and args into the hasher, and increment the "version" number?

Of course. Thanks for pointing it out. Addressed this in 823a82b.

peter-gy added 9 commits October 22, 2025 10:25

chore: add litellm as dependency

ac9e9d1

feat: support LiteLLM-compatible embedding models

caec65f

docs: add example notebook

658d0c9

fix: ensure async batches are awaited

570a9a9

feat: expose options via CLI

0c2e060

docs: keep single example notebook

649268a

style: format code

ff401ad

docs: comment wording

ee79549

docs: remove noise

39ddb55

donghaoren reviewed Oct 28, 2025

View reviewed changes

donghaoren requested a review from domoritz October 28, 2025 20:19

peter-gy added 2 commits October 29, 2025 01:02

style: format code

d21947a

fix: ensure text projector args are considered when computing cache hash

823a82b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: support multiple text embedding providers #76

feat: support multiple text embedding providers #76

Uh oh!

peter-gy commented Oct 22, 2025

Uh oh!

donghaoren left a comment

Uh oh!

donghaoren Oct 28, 2025

Uh oh!

donghaoren Oct 28, 2025

Uh oh!

peter-gy Oct 28, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

feat: support multiple text embedding providers #76

Are you sure you want to change the base?

feat: support multiple text embedding providers #76

Uh oh!

Conversation

peter-gy commented Oct 22, 2025

Summary

Changes

Testing

Uh oh!

donghaoren left a comment

Choose a reason for hiding this comment

Uh oh!

donghaoren Oct 28, 2025

Choose a reason for hiding this comment

Uh oh!

donghaoren Oct 28, 2025

Choose a reason for hiding this comment

Uh oh!

peter-gy Oct 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

peter-gy Oct 28, 2025 •

edited

Loading