feat: Enable setting a default embedding model in the stack #3803

franciscojavierarceo · 2025-10-14T04:23:03Z

What does this PR do?

Enables automatic embedding model detection for vector stores and by using a default_configured boolean that can be defined in the run.yaml.

Test Plan

Unit tests
Integration tests
Simple example below:

Spin up the stack:

uv run llama stack build --distro starter --image-type venv --run

Then test with OpenAI's client:

from openai import OpenAI
client = OpenAI(base_url="http://localhost:8321/v1/", api_key="none")
vs = client.vector_stores.create()

Previously you needed:

vs = client.vector_stores.create(
    extra_body={
        "embedding_model": "sentence-transformers/all-MiniLM-L6-v2",
        "embedding_dimension": 384,
    }
)

The extra_body is now unnecessary.

Signed-off-by: Francisco Javier Arceo <farceo@redhat.com>

ehhuang · 2025-10-14T23:37:13Z

llama_stack/providers/utils/memory/openai_vector_store_mixin.py

+            raise ValueError(
+                f"Multiple embedding models marked as default_configured=True: {model_ids}. "
+                "Only one embedding model can be marked as default."
+            )


This should be checked when Stack was initialized instead.

I added additional validation when the stack is initialized (as well as a test to confirm) for this I think we can keep it here as well in case we allow for models to be dynamically registered and a second default were to slip in. Let me know if you'd like me to remove it though. 👍

ehhuang · 2025-10-14T23:38:02Z

llama_stack/providers/utils/memory/openai_vector_store_mixin.py

+            # Embedding model was provided but dimension wasn't, look it up
+            embedding_dimension = await self._get_embedding_dimension_for_model(embedding_model)


why not just call this in _get_default_embedding_model_and_dimension?

Updated. 👍

ehhuang · 2025-10-14T23:44:36Z

llama_stack/core/library_client.py

            if param_name in body:
                value = body.get(param_name)
                if param_name in exclude_params:
                    converted_body[param_name] = value


maybe unrelated to this PR, but reading due to the change below: do we only allow one such parameter? if so, assert?

there's a break above that silently skips the others. i can add some validation to the above if we want.

ehhuang · 2025-10-14T23:50:13Z

tests/unit/providers/vector_io/test_vector_io_openai_vector_stores.py

+                        "embedding_dimension": 768,
+                        "default_configured": True,


non-blocking: we should formalize these parameters in a EmbeddingModel class.

yeah i can do that as a follow up

Signed-off-by: Francisco Javier Arceo <farceo@redhat.com>

ehhuang

LG!

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Oct 14, 2025

franciscojavierarceo force-pushed the default-embedding-model branch 6 times, most recently from b8168ff to f8cb3c4 Compare October 14, 2025 20:20

feat: Enable setting a default embedding model in the stack

86c1e3b

Signed-off-by: Francisco Javier Arceo <farceo@redhat.com>

franciscojavierarceo force-pushed the default-embedding-model branch from fc87c1e to 86c1e3b Compare October 14, 2025 21:16

franciscojavierarceo marked this pull request as ready for review October 14, 2025 21:22

franciscojavierarceo requested review from ashwinb, bbrowning, ehhuang, hardikjshah, leseb, mattf, raghotham, reluctantfuturist, slekkala1, terrytangyuan and yanxi0830 as code owners October 14, 2025 21:22

ehhuang reviewed Oct 14, 2025

View reviewed changes

incorporating feedback

5a4b291

Signed-off-by: Francisco Javier Arceo <farceo@redhat.com>

ehhuang approved these changes Oct 15, 2025

View reviewed changes

ehhuang merged commit ef4bc70 into llamastack:main Oct 15, 2025
21 checks passed

franciscojavierarceo mentioned this pull request Oct 15, 2025

feat: mongodb vector io #3772

Open

leseb mentioned this pull request Oct 15, 2025

feat(vector-io): implement global default embedding model configuration #2918

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: Enable setting a default embedding model in the stack #3803

feat: Enable setting a default embedding model in the stack #3803

Uh oh!

franciscojavierarceo commented Oct 14, 2025 •

edited

Loading

Uh oh!

ehhuang Oct 14, 2025

Uh oh!

franciscojavierarceo Oct 15, 2025

Uh oh!

ehhuang Oct 14, 2025

Uh oh!

franciscojavierarceo Oct 15, 2025

Uh oh!

ehhuang Oct 14, 2025

Uh oh!

franciscojavierarceo Oct 15, 2025

Uh oh!

ehhuang Oct 14, 2025

Uh oh!

franciscojavierarceo Oct 15, 2025

Uh oh!

ehhuang left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		# Embedding model was provided but dimension wasn't, look it up
		embedding_dimension = await self._get_embedding_dimension_for_model(embedding_model)

feat: Enable setting a default embedding model in the stack #3803

feat: Enable setting a default embedding model in the stack #3803

Uh oh!

Conversation

franciscojavierarceo commented Oct 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Test Plan

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ehhuang left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

franciscojavierarceo commented Oct 14, 2025 •

edited

Loading