feat: Introduce RFC for Provider Configuration API #1359

cdoern · 2025-03-03T20:01:26Z

What does this PR do?

Introduce an RFC for a Provider Configuration API, allowing for provider configuration changes of an existing stack to aid in common runtime changes like model swapping, endpoint switching, etc

Closes #993

terrytangyuan · 2025-03-04T02:39:58Z

rfcs/RFC-0002-configuration.md

There can also be sensitive configurations such as API keys so we probably want to categorize fields that require redactions.

hopefully this will be handled by the UserConfig class. I would imagine a user should not be able to see any API keys in the returned config

raghotham

Can we try to reduce the scope by calling it /v1/providers. That way we know that we are going to support provider CRUD. We can rejigger /inspect to move out provider specific methods.

The only concern here is the requirement for different software packages based on the provider. Right now, getting dependencies is a build time operation. For now, maybe we support adding new provider endpoints only for providers whose dependencies are already in the distro. Otherwise, we may get into situations where we package everything and cause bloat. Thoughts?

dmartinol · 2025-03-04T07:41:29Z

rfcs/RFC-0002-configuration.md

we also need to check that user_field key in field.json_schema_extra is True, right?

cdoern · 2025-03-04T16:49:39Z

@raghotham :

Can we try to reduce the scope by calling it /v1/providers. That way we know that we are going to support provider CRUD. We can rejigger /inspect to move out provider specific methods.
The only concern here is the requirement for different software packages based on the provider. Right now, getting dependencies is a build time operation. For now, maybe we support adding new provider endpoints only for providers whose dependencies are already in the distro. Otherwise, we may get into situations where we package everything and cause bloat. Thoughts?

Yeah /v1/providers might make more sense here since configuration makes it seem like its a higher level stack config API that can change things like ports.

Yep, it seems fair for an initial version to just be re-configuring already initialized providers! I hadn't thought of using it outside of that realm, good call out!

Introduce an RFC for a Provider Configuration API, allowing for provider configuration changes of an existing stack to aid in common runtime changes like model swapping, endpoint switching, etc Signed-off-by: Charlie Doern <cdoern@redhat.com>

currently the `inspect` API for providers is really a `list` API. Create a new `providers` API which has a GET `providers/{provider_id}` inspect API which returns "user friendly" configuration to the end user. Also add a GET `/providers` endpoint which returns the list of providers as `inspect/providers` does today. This API follows CRUD and is more intuitive/RESTful. This work is part of the RFC at llamastack#1359 Signed-off-by: Charlie Doern <cdoern@redhat.com>

# What does this PR do? currently the `inspect` API for providers is really a `list` API. Create a new `providers` API which has a GET `providers/{provider_id}` inspect API which returns "user friendly" configuration to the end user. Also add a GET `/providers` endpoint which returns the list of providers as `inspect/providers` does today. This API follows CRUD and is more intuitive/RESTful. This work is part of the RFC at #1359 sensitive fields are redacted using `redact_sensetive_fields` on the server side before returning a response: <img width="456" alt="Screenshot 2025-03-13 at 4 40 21 PM" src="https://github.com/user-attachments/assets/9465c221-2a26-42f8-a08a-6ac4a9fecce8" /> ## Test Plan using llamastack/llama-stack-client-python#181 a user is able to to run the following: `llama stack build --template ollama --image-type venv` `llama stack run --image-type venv ~/.llama/distributions/ollama/ollama-run.yaml` `llama-stack-client providers inspect ollama` <img width="378" alt="Screenshot 2025-03-13 at 4 39 35 PM" src="https://github.com/user-attachments/assets/8273d05d-8bc3-44c6-9e4b-ef95e48d5466" /> also, was able to run the new test_list integration test locally with ollama: <img width="1509" alt="Screenshot 2025-03-13 at 11 03 40 AM" src="https://github.com/user-attachments/assets/9b9db166-f02f-45b0-86a4-306d85149bc8" /> Signed-off-by: Charlie Doern <cdoern@redhat.com>

add `v1/providers/` which uses PUT to allow users to change their provider configuration this is a follow up to llamastack#1429 and related to llamastack#1359 a user can call something like: `llama_stack_client.providers.update(api="inference", provider_id="ollama", provider_type="remote::ollama", config={'url': 'http:/localhost:12345'})` or `llama-stack-client providers update inference ollama remote::ollama "{'url': 'http://localhost:12345'}"` this API works by adding a `RequestMiddleware` to the server which checks requests, and if the user is using PUT /v1/providers, the routes are re-registered with the re-initialized provider configurations/methods for the client, `self.impls` is updated to hold the proper methods+configurations this depends on a client PR, the CI will fail until then but succeeded locally Signed-off-by: Charlie Doern <cdoern@redhat.com>

github-actions · 2025-05-05T00:13:43Z

This pull request has been automatically marked as stale because it has not had activity within 60 days. It will be automatically closed if no further activity occurs within 30 days.

github-actions · 2025-06-05T00:13:24Z

This pull request has been automatically closed due to inactivity. Please feel free to reopen if you intend to continue working on it!

add `v1/providers/` which uses PUT to allow users to change their provider configuration this is a follow up to llamastack#1429 and related to llamastack#1359 a user can call something like: `llama_stack_client.providers.update(api="inference", provider_id="ollama", provider_type="remote::ollama", config={'url': 'http:/localhost:12345'})` or `llama-stack-client providers update inference ollama remote::ollama "{'url': 'http://localhost:12345'}"` this API works by adding a `RequestMiddleware` to the server which checks requests, and if the user is using PUT /v1/providers, the routes are re-registered with the re-initialized provider configurations/methods for the client, `self.impls` is updated to hold the proper methods+configurations this depends on a client PR, the CI will fail until then but succeeded locally Signed-off-by: Charlie Doern <cdoern@redhat.com>

cdoern requested review from ashwinb, dineshyv, dltn, ehhuang, hardikjshah, raghotham, sixianyi0721, terrytangyuan, vladimirivic and yanxi0830 as code owners March 3, 2025 20:01

facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Mar 3, 2025

cdoern changed the title ~~RFC: Configuration API~~ feat: Introduce RFC for Configuration API Mar 3, 2025

cdoern force-pushed the config-proposal branch 2 times, most recently from 9b1a1e3 to 5d337fb Compare March 3, 2025 20:05

terrytangyuan reviewed Mar 4, 2025

View reviewed changes

raghotham reviewed Mar 4, 2025

View reviewed changes

dmartinol reviewed Mar 4, 2025

View reviewed changes

cdoern force-pushed the config-proposal branch from 5d337fb to 1e77a3f Compare March 5, 2025 16:04

cdoern requested a review from SLR722 as a code owner March 5, 2025 16:04

cdoern changed the title ~~feat: Introduce RFC for Configuration API~~ feat: Introduce RFC for Provider Configuration API Mar 5, 2025

cdoern requested review from raghotham and terrytangyuan March 5, 2025 16:05

cdoern mentioned this pull request Mar 5, 2025

feat: add provider API for listing and inspecting provider info #1429

Merged

RFC: Provider Configuration API

3f105a8

Introduce an RFC for a Provider Configuration API, allowing for provider configuration changes of an existing stack to aid in common runtime changes like model swapping, endpoint switching, etc Signed-off-by: Charlie Doern <cdoern@redhat.com>

cdoern force-pushed the config-proposal branch from 1e77a3f to 3f105a8 Compare March 5, 2025 21:08

cdoern mentioned this pull request Apr 8, 2025

feat: implement provider updating #1905

Closed

github-actions bot added the stale label May 5, 2025

github-actions bot closed this Jun 5, 2025

cdoern mentioned this pull request Jun 11, 2025

Configuration Management #2386

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: Introduce RFC for Provider Configuration API #1359

feat: Introduce RFC for Provider Configuration API #1359

Uh oh!

cdoern commented Mar 3, 2025 •

edited

Loading

Uh oh!

terrytangyuan Mar 4, 2025

Uh oh!

cdoern Mar 4, 2025

Uh oh!

raghotham left a comment

Uh oh!

dmartinol Mar 4, 2025

Uh oh!

cdoern commented Mar 4, 2025

Uh oh!

github-actions bot commented May 5, 2025

Uh oh!

github-actions bot commented Jun 5, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

feat: Introduce RFC for Provider Configuration API #1359

feat: Introduce RFC for Provider Configuration API #1359

Uh oh!

Conversation

cdoern commented Mar 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Uh oh!

terrytangyuan Mar 4, 2025

Choose a reason for hiding this comment

Uh oh!

cdoern Mar 4, 2025

Choose a reason for hiding this comment

Uh oh!

raghotham left a comment

Choose a reason for hiding this comment

Uh oh!

dmartinol Mar 4, 2025

Choose a reason for hiding this comment

Uh oh!

cdoern commented Mar 4, 2025

Uh oh!

github-actions bot commented May 5, 2025

Uh oh!

github-actions bot commented Jun 5, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

cdoern commented Mar 3, 2025 •

edited

Loading