Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[AI Compose] Support many model deployments for AI Service #4880

Open
vhvb1989 opened this issue Mar 4, 2025 · 0 comments
Open

[AI Compose] Support many model deployments for AI Service #4880

vhvb1989 opened this issue Mar 4, 2025 · 0 comments
Assignees
Labels
compose composability

Comments

@vhvb1989
Copy link
Member

vhvb1989 commented Mar 4, 2025

Current approach is to use one AI Service per model. It simplifies the e2e experience when folks don't need to think about connections of services and just list the models to deploy.

For this issue, we want to support the default case of just listing the models in azure.yaml as a way to define all models to the same connection/service.

And then, introduce the schema for modeling more than one connection if folks want to do that.

@vhvb1989 vhvb1989 self-assigned this Mar 4, 2025
@kristenwomack kristenwomack added the compose composability label Mar 5, 2025
@kristenwomack kristenwomack changed the title [Ai Compose] Support many model deployments for AI Service [AI Compose] Support many model deployments for AI Service Mar 5, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
compose composability
Projects
None yet
Development

No branches or pull requests

2 participants