Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature]: Initial support for routers (using AWS Bedrock) #1631

Open
10 tasks
Tracked by #1782
jvstme opened this issue Aug 27, 2024 · 2 comments
Open
10 tasks
Tracked by #1782

[Feature]: Initial support for routers (using AWS Bedrock) #1631

jvstme opened this issue Aug 27, 2024 · 2 comments

Comments

@jvstme
Copy link
Collaborator

jvstme commented Aug 27, 2024

Problem

Some users need to work with both custom and off-the-shelf language models. Deploying off-the-shelf models with dstack may be less convenient and financially viable than using platforms such as AWS Bedrock or Vertex AI, which provide models as a service. In addition, some proprietary models are only available through MaaS platforms. This results in users having to switch back and forth between dstack and MaaS platforms.

Solution

Add support for MaaS platforms starting with AWS Bedrock. Add new router configurations that will allow Bedrock models to be exposed through the dstack-gateway OpenAI-compatible API.

Workaround

Users can use MaaS platforms directly or use proxy solutions to bring MaaS models and models deployed by dstack into a single interface.

Implementation Steps

  • Storing data about models in dstack-gateway
  • Internal API for model management in dstack-gateway
  • Bedrock implementation in dstack-gateway
  • MaaS-specific settings in backend configuration
  • Storing data about models and routers in dstack-server
  • dstack-server routers API
  • Routers Python API
  • dstack apply router configurations
  • dstack router list and dstack router delete commands
  • Documentation
Copy link

github-actions bot commented Oct 3, 2024

This issue is stale because it has been open for 30 days with no activity.

@github-actions github-actions bot added the stale label Oct 3, 2024
@peterschmidt85 peterschmidt85 changed the title [Feature]: Model-as-a-Service with AWS Bedrock [Feature]: Initial support for routers (using AWS Bedrock) Oct 3, 2024
@peterschmidt85 peterschmidt85 mentioned this issue Oct 3, 2024
49 tasks
Copy link

github-actions bot commented Nov 3, 2024

This issue is stale because it has been open for 30 days with no activity.

@github-actions github-actions bot added the stale label Nov 3, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants