[Feature]: on-prem gateway #1402

Open
morfeusys opened this issue Jul 6, 2024 · 2 comments

@morfeusys

Problem

Currently, I have to create a gateway on AWS or another supported cloud before I can start any service with dstack.
But if I have an on-prem pool of hardware, I don't actually need to publish my inference API or configure DNS just to run a scalable service.

For example, a developer needs to start an inference service for some model for testing and development purposes.
For now, they have to take one extra step before they can send HTTP requests: configure a gateway and DNS. That may be too complicated if they only want to utilise existing on-prem hardware.
Moreover, they may want the service to be available only on a private on-prem network instead of publishing it in the cloud.
Also, using cloud services such as AWS may be prohibited by company security policy.

Solution

A gateway component that can be started locally.

Workaround

Run a task instead of a service:

  • lack of scaling
  • port forwarding

Would you like to help us implement this feature by sending a PR?

No

@r4victor
Collaborator

r4victor commented Jul 8, 2024

@morfeusys, thanks for the issue. Support for on-prem gateways makes a lot of sense and should certainly be on our roadmap. The simplest implementation would allow you to provision a gateway on an on-prem instance, and then you would configure a domain to point to the gateway instance IP – same as for cloud gateways. This should work both for public and private networks.

Now, you also mention that you "don't need configure any DNS". Currently, dstack uses domains to identify and route services. Sure, you can send service requests directly to the instance IP, but this won't allow scaling beyond one gateway instance (which is currently not possible but planned).
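
To illustrate why a domain matters for routing, here is a minimal sketch of host-based routing in general. It is not dstack's actual gateway code: the `HostRouter` class, the domains, and the replica addresses are all hypothetical, and the round-robin choice is just one simple assumption about how replicas might be picked.

```python
from itertools import cycle


class HostRouter:
    """Toy host-based router (hypothetical, not dstack's implementation).

    Maps a service domain to a round-robin iterator over its replicas.
    """

    def __init__(self):
        self._replicas = {}

    def register(self, domain, replicas):
        # Store a cycling iterator so repeated lookups rotate through replicas.
        self._replicas[domain.lower()] = cycle(list(replicas))

    def route(self, host_header):
        # Drop an optional ":port" suffix and normalise case.
        # IPv6 literals are not handled in this sketch.
        domain = host_header.split(":", 1)[0].lower()
        replicas = self._replicas.get(domain)
        if replicas is None:
            # Unknown host (e.g. a bare IP): nothing identifies the service.
            return None
        return next(replicas)


# Hypothetical services on a private network.
router = HostRouter()
router.register("llama.example.internal", ["10.0.0.11:8000", "10.0.0.12:8000"])
router.register("whisper.example.internal", ["10.0.0.21:8000"])
```

With this kind of lookup, a request whose `Host` header is a bare IP carries nothing that tells the gateway which service it is meant for, which is why a domain-free setup needs some other way to identify services.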

Do you have a particular vision of how services without domains should work? Would on-prem gateways with a DNS setup work for you (it should be the easiest way to support on-prem gateways)?

@peterschmidt85 mentioned this issue Jul 9, 2024
42 tasks
@peterschmidt85
Contributor

This issue is stale because it has been open for 30 days with no activity.


3 participants