Add an example of multi-host inference using SGLang #371

Closed
3 tasks
yankay opened this issue Feb 8, 2025 · 7 comments · Fixed by #377
Assignees
Labels
kind/feature Categorizes issue or PR as related to a new feature.

Comments

@yankay
Member

yankay commented Feb 8, 2025

What would you like to be added:

Add an example of multi-host inference using SGLang.

Why is this needed:

Recently, there have been several issues asking how to run LWS with SGLang.

Completion requirements:

This enhancement requires the following artifacts:

  • Design doc
  • API change
  • Docs update

The artifacts should be linked in subsequent comments.

yankay added the kind/feature label Feb 8, 2025
@hwdef
Member

hwdef commented Feb 8, 2025

SGLang has particular advantages for DeepSeek inference: it delivers high throughput and added MLA support early.

@ahg-g
Contributor

ahg-g commented Feb 8, 2025

@yankay @hwdef would you like to submit one?

@yankay
Member Author

yankay commented Feb 9, 2025

> @yankay @hwdef would you like to submit one?

/assign

@kerthcet
Contributor

kerthcet commented Feb 9, 2025

If I remember correctly, SGLang doesn't support multi-host right now; PP support was still unfinished as of 2024 Q4. See sgl-project/sglang#1487.

@kerthcet
Contributor

kerthcet commented Feb 9, 2025

@panpan0000

TP -- yes, PP -- not yet

@ahg-g
Copy link
Contributor

ahg-g commented Feb 10, 2025

I think it is OK; it is useful to showcase SGLang here!


5 participants