Add an example of multi-host inference using SGLang #371
Comments
SGLang has particular advantages for DeepSeek inference: it delivers high throughput and was early to support MLA.
If I remember correctly, SGLang doesn't support multi-host right now; PP support was still unfinished as of 2024 Q4. See sgl-project/sglang#1487.
I searched the documentation, and they do support multi-host with TP.
TP: yes. PP: not yet.
I think that's fine; it is useful to showcase SGLang here!
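For anyone following along, here is a rough sketch of what a multi-host TP launch could look like, one process per node. It is not the example this issue asks for, only an illustration. The helper `sglang_launch_argv`, the model path, the leader hostname, and the port are all hypothetical, and the flag names (`--tp`, `--nnodes`, `--node-rank`, `--dist-init-addr`) are assumptions that should be checked against `python3 -m sglang.launch_server --help` for the installed SGLang version.

```python
import shlex


def sglang_launch_argv(model_path, node_rank, nnodes, gpus_per_node,
                       leader_addr, dist_port=20000):
    """Build the argv for one node of a multi-host tensor-parallel server.

    Hypothetical helper: the flag names below are assumptions and should be
    verified against the installed SGLang version before use.
    """
    if not 0 <= node_rank < nnodes:
        raise ValueError(f"node_rank {node_rank} is outside [0, {nnodes})")
    # With TP only (no PP), the tensor-parallel size spans every GPU on every node.
    tp_size = nnodes * gpus_per_node
    return [
        "python3", "-m", "sglang.launch_server",
        "--model-path", model_path,
        "--tp", str(tp_size),
        "--nnodes", str(nnodes),
        "--node-rank", str(node_rank),
        # Every node points at the same rendezvous address on the leader.
        "--dist-init-addr", f"{leader_addr}:{dist_port}",
    ]


if __name__ == "__main__":
    # Print the command each of two 8-GPU nodes would run (placeholder values).
    for rank in range(2):
        argv = sglang_launch_argv("/models/example", rank, 2, 8, "leader.example")
        print(shlex.join(argv))
```

In an LWS deployment, the leader address, group size, and worker index would presumably come from the environment that LWS injects into each pod rather than from hard-coded values.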
What would you like to be added:
Add an example of multi-host inference using SGLang.
Why is this needed:
Recently, there have been some issues asking how to run LWS with SGLang.
Completion requirements:
This enhancement requires the following artifacts:
The artifacts should be linked in subsequent comments.