Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enable vLLM Gaudi Support for LLM Service #126

Closed
wants to merge 6 commits into from

Conversation

tianyil1
Copy link
Contributor

@tianyil1 tianyil1 commented May 31, 2024

Description

This PR enabled the vLLM Gaudi support for LLM service, which leveraged the habana/vllm-fork, and will be converted to a formal pull request until the habana team officially releases the vLLM support.

Issues

n/a.

Type of change

List the type of change like below. Please delete options that are not relevant.

  • New feature (non-breaking change which adds new functionality)
    • This PR enabled the vLLM Gaudi support for LLM service.

Dependencies

n/a.

Tests

This PR is tested in the Gaudi2 server with:
2 sockers Intel(R) Xeon(R) Platinum 8368 CPU @ 2.40GHz
8 Gaudi nodes, HL-SMI Version: hl-1.14.0-fw-48.0.1.0 Driver Version: 1.14.0-9e8ecf8

  • vLLM Gaudi Service
    image
  • Client Curl Test
    image

@tianyil1
Copy link
Contributor Author

tianyil1 commented May 31, 2024

This draft PR has been submitted here. Please have a review and check. @Jian-Zhang @carsonwang

Signed-off-by: tianyil1 <tianyi.liu@intel.com>
Signed-off-by: tianyil1 <tianyi.liu@intel.com>
Signed-off-by: tianyil1 <tianyi.liu@intel.com>
Signed-off-by: tianyil1 <tianyi.liu@intel.com>
Signed-off-by: tianyil1 <tianyi.liu@intel.com>
@tianyil1 tianyil1 closed this Jun 6, 2024
@tianyil1 tianyil1 deleted the vllm branch June 6, 2024 05:52
@tianyil1 tianyil1 restored the vllm branch June 6, 2024 05:52
@tianyil1 tianyil1 deleted the vllm branch June 6, 2024 05:53
@tianyil1
Copy link
Contributor Author

tianyil1 commented Jun 6, 2024

Please refer to this new PR based on the officially habana vLLM: #137

lkk12014402 pushed a commit that referenced this pull request Aug 8, 2024
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant