openvino support in vllm #65
Conversation
* Adds Docker image and instructions for building it
* Provides examples for starting the vLLM serving container with OpenAI API endpoint and interacting with it via bash shell
* Includes additional server start-up parameters and a curl example for requesting completion with vLLM

Signed-off-by: Zahidul Haque <zahidul.haque@intel.com>
Signed-off-by: Zahidul Haque <zahidul.haque@intel.com>
for more information, see https://pre-commit.ci Signed-off-by: Zahidul Haque <zahidul.haque@intel.com>
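The commit notes above mention starting the serving container and requesting a completion with curl. As a rough illustration only (the image tag, container name, port, model, and token variable below are assumptions, not necessarily this PR's exact values), such a flow might look like:

```bash
# Illustrative sketch; names and values are assumptions, not the PR's exact commands.

# Build the vLLM OpenVINO image using the script added by this PR.
bash comps/llms/text-generation/vllm-openvino/build_vllm_openvino.sh

# Start the serving container, exposing an OpenAI-compatible API endpoint.
docker run -d --rm --name vllm-openvino-server \
  -p 8000:8000 \
  -e HUGGINGFACEHUB_API_TOKEN="${HUGGINGFACEHUB_API_TOKEN}" \
  vllm:openvino \
  --model meta-llama/Llama-2-7b-hf --port 8000

# Request a completion from the running server with curl.
curl http://localhost:8000/v1/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "meta-llama/Llama-2-7b-hf", "prompt": "Deep learning is", "max_tokens": 32}'
```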
@zahidulhaque please get all tests passing. Right now a typo issue was detected.
I would add to the README some motivation for using the OpenVINO backend inference service, a few sentences like: the runtime generates hardware-target-specific optimizations...
Signed-off-by: Zahidul Haque <zahidul.haque@intel.com>
for more information, see https://pre-commit.ci
Signed-off-by: Zahidul Haque <zahidul.haque@intel.com>
Looks good!
Noted changes pertaining to the license, the Hugging Face token rename, the service name change, and the handling of results. Trusting your tests. Next time please strive for smaller PRs! Easier to digest all round.
Review comments on comps/llms/text-generation/vllm-openvino/build_vllm_openvino.sh (outdated, resolved)
Please contribute an e2e test for this microservice, like this one: https://github.com/opea-project/GenAIComps/blob/main/tests/test_reranks.sh. Please name it as
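For reference, a minimal sketch of what such an e2e test might look like, loosely modeled on the linked test_reranks.sh. The image tag, container name, model, port, and response check below are assumptions rather than this PR's actual values:

```bash
#!/bin/bash
# Hypothetical e2e test sketch for the vllm-openvino microservice;
# all names, ports, and the model are illustrative assumptions.
set -xe

WORKPATH=$(dirname "$PWD")
ip_address=$(hostname -I | awk '{print $1}')

function build_docker_images() {
    cd "$WORKPATH"
    bash comps/llms/text-generation/vllm-openvino/build_vllm_openvino.sh
}

function start_service() {
    docker run -d --rm --name test-vllm-openvino -p 8000:8000 \
        -e HUGGINGFACEHUB_API_TOKEN="${HUGGINGFACEHUB_API_TOKEN}" \
        vllm:openvino --model meta-llama/Llama-2-7b-hf --port 8000
    sleep 60  # allow time for the model to load
}

function validate_microservice() {
    result=$(curl -s "http://${ip_address}:8000/v1/completions" \
        -H "Content-Type: application/json" \
        -d '{"model": "meta-llama/Llama-2-7b-hf", "prompt": "What is AI?", "max_tokens": 16}')
    # A successful completion response should contain a "choices" field.
    echo "$result" | grep -q "choices" || { echo "Test failed"; exit 1; }
}

function stop_docker() {
    docker stop test-vllm-openvino || true
}

build_docker_images
start_service
validate_microservice
stop_docker
echo "Test passed"
```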
Signed-off-by: Zahidul Haque <zahidul.haque@intel.com>
Signed-off-by: Zahidul Haque <zahidul.haque@intel.com>
@chensuyue, I am working on writing the test case and will create a separate PR for it.
Signed-off-by: Zahidul Haque <zahidul.haque@intel.com>
Signed-off-by: gadmarkovits <gad.markovits@intel.com>

Signed-off-by: Zahidul Haque <zahidul.haque@intel.com>
Signed-off-by: sharanshirodkar7 <ssharanshirodkar7@gmail.com>
Description
This PR adds support for OpenVINO as a backend inference engine for vLLM.
Type of change