Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

optional vllm microservice container build #266

Merged
merged 2 commits into from
Jul 4, 2024

Conversation

ashahba
Copy link
Collaborator

@ashahba ashahba commented Jun 30, 2024

Description

This PR adds optional build-arg's for this Dockerfile: comps/llms/text-generation/vllm/docker/Dockerfile.microservice

Issues

Current dockerfile always defaults to gpu Torch installation while CPU users might not necessarily need extra packages.
This leads to smaller containers for CPU users and faster build.

For users interested to do gpu builds all they need to do is build this image this way:

docker build --build-arg ARCH='gpu' -f comps/llms/text-generation/vllm/docker/Dockerfile.microservice . -t opea/llm-vllm:latest

Type of change

List the type of change like below. Please delete options that are not relevant.

  • extra build-arg option ARCH to build vllm microservice container for either CPU or GPU.

Dependencies

None

ashahba added 2 commits June 30, 2024 16:49
Signed-off-by: Abolfazl Shahbazi <abolfazl.shahbazi@intel.com>
Signed-off-by: Abolfazl Shahbazi <abolfazl.shahbazi@intel.com>
@hshen14 hshen14 requested a review from lvliang-intel July 1, 2024 23:43
@chensuyue chensuyue merged commit 9637553 into opea-project:main Jul 4, 2024
7 checks passed
@ashahba ashahba deleted the ashahba/vllm-docker branch July 4, 2024 17:35
ashahba added a commit to ashahba/GenAIComps that referenced this pull request Jul 4, 2024
Signed-off-by: Abolfazl Shahbazi <abolfazl.shahbazi@intel.com>
yogeshmpandey pushed a commit to yogeshmpandey/GenAIComps that referenced this pull request Jul 10, 2024
Signed-off-by: Abolfazl Shahbazi <abolfazl.shahbazi@intel.com>
Signed-off-by: Yogesh Pandey <yogesh.pandey@intel.com>
dwhitena pushed a commit to predictionguard/GenAIComps that referenced this pull request Jul 24, 2024
Signed-off-by: Abolfazl Shahbazi <abolfazl.shahbazi@intel.com>
Signed-off-by: Daniel Whitenack <whitenack.daniel@gmail.com>
lkk12014402 pushed a commit that referenced this pull request Aug 8, 2024
Signed-off-by: PeterYang12 <yuhan.yang@intel.com>
Co-authored-by: chen-hu-97 <hu1.chen@intel.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants