optional vllm microservice container build #266
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Description
This PR adds optional
build-arg
's for this Dockerfile: comps/llms/text-generation/vllm/docker/Dockerfile.microserviceIssues
Current dockerfile always defaults to
gpu
Torch installation whileCPU
users might not necessarily need extra packages.This leads to smaller containers for CPU users and faster build.
For users interested to do
gpu
builds all they need to do is build this image this way:Type of change
List the type of change like below. Please delete options that are not relevant.
build-arg
optionARCH
to buildvllm microservice
container for either CPU or GPU.Dependencies
None