installing official vLLM into container (deepjavalibrary#1162)
Qing Lan authored Oct 7, 2023
1 parent d761f5f commit b71f7bd
Showing 2 changed files with 3 additions and 2 deletions.
3 changes: 2 additions & 1 deletion serving/docker/deepspeed.Dockerfile
@@ -15,6 +15,7 @@ ARG djl_version=0.24.0~SNAPSHOT
ARG python_version=3.9
ARG torch_version=2.0.1
ARG torch_vision_version=0.15.2
+ARG vllm_version=0.2.0
ARG deepspeed_wheel="https://publish.djl.ai/deepspeed/deepspeed-nightly-py2.py3-none-any.whl"
ARG flash_attn_wheel="https://publish.djl.ai/flash_attn/flash_attn-1.0.9-cp39-cp39-linux_x86_64.whl"
ARG dropout_layer_norm_wheel="https://publish.djl.ai/flash_attn/dropout_layer_norm-0.1-cp39-cp39-linux_x86_64.whl"
@@ -72,7 +73,7 @@ RUN apt-get update && \
${lmi_vllm_wheel} ${lmi_dist_wheel} ${seq_scheduler_wheel} ${peft_wheel} protobuf==${protobuf_version} \
transformers==${transformers_version} \
mpi4py sentencepiece einops accelerate==${accelerate_version} bitsandbytes==${bitsandbytes_version} \
-optimum==${optimum_version} auto-gptq==${auto_gptq_version} \
+optimum==${optimum_version} auto-gptq==${auto_gptq_version} vllm==${vllm_version} pandas pyarrow \
diffusers[torch]==${diffusers_version} opencv-contrib-python-headless safetensors scipy && \
scripts/install_aitemplate.sh && \
scripts/patch_oss_dlc.sh python && \
2 changes: 1 addition & 1 deletion tests/integration/llm/prepare.py
@@ -704,7 +704,7 @@ def build_vllm_model(model):
options["engine"] = "Python"
options["option.rolling_batch"] = "vllm"
options["option.output_formatter"] = "jsonlines"
-write_model_artifacts(options, ["vllm==0.2.0", "pandas", "pyarrow"])
+write_model_artifacts(options)


def build_unmerged_lora_correctness_model(model):
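Since this commit moves `vllm`, `pandas`, and `pyarrow` out of the per-model requirements in `prepare.py` and into the image built by `deepspeed.Dockerfile`, one way to confirm the pin took effect would be to check the installed version from inside the container. The following is only a minimal sketch: `check_pinned` is a hypothetical helper that is not part of this repository, and the expected version is taken from `ARG vllm_version=0.2.0` in the Dockerfile hunk.

```python
from importlib import metadata


def check_pinned(package, expected, get_version=metadata.version):
    """Return (ok, installed) for a package expected at version `expected`.

    Hypothetical helper for illustration only. `installed` is None when the
    package is not installed in the current environment.
    """
    try:
        installed = get_version(package)
    except metadata.PackageNotFoundError:
        return False, None
    return installed == expected, installed


if __name__ == "__main__":
    # Pin taken from `ARG vllm_version=0.2.0` in the Dockerfile change.
    ok, installed = check_pinned("vllm", "0.2.0")
    print(f"vllm installed={installed} matches_pin={ok}")
```

Run inside the built image, this would report whether the image-level install matches the Dockerfile argument; the `get_version` parameter exists only so the check can be exercised without `vllm` being present.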
