Skip to content

Commit

Permalink
Add triton P100 FP16 patch - experimental (not tested if working)
Browse files Browse the repository at this point in the history
  • Loading branch information
Phil committed Jul 10, 2024
1 parent 02ecaab commit 8e73ce1
Showing 1 changed file with 4 additions and 0 deletions.
4 changes: 4 additions & 0 deletions Dockerfile
Original file line number Diff line number Diff line change
Expand Up @@ -205,3 +205,7 @@ ENV VLLM_USAGE_SOURCE production-docker-image

ENTRYPOINT ["python3", "-m", "vllm.entrypoints.openai.api_server"]
#################### OPENAI API SERVER ####################

# patch triton for P100 FP16
RUN --mount=type=cache,target=/root/.cache/pip \
python3 -m pip install --extra-index-url https://sasha0552.github.io/vllm-ci/ --force-reinstall triton

0 comments on commit 8e73ce1

Please sign in to comment.