Add documentation on how to do incremental builds #2796

pcmoritz · 2024-02-07T02:32:43Z

On my system, this reduces compilation time from 2:40 min to 0:30 min, therefore enabling faster development iterations.

pcmoritz · 2024-02-07T03:39:19Z

One thing I noticed is the old python setup.py develop is also doing fast incremental compilation, but unfortunately that's deprecated and probably most people don't use it any more (e.g. it prints a big warning banner that it shouldn't be used any more and links to pypa/setuptools#917). It also gives 30s for the build.

The reason why python setup.py develop is fast is that it uses a stable temporary directory for the build, and it also uses the system torch path by default and doesn't copy the files into an unstable temporary directory like pip install -e . does.

Instead of doing something like in the PR, we could encourage people to use python setup.py develop, e.g. by leaving a comment about incremental compilation in setup.py, or by including it in the docs.

zhuohan123

Thanks for the feature! Left a small comment

zhuohan123 · 2024-02-07T08:06:48Z

setup.py

@@ -22,6 +22,9 @@
 ROCM_SUPPORTED_ARCHS = {"gfx90a", "gfx942"}
 # SUPPORTED_ARCHS = NVIDIA_SUPPORTED_ARCHS.union(ROCM_SUPPORTED_ARCHS)

+if "VLLM_INCREMENTAL_BUILD_TORCH_PATH" in os.environ:


Can you add some document on how to use this? Setting the env var to

VLLM_INCREMENTAL_BUILD_TORCH_PATH=`python -c "import torch; print(torch.__path__[0])"`

is confusing to other users.

Thanks, I added the docs, let me know what you think :)

I also tried to do VLLM_INCREMENTAL_BUILD=1 to make it easier btw, but I couldn't figure out how to get the global torch path from within setup.py -- I tried both via python utilities and using subprocess (both with and without shell), but I think there is some environment variable set which points the package dir to the temporary directory. It is probably best not to try to hack around this further :)

pcmoritz · 2024-02-07T19:20:55Z

I was thinking about this a little more, maybe it is best to just document ’python setup.py develop' since it has even less overhead. The other approach uses an internal variable in torch so not clear it is better. I'll make that change.

zhuohan123

LGTM! Thanks!

[ROCm] Fix build problem resulted from previous commit related to FP8 kv-cache support (vllm-project#2790) Add documentation on how to do incremental builds (vllm-project#2796) [Ray] Integration compiled DAG off by default (vllm-project#2471) Disable custom all reduce by default (vllm-project#2808) add usage context removed usage_context from Engine_args Move IO to another process added http request [ROCm] support Radeon™ 7900 series (gfx1100) without using flash-attention (vllm-project#2768) Add documentation section about LoRA (vllm-project#2834) Refactor 2 awq gemm kernels into m16nXk32 (vllm-project#2723) Co-authored-by: Chunan Zeng <chunanzeng@Chunans-Air.attlocal.net> Added additional arg for from_engine_args comments

pcmoritz added 3 commits February 6, 2024 18:17

Enable incremental build

e955203

yapf

0ad56b8

remove hack

90dbcb5

zhuohan123 approved these changes Feb 7, 2024

View reviewed changes

pcmoritz added 2 commits February 7, 2024 10:26

add documentation

5d94f52

add docs

476c120

pcmoritz added 2 commits February 7, 2024 11:42

update

ec1941f

format

2c33d74

pcmoritz changed the title ~~Optionally enable incremental build~~ Add documentation on how to do incremental builds Feb 7, 2024

zhuohan123 approved these changes Feb 7, 2024

View reviewed changes

zhuohan123 merged commit 931746b into vllm-project:main Feb 7, 2024
17 checks passed

alexm-neuralmagic pushed a commit to neuralmagic/nm-vllm that referenced this pull request Feb 13, 2024

Add documentation on how to do incremental builds (vllm-project#2796)

2da4b50

jvmncs pushed a commit to jvmncs/vllm that referenced this pull request Feb 14, 2024

Add documentation on how to do incremental builds (vllm-project#2796)

e1152b1

xjpang pushed a commit to xjpang/vllm that referenced this pull request Feb 20, 2024

Add documentation on how to do incremental builds (vllm-project#2796)

9c05340

xjpang pushed a commit to xjpang/vllm that referenced this pull request Feb 22, 2024

Add documentation on how to do incremental builds (vllm-project#2796)

7a0823f

andy-neuma mentioned this pull request Feb 23, 2024

andy/bump main to v0.3.2 neuralmagic/nm-vllm#49

Closed

xjpang pushed a commit to xjpang/vllm that referenced this pull request Mar 4, 2024

Add documentation on how to do incremental builds (vllm-project#2796)

0f9d4ff

esmeetu mentioned this pull request Mar 14, 2024

[Doc] fix doc to install build requirement first #3386

Closed

Temirulan pushed a commit to Temirulan/vllm-whisper that referenced this pull request Sep 6, 2024

Add documentation on how to do incremental builds (vllm-project#2796)

1015a28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add documentation on how to do incremental builds #2796

Add documentation on how to do incremental builds #2796

pcmoritz commented Feb 7, 2024 •

edited

Loading

pcmoritz commented Feb 7, 2024 •

edited

Loading

zhuohan123 left a comment

zhuohan123 Feb 7, 2024

pcmoritz Feb 7, 2024

pcmoritz Feb 7, 2024

pcmoritz commented Feb 7, 2024 •

edited

Loading

zhuohan123 left a comment

Add documentation on how to do incremental builds #2796

Add documentation on how to do incremental builds #2796

Conversation

pcmoritz commented Feb 7, 2024 • edited Loading

pcmoritz commented Feb 7, 2024 • edited Loading

zhuohan123 left a comment

Choose a reason for hiding this comment

zhuohan123 Feb 7, 2024

Choose a reason for hiding this comment

pcmoritz Feb 7, 2024

Choose a reason for hiding this comment

pcmoritz Feb 7, 2024

Choose a reason for hiding this comment

pcmoritz commented Feb 7, 2024 • edited Loading

zhuohan123 left a comment

Choose a reason for hiding this comment

pcmoritz commented Feb 7, 2024 •

edited

Loading

pcmoritz commented Feb 7, 2024 •

edited

Loading

pcmoritz commented Feb 7, 2024 •

edited

Loading