[Doc]: Update the vllm distributed Inference and Serving with the new MultiprocessingGPUExecutor #5221

rcarrata · 2024-06-03T15:22:23Z

📚 The doc issue

The vLLM documentation only reflects the possibility to use Ray for running Distributed Inference and Serving with vLLM, even though the #4539 issue is merged and v0.4.3 is released with the MultiprocessingGPUExecutor feature included as an alternative to Ray for single-node inferencing.

Suggest a potential alternative/fix

Update the documentation to reflect the possibility of using MultiprocessingGPUExecutor as an alternative to Ray for single-node inferencing.

youkaichao · 2024-06-03T16:34:47Z

Contributions are welcome!

Also update docs to reflect support for the multiprocessing distributed executor. Resolves vllm-project#4955 Resolves vllm-project#5221

njhill · 2024-06-03T21:55:27Z

I have included these doc updates in #5230.

rcarrata · 2024-06-04T09:53:34Z

thanks for your help @njhill, much appreciated!

rcarrata added the documentation Improvements or additions to documentation label Jun 3, 2024

njhill added a commit to njhill/vllm that referenced this issue Jun 3, 2024

[Core][Doc] Default to multiprocessing for single-node distributed case

89fe0f5

Also update docs to reflect support for the multiprocessing distributed executor. Resolves vllm-project#4955 Resolves vllm-project#5221

njhill mentioned this issue Jun 3, 2024

[Core][Doc] Default to multiprocessing for single-node distributed case #5230

Merged

simon-mo closed this as completed in #5230 Jun 11, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Doc]: Update the vllm distributed Inference and Serving with the new MultiprocessingGPUExecutor #5221

[Doc]: Update the vllm distributed Inference and Serving with the new MultiprocessingGPUExecutor #5221

rcarrata commented Jun 3, 2024

youkaichao commented Jun 3, 2024

njhill commented Jun 3, 2024

rcarrata commented Jun 4, 2024

[Doc]: Update the vllm distributed Inference and Serving with the new MultiprocessingGPUExecutor #5221

[Doc]: Update the vllm distributed Inference and Serving with the new MultiprocessingGPUExecutor #5221

Comments

rcarrata commented Jun 3, 2024

📚 The doc issue

Suggest a potential alternative/fix

youkaichao commented Jun 3, 2024

njhill commented Jun 3, 2024

rcarrata commented Jun 4, 2024