ensemble multi-GPU #7794

Open
xiazi-yu opened this issue Nov 14, 2024 · 2 comments
Labels
module: server Issues related to the server core question Further information is requested

Comments

@xiazi-yu

In ensemble mode, is it possible to set instance_group gpus: [0, 1, 2, 3] in the config.pbtxt so that every stage of a request stays on the same GPU? That is: preprocessing (GPU0) ---> inference (GPU0), preprocessing (GPU1) ---> inference (GPU1), preprocessing (GPU2) ---> inference (GPU2), preprocessing (GPU3) ---> inference (GPU3), rather than a request crossing devices, e.g. preprocessing (GPU0) ---> inference (GPU1).
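
For context, a minimal sketch of the setting being asked about, as it might appear in the config.pbtxt of one composing model (for example the preprocessing or inference model). The count value is an assumption for illustration; the post does not give one.

```
# Hypothetical instance_group for one composing model of the ensemble.
# count: 1 is assumed; it places one instance on each GPU listed below.
instance_group [
  {
    count: 1
    kind: KIND_GPU
    gpus: [ 0, 1, 2, 3 ]
  }
]
```

This only controls which GPUs host instances of that one model. Whether the next ensemble step then runs on the same GPU as the previous step is a separate scheduling matter, which is what the question is about.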

@xiazi-yu changed the title from "ensemble multi gpu" to "ensemble multi-GPU" on Nov 14, 2024
@rmccorm4
Contributor

Hi @xiazi-yu, I believe it is not possible to force which GPU is selected when scheduling between models within an ensemble if multiple GPU choices (multiple model instances) are available.

CC @Tabrizian as I believe you investigated this in the past, in case you have any extra comments.

@rmccorm4 rmccorm4 added question Further information is requested module: server Issues related to the server core labels Nov 15, 2024
@rmccorm4 rmccorm4 self-assigned this Nov 15, 2024
@xiazi-yu
Author

My ensemble model consists of over a dozen sub-models. It performs significantly worse on multiple GPUs than on a single GPU.

2 participants