Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

AMD GPUs support #81

Open
ybannay opened this issue Sep 30, 2024 · 2 comments
Open

AMD GPUs support #81

ybannay opened this issue Sep 30, 2024 · 2 comments

Comments

@ybannay
Copy link

ybannay commented Sep 30, 2024

Hi guys,

I played with the operator and it worked well for NVIDIA GPUs, but not for AMD. Do you have in mind to add support for AMD GPUs or if we can contribute to the project adding the support.

Thanks

@rdjjke
Copy link
Collaborator

rdjjke commented Sep 30, 2024

Hello.

Thank you for the first issue in this repo!

We’re currently focused on improving the stability of Soperator (tests and the like), so we had no plans to support other GPUs shortly.

But if you implement it yourself, we will help you.

However, the task looks difficult. For NVIDIA GPUs, we use nvidia-container-toolkit for propagating device drivers and some low-level libraries from the “Host” to the “Jail” environment. I'm not sure if there is anything like that for AMD. Anyway, the main difficulties will be with container images, not with the operator code.

@rdjjke
Copy link
Collaborator

rdjjke commented Sep 30, 2024

BTW, you said that you played with the operator, and in what environment (cloud?) if it's not a secret? We haven't tested Soperator anywhere outside of Nebius Сloud, so I'm wondering where else we can assume it works.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants