Automate the GPU Operator installation NVIDIA AI Enterprise special cases #1055

Closed
4 tasks done
supertetelman opened this issue Nov 8, 2021 · 2 comments

@supertetelman
Collaborator

supertetelman commented Nov 8, 2021

NVIDIA AI Enterprise relies on special NVIDIA drivers that enable GPUs in a virtualized environment (vGPU). This is part of the product offerings that are delivered through NGC.

For NVIDIA AI Enterprise to function properly with the GPU Operator, a few special steps are required: using special Helm packages for the operator, providing NGC keys to access private container resources, and setting special configuration variables.

I propose we do the following:

  • Break the current GPU Operator install path into a standard install path and an NVIDIA AI Enterprise install path
  • Implement the NVIDIA AI Enterprise role tasks
  • Introduce several new variables to support the NVIDIA AI Enterprise install path and, optionally, the new NVIDIA License Server (NLS) used in AI Enterprise
  • Document the new variables, install path, etc.
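
As a rough sketch of how the two install paths could branch (written in Python purely for illustration; the actual work would be role tasks, and every name below, including `GpuOperatorConfig`, `nvaie_enabled`, `ngc_api_key`, `nls_configured` and `plan_install`, is hypothetical and not taken from the repo):

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class GpuOperatorConfig:
    # All field names are hypothetical, chosen only for illustration.
    nvaie_enabled: bool = False        # take the NVIDIA AI Enterprise path?
    ngc_api_key: Optional[str] = None  # key for private NGC container resources
    nls_configured: bool = False       # optional NVIDIA License Server (NLS) support


def plan_install(cfg: GpuOperatorConfig) -> List[str]:
    """Return the ordered, human-readable steps for the selected install path."""
    if not cfg.nvaie_enabled:
        # Standard path: nothing special is needed.
        return ["install the standard GPU Operator Helm chart"]

    # NVIDIA AI Enterprise path: private resources need an NGC key.
    if not cfg.ngc_api_key:
        raise ValueError("the NVIDIA AI Enterprise path requires an NGC API key")

    steps = ["provide the NGC key for access to private container resources"]
    if cfg.nls_configured:
        steps.append("apply the NLS licensing configuration")
    steps.append("install the NVIDIA AI Enterprise GPU Operator Helm package")
    return steps
```

The point of the sketch is only that a single switch selects the path, the NGC key is validated up front, and NLS support stays optional.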
@supertetelman supertetelman self-assigned this Nov 8, 2021
@ajdecon
Collaborator

ajdecon commented Dec 2, 2021

This should be addressed by #1059

@supertetelman
Collaborator Author

We tested this end-to-end today in master and it looks good!
