Automate the GPU Operator installation NVIDIA AI Enterprise special cases #1055

supertetelman · 2021-11-08T19:11:08Z

NVIDIA AI Enterprise relies on special NVIDIA drivers that enable GPUs in a virtualized environment (vGPU). This is part of the product offerings that are delivered through NGC.

In order for NVIDIA AI Enterprise to function properly with the GPU Operator, there are a few special steps required. This includes using special helm packages for the operator, providing NGC keys to access private container resources, and using special configuration variables.

I propose we do the following:

Break the current GPU Operator install path into a standard install and NVIDIA AI Enterprise install path
Implement the NVIDIA AI Enterprise role tasks
Introduce several new variables to support installing the NVIDIA AI Enterprise path and optionally, support the new NVIDIA License Server used in AI Enterprise (NLS)
Document the new variables, install path, etc.

ajdecon · 2021-12-02T16:14:16Z

This should be addressed by #1059

supertetelman · 2021-12-10T00:32:12Z

We should tested this end-to-end today in master and it looks good!

supertetelman self-assigned this Nov 8, 2021

supertetelman closed this as completed Dec 10, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Automate the GPU Operator installation NVIDIA AI Enterprise special cases #1055

Automate the GPU Operator installation NVIDIA AI Enterprise special cases #1055

supertetelman commented Nov 8, 2021 •

edited

Loading

ajdecon commented Dec 2, 2021

supertetelman commented Dec 10, 2021

Automate the GPU Operator installation NVIDIA AI Enterprise special cases #1055

Automate the GPU Operator installation NVIDIA AI Enterprise special cases #1055

Comments

supertetelman commented Nov 8, 2021 • edited Loading

ajdecon commented Dec 2, 2021

supertetelman commented Dec 10, 2021

supertetelman commented Nov 8, 2021 •

edited

Loading