Skip to content

Commit

Permalink
feat: nvidia driver extension
Browse files Browse the repository at this point in the history
This patch deprecates the NVIDIA toolkit extension and introduces a new
nvidia-driver extension (in production/lts versions and open
source/proprietary flavors). The NVIDIA container toolkit must be
installed independently, via a future Talos extension, the NVIDIA GPU
Operator, or by the cluster administator.

The extension depends on the new glibc extension (siderolabs#473) and participates
in its filesystem subroot by installing all the NVIDIA components in it.

Finally, the extension runs a service that will bind mount this glibc
subroot at `/run/nvidia/driver` and run the `nvidia-persistenced`
daemon.

This careful setup allows the NVIDIA GPU Operator to utilize this
extension as if it were a traditional NVIDIA driver container.

Signed-off-by: Jean-Francois Roy <jf@devklog.net>
  • Loading branch information
jfroy committed Sep 23, 2024
1 parent d474848 commit 54146b5
Show file tree
Hide file tree
Showing 68 changed files with 475 additions and 1,089 deletions.
8 changes: 4 additions & 4 deletions Makefile
Original file line number Diff line number Diff line change
Expand Up @@ -76,12 +76,12 @@ TARGETS += kata-containers
TARGETS += mdadm
TARGETS += mei
TARGETS += nut-client
TARGETS += nvidia-container-toolkit-lts
TARGETS += nvidia-container-toolkit-production
TARGETS += nvidia-fabricmanager-lts
TARGETS += nvidia-fabricmanager-production
TARGETS += nvidia-open-gpu-kernel-modules-lts
TARGETS += nvidia-open-gpu-kernel-modules-production
TARGETS += nvidia-driver-lts
TARGETS += nvidia-driver-production
TARGETS += nvidia-driver-proprietary-lts
TARGETS += nvidia-driver-proprietary-production
TARGETS += qemu-guest-agent
TARGETS += qlogic-firmware
TARGETS += realtek-firmware
Expand Down
3 changes: 1 addition & 2 deletions go.work
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,6 @@ go 1.22

use (
./examples/hello-world-service/src
./nvidia-gpu/nvidia-container-toolkit/nvidia-container-runtime-wrapper
./nvidia-gpu/nvidia-container-toolkit/nvidia-persistenced-wrapper
./nvidia-gpu/nvidia-driver/service
./storage/iscsi-tools/iscsid-wrapper
)
10 changes: 0 additions & 10 deletions nvidia-gpu/nonfree/kmod-nvidia/lts/manifest.yaml

This file was deleted.

4 changes: 0 additions & 4 deletions nvidia-gpu/nonfree/kmod-nvidia/production/files/nvidia.conf

This file was deleted.

10 changes: 0 additions & 10 deletions nvidia-gpu/nonfree/kmod-nvidia/production/manifest.yaml

This file was deleted.

30 changes: 0 additions & 30 deletions nvidia-gpu/nvidia-container-toolkit/DEVELOPMENT.md

This file was deleted.

91 changes: 0 additions & 91 deletions nvidia-gpu/nvidia-container-toolkit/README.md

This file was deleted.

10 changes: 0 additions & 10 deletions nvidia-gpu/nvidia-container-toolkit/lts/manifest.yaml

This file was deleted.

2 changes: 0 additions & 2 deletions nvidia-gpu/nvidia-container-toolkit/lts/vars.yaml

This file was deleted.

This file was deleted.

This file was deleted.

This file was deleted.

This file was deleted.

Loading

0 comments on commit 54146b5

Please sign in to comment.