
cuda-dev #158

Closed
wants to merge 8 commits into from

Conversation

woensug-choi
Contributor

The rocker cuda PR does not include the executables of the CUDA libraries, because that PR only copied the apt-get installs from Nvidia cudagl's base Dockerfile. For our purposes, all of those executables are required to compile the .cu source code of the multibeam sonar plugin.

I believe the --cuda flag without the CUDA executables is still useful for cases that do not require compiling .cu source files, so I've made a new --cuda-dev flag that includes everything required.

I've tested the functionality with the Dave project's Multibeam Sonar Plugin.
https://github.com/Field-Robotics-Lab/DAVE
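
For context, a typical invocation of the new flag would presumably look something like this (a hedged sketch assuming rocker's usual form of rocker [extensions] image [command]; the image and trailing command here are only placeholders):

# hypothetical usage of the flag added in this PR
rocker --cuda-dev ubuntu:20.04 bash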

@woensug-choi
Contributor Author

@tfoote I hope to see this merged, or to receive comments if you think it's not ready :)

Collaborator

@tfoote tfoote left a comment

Overall this looks like a good direction.

For this to go in it definitely needs some unit tests, and I'd like a better story about how the different layers of base, runtime, and development can be used together, i.e. get this working with #126. The lack of tests and validation of that cooperation is why I haven't pushed #126 further forward.

It should also not collide or conflict with the generic --nvidia option.


# nvidia-container-runtime
ENV NVIDIA_VISIBLE_DEVICES all
ENV NVIDIA_DRIVER_CAPABILITIES compute,utility
Collaborator

These options here need to mutate/extend the driver capability, not just overwrite it; e.g. it could be used for graphics too.

Contributor Author

Changed to

ENV NVIDIA_VISIBLE_DEVICES ${NVIDIA_VISIBLE_DEVICES:-all}
ENV NVIDIA_DRIVER_CAPABILITIES ${NVIDIA_DRIVER_CAPABILITIES:+$NVIDIA_DRIVER_CAPABILITIES,}compute,utility
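
The :- and :+ forms are standard shell-style parameter expansions (Dockerfile ENV substitution understands them as well), so capabilities set by another extension such as --nvidia are kept rather than overwritten. A quick illustration of how the second line resolves:

# with the variable unset, only the new capabilities are emitted
unset NVIDIA_DRIVER_CAPABILITIES
echo "${NVIDIA_DRIVER_CAPABILITIES:+$NVIDIA_DRIVER_CAPABILITIES,}compute,utility"
# -> compute,utility

# with the variable already set (e.g. to graphics), it is extended instead
NVIDIA_DRIVER_CAPABILITIES=graphics
echo "${NVIDIA_DRIVER_CAPABILITIES:+$NVIDIA_DRIVER_CAPABILITIES,}compute,utility"
# -> graphics,compute,utility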


ENV CUDA_VERSION 11.2.1

ENV NV_CUDA_LIB_VERSION "11.2.1-1"
Collaborator

Having all of these versions embedded seems very fragile. Is there a way to do this more generically? What's the upgrade path, and how can we let people adjust the version, looking forward to future releases and other platforms?

Contributor Author

The latest version gets updated constantly. It's now at 11.6.0 (https://gitlab.com/nvidia/container-images/cuda/blob/master/dist/11.6.0/ubuntu2004/base/Dockerfile)

How could we exploit https://gitlab.com/nvidia/container-images/cuda/-/blob/master/manifests/cuda.yaml? If we could load this during the build and have the user type the wanted CUDA version (e.g. 11.6.0), it could be much more general.
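
One hedged sketch of that idea, assuming the dist/<version>/<ubuntu release>/base/Dockerfile layout linked above stays stable: resolve NVIDIA's own pins at snippet-generation time from a user-supplied CUDA version instead of hard-coding them.

# hypothetical: CUDA_VERSION and UBUNTU_RELEASE would come from a user option
CUDA_VERSION=11.6.0
UBUNTU_RELEASE=ubuntu2004
curl -sL "https://gitlab.com/nvidia/container-images/cuda/-/raw/master/dist/${CUDA_VERSION}/${UBUNTU_RELEASE}/base/Dockerfile" \
    | grep '^ENV '    # reuse NVIDIA's own ENV pins (CUDA_VERSION, NV_CUDA_LIB_VERSION, ...)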

Collaborator

As you mention, the latest gets updated frequently. How can we use the latest version without hard-coding all these versions in our codebase? If they're changing the version constantly, do we have to check that it works and update our codebase too? How do we get notified and know what to update when? Will people's usage break without updates? Will we break users if we update too much? How do our users know/declare what versions they get/want?

@woensug-choi
Contributor Author

What kinds of unit tests do we need? I've been using this branch with a Gazebo plugin that requires CUDA compilation.

What do you mean by the layer story? Both the devel and runtime images are based on the base image.

@tfoote
Collaborator

tfoote commented Mar 8, 2022

For tests, we need tests that provide coverage of the code paths. They need to run examples that exercise those code paths and the CUDA capabilities, so that we know it's still working when we deploy it. We have 84% code coverage on the non-nvidia stuff; if you run it locally with the nvidia tests it's higher. But your content has zero unit tests.

For the story, we need to be able to explain to users: if you want this feature, do this; if you want that feature, do that; and if you want this and that, do they work together? In particular, since this is taking the capabilities of #126 and going further, it should probably be done as a series of layers that first enable the base and then install the specific tools, so that you have core nvidia, then extend it for the runtime, and then extend that for dev tools, instead of having each plugin potentially installing different, separate versions of the CUDA tools.
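
A rough sketch of that layering, with hypothetical package names (these are not the actual rocker snippets):

# layer 1 (--nvidia): expose the GPU and the needed driver capabilities
ENV NVIDIA_VISIBLE_DEVICES ${NVIDIA_VISIBLE_DEVICES:-all}
ENV NVIDIA_DRIVER_CAPABILITIES ${NVIDIA_DRIVER_CAPABILITIES:+$NVIDIA_DRIVER_CAPABILITIES,}compute,utility

# layer 2 (runtime): CUDA runtime libraries on top of the base
RUN apt-get update && apt-get install -y --no-install-recommends \
    cuda-runtime-11-2 \
    && rm -rf /var/lib/apt/lists/*

# layer 3 (development): compiler and headers on top of the runtime
RUN apt-get update && apt-get install -y --no-install-recommends \
    cuda nvidia-cuda-dev \
    && rm -rf /var/lib/apt/lists/*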

@tfoote tfoote added the nvidia (Connected to the nvidia extension) label Mar 9, 2022
@woensug-choi
Contributor Author

Hmm... Very much unfamiliar with the code coverage concept. Let me get back to you when I understand what it is about :D

@tfoote
Collaborator

tfoote commented Mar 17, 2022

There's an overview of how to run the tests and you'll see reports here: https://github.com/osrf/rocker#testing

@woensug-choi
Contributor Author

RUN apt-get update && apt-get install -y --no-install-recommends \
    cuda nvidia-cuda-dev \
    && rm -rf /var/lib/apt/lists/*

@tfoote I've made a change to use the package manager. It now avoids specifying versions for dependent packages. I've tested it in WSL Ubuntu 20.04, and it should also work for a generic Ubuntu installation.

Collaborator

@tfoote tfoote left a comment

I've spent a few hours converging #182 and this into the branch for #126, following the installation instructions from https://developer.nvidia.com/cuda-downloads?target_os=Linux&target_arch=x86_64&Distribution=Ubuntu&target_version=22.04&target_type=deb_network as closely as possible.

There seems to be an issue with installing cuda and nvidia-cuda-dev simultaneously, but I've run out of time to iterate on that right now, so I've pushed it into the draft.

@@ -0,0 +1,23 @@
RUN apt-get update && apt-get install -y --no-install-recommends \
Collaborator

I also found that nvidia-cuda-dev is available all the way back to bionic (https://packages.ubuntu.com/bionic/nvidia-cuda-dev), so by default we shouldn't necessarily use the NVIDIA packages but instead leverage the system ones.

Contributor Author

I've tried it with and without the nvidia-cuda-dev installation block; both worked for me. There's no need to include nvidia-cuda-dev for pre-caching purposes.

@tfoote
Collaborator

tfoote commented Jul 13, 2022

@woensug-choi I think I've merged all of this into #126; could you verify that it meets your needs now?

Note: I changed the argument to --cuda

@woensug-choi
Contributor Author

@tfoote I confirm that everything I need is merged into #126. Closing this PR.

@tfoote
Collaborator

tfoote commented Jul 14, 2022

Great, thanks for verifying.

@woensug-choi woensug-choi deleted the cuda-dev branch November 30, 2022 22:58
Labels
nvidia (Connected to the nvidia extension)