
Run locally #2

Open
g30ba1 opened this issue May 4, 2020 · 8 comments
g30ba1 commented May 4, 2020

I've built all the containers successfully.

But I've NO idea how to run them.

Any tip?

dusty-nv (Owner) commented:

Hi @g30ba1 , you can find the run instructions on NGC. For example:

https://ngc.nvidia.com/catalog/containers/nvidia:l4t-ml

sudo docker run -it --rm --runtime nvidia --network host nvcr.io/nvidia/l4t-ml:r32.4.2-py3
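A quick sanity check if `--runtime nvidia` is not recognized (a sketch assuming JetPack's default daemon config path, not part of the original reply):

```shell
# Verify the nvidia runtime is registered with Docker on the Jetson.
cat /etc/docker/daemon.json      # should contain a "runtimes": { "nvidia": ... } entry
sudo docker info | grep -i runtimes   # should list nvidia alongside runc
```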


uersoy commented May 30, 2020

Hello @dusty-nv,

I can run the container at localhost:8888; however, all my saved work is lost the next time I start it. It seems no checkpoint is saved when I save my JupyterLab files. Do you know why this may be happening?

Thanks!


g30ba1 commented May 30, 2020

Hi Dusty, thank you for your time.

I found that if we rebuild the containers ourselves, the command to run is:

sudo docker run -it --rm --runtime nvidia --network host -v /home/user/Documents/Projects:/home l4t-tensorflow:r32.4.2-tf1.15-py3

(the -v flag bind-mounts a local host folder into the container, so files written there persist)

NGC containers are a nice tool to get an environment as quickly as possible.
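A quick way to convince yourself the bind mount is working (paths follow the example command above):

```shell
# Create a file on the host, then read it from inside the container
# through the /home mount point.
mkdir -p /home/user/Documents/Projects
echo "hello from host" > /home/user/Documents/Projects/test.txt
sudo docker run --rm --runtime nvidia \
    -v /home/user/Documents/Projects:/home \
    l4t-tensorflow:r32.4.2-tf1.15-py3 \
    cat /home/test.txt
# Anything written under /home inside the container lands in the host
# folder, so it survives after the container exits.
```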


g30ba1 commented May 30, 2020

> I can run the container at localhost:8888, however all my saved work gets lost the next time I access it. It seems no checkpoint is saved when I save my JupyterLab files.

You must use the -v flag to work on a local (host) folder while using the container, e.g.:

sudo docker run -it --rm --runtime nvidia --network host -v /home/user/Documents/Projects:/home l4t-tensorflow:r32.4.2-tf1.15-py3

If you are using the container published on NVIDIA's NGC, the command is:

sudo docker run -it --rm --runtime nvidia --network host -v /home/user/project:/location/in/container nvcr.io/nvidia/l4t-ml:r32.4.2-py3

For the host side of the mount you can use any existing path on your local machine, and you can also create folders under the mount point while you're inside the container; they will persist on the host.


uersoy commented May 31, 2020

Thank you Jorge. I can use a local folder to save my scripts and notebooks; however, I am wondering how to keep the changes I make to the container image itself. Let's say I install new dependencies and libraries on top of the existing default ones. Will I have to create a new container image?


uersoy commented Jun 1, 2020

I figured it out. The link below explains it:
https://docs.nvidia.com/deeplearning/frameworks/user-guide/index.html
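The mechanism that guide describes is `docker commit`, which snapshots a running container's filesystem as a new image. A minimal sketch (the custom tag name is an example, not from this thread):

```shell
# 1. While the container is running, find its ID from another terminal:
sudo docker ps
# 2. Commit its current state (installed packages included) as a new tag;
#    replace <container-id> with the ID shown by `docker ps`:
sudo docker commit <container-id> l4t-ml:r32.4.2-py3-custom
# 3. Next time, run the customized image instead of the original:
sudo docker run -it --rm --runtime nvidia --network host l4t-ml:r32.4.2-py3-custom
```

Note that `--rm` deletes the container as soon as it exits, so commit before exiting (or drop `--rm`), otherwise the changes are gone.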


g30ba1 commented Jun 1, 2020

Nice find! I'm already reading it.

Regarding your question: yes, you have to rebuild the image to add libraries, dependencies, or any other requirements for your projects.
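That rebuild can be as small as a two-line Dockerfile on top of the NGC base image. A sketch assuming the r32.4.2 tag from earlier in the thread (the added package is just an example):

```shell
# Write a Dockerfile that extends the NGC base image:
cat > Dockerfile <<'EOF'
FROM nvcr.io/nvidia/l4t-ml:r32.4.2-py3
RUN pip3 install --no-cache-dir seaborn
EOF
# Build and tag the customized image, then run it as before:
sudo docker build -t l4t-ml:r32.4.2-py3-custom .
sudo docker run -it --rm --runtime nvidia --network host l4t-ml:r32.4.2-py3-custom
```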


uersoy commented Jun 1, 2020

I also read the whole guide, but I found the answer to my question in Section 10.1.4.

dusty-nv pushed a commit that referenced this issue Nov 3, 2023
Build EfficientViT package using PyTorch-distributed
dusty-nv pushed a commit that referenced this issue Sep 12, 2024
dusty-nv pushed a commit that referenced this issue Oct 13, 2024