docs: Update README.md to bypass nvidia runtime for dcgm-exporter service when deploying container for etcd and nats #3148
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.


Overview:
When setting up etcd & NATS by following the instruction in https://github.com/ai-dynamo/dynamo?tab=readme-ov-file#1-initial-setup and using command "docker compose -f deploy/docker-compose.yml up -d", the command failed with the below error, either etcd or NATS brought up.
✘ Container deploy-dcgm-exporter-1 Error response from daemon: unk... 0.0s
Error response from daemon: unknown or invalid runtime name: nvidia
Details:
The root cause is the dcgm-exporter service defined in docker-compose.yml explicitly uses the nvidia container runtime like below, which was not installed and running in the environment. To deploy the services successfully, the "runtime: nvidia" parameter needs to be commented out. This PR is to add additional information to the instruction to help others avoid the same issue.
Where should the reviewer start?
The "Install etcd and NATS (required)" section of the main README.md file
Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)