-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
containerd registry certificates configured on regular EKS node AMI, but not GPU AMI #1154
Comments
Thanks for bringing this to our attention. I've addressed the issue and will update here when an AMI with the fix is publicly available. |
@cartermckinnon Can you reopen this? There is an issue with the new GPU AMI that still prevents this from working. The cert config block was added, but the name of the section precisely matches what was done in the standard compute AMI:
However, on the GPU AMI it seems that it needs to be as follows:
Similar to the sandbox entry:
I've tested this by modifying the config as follows and running
Thanks for hopping on this so quickly earlier, btw. |
What happened:
/etc/containerd/config.toml
/etc/docker/certs.d
with cert identically on both nodesWhat you expected to happen:
How to reproduce it (as minimally and precisely as possible):
/etc/containerd/config.toml
Anything else we need to know?:
A PR previously added a config for containerd to support configuring registry certificates in both the classic docker path and the containerd one: #1049
This is the config block that was added:
This configuration is present on the standard EKS node AMI, but not in any variation on the GPU node AMI. My understanding is that the GPU AMI doesn't use the containerd configs in this repo, and that there is some Amazon internal repo that is the source of truth for the GPU version of the containerd configs. If that is correct, then this internal repo needs to also be updated with a corresponding change to achieve feature parity with the standard EKS ami.
Environment:
aws eks describe-cluster --name <name> --query cluster.platformVersion
): eks.3aws eks describe-cluster --name <name> --query cluster.version
): 1.24uname -a
):Linux ip-10-0-81-145.us-west-2.compute.internal 5.4.226-129.415.amzn2.x86_64 #1 SMP Fri Dec 9 12:54:21 UTC 2022 x86_64 x86_64 x86_64 GNU/Linux
cat /etc/eks/release
on a node):The text was updated successfully, but these errors were encountered: