-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
bug(kernel): process hangs due to io_uring #2070
Comments
this is an issue in the upstream Linux kernel - see here for details and a possible interim work around amazonlinux/amazon-linux-2023#840 (comment) |
@dbarrosop Setting the environment variable UV_USE_IO_URING=0 will mitigate this for |
Thanks a lot to both of you for the information :) |
Thank you, also applies to |
It would be nice if the team could release new AL2023 images based on AMIs which use the recently released |
For information latest official AL2023 EKS images (v20241205) still ship |
@sylr there's a release in-progress now that will address this: https://github.com/awslabs/amazon-eks-ami/releases/tag/v20241213 The new AMIs will be available in all regions within 24 hours 👍 |
thanks all, release for v20241213 with |
What happened:
In our staging environment we have karpenter configured to update AMIs as they come to detect issues before upgrading production. After rolling out new nodes with the AMI
amazon-eks-node-al2023-arm64-standard-1.29-v20241115
(image IDami-068228c3265014a86
) workloads with CPU limits seem to break. Changing the AMI to the previous versionamazon-eks-node-al2023-arm64-standard-1.29-v20241109
(image IDami-0f12332f7a73b222b
) fixes the problem.What you expected to happen:
Upgrading shouldn't break workloads.
How to reproduce it (as minimally and precisely as possible):
Environment:
You can replicate the issue with the following steps (might be easier ways, but this is similar-ish to how we noticed as we use these nodes to build software, you can probably ignore or remove the affinities and tolerations):
Then connect to the terminal and run:
what you will notice (I hope) is that the installation will eventually just freeze. Reverting to a previous AMI makes this work again. Removing CPU limits also fixes the issue so that might be a clue of where the issue lies.
Thanks!
The text was updated successfully, but these errors were encountered: