Use kernel 4.18 in EKS and ECS Amazon Linux AMIs to solve CFS throttling issues. #175
Comments
+1 this, would be great for FUSE too that shipped in 4.18. |
According to an excellent detailed response from AWS Support, the sched/fair: Fix bandwidth timer clock drift condition patch [0] is merged into the Linux kernel version 4.14.y branch and is available as part of 4.14.95 release [1]. The latest EKS Optimized Amazon Linux 2 AMI has a Linux kernel version which includes the patch.
Reference: [1] Linux kernel patch 4.14.95
I believe this issue can be closed. |
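For anyone who wants to check a node against the versions mentioned in this thread, here is a minimal sketch that compares a `uname -r` style release string with 4.14.95 (the 4.14.y backport) and 4.18 (the mainline release named in the issue title). The function names are hypothetical, other stable branches are not covered, and a distro kernel may carry or drop patches independently of its version number, so treat the result as a hint rather than proof.

```python
import re
from typing import Tuple


def kernel_version(release: str) -> Tuple[int, int, int]:
    """Extract (major, minor, patch) from a release string such as `uname -r` prints."""
    match = re.match(r"(\d+)\.(\d+)(?:\.(\d+))?", release)
    if match is None:
        raise ValueError(f"unrecognized kernel release: {release!r}")
    major, minor, patch = match.groups()
    return int(major), int(minor), int(patch or 0)


def has_bandwidth_timer_fix(release: str) -> bool:
    """Guess whether the release includes the bandwidth timer clock drift patch.

    Assumption: only the two data points from this thread are encoded,
    4.14.95+ on the 4.14.y branch and 4.18+ on mainline.
    """
    version = kernel_version(release)
    if version >= (4, 18, 0):
        return True
    return version[:2] == (4, 14) and version >= (4, 14, 95)


# Release string taken from the uname output quoted later in this thread.
print(has_bandwidth_timer_fix("4.14.154-128.181.amzn2.x86_64"))
```

On a live node the input would typically come from `platform.release()` in the standard library.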
I have switched all the nodes to |
It was subsequently reported that torvalds/linux@512ac99 introduced a regression. A patch to correct this can be found at https://lkml.org/lkml/2019/5/17/581 (not yet merged AFAICT). |
Not merged. The author, Dave Chiluk, is looking for support on the LKML to get attention to this patch, to get it reviewed and merged. |
It looks like that patch is getting closer to being merged from that thread. I can imagine it's still probably months away from getting into the amazon linux build though. |
Reports of great improvements using 4.14.133 with 512ac99 and Dave Chiluk’s patch backported into it: kubernetes/kubernetes#67577 (comment) https://gist.github.com/PaulFurtado/ff6c67ec87416b66ba1c6fc70f7beec1 Hopefully test results like these will help it get merged soon. |
In our case we're looking for: https://github.com/torvalds/linux/commit/79e9fed46038/ As we're exhausting the ephemeral TCP port range in containers. Also in 4.18. |
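To get a feel for how much headroom a container has before it exhausts the ephemeral range, here is a rough sketch that parses the contents of the two relevant sysctls, `net.ipv4.ip_local_port_range` and `net.ipv4.tcp_tw_reuse`. The helper names are hypothetical, the sample values are illustrative, and the mode table follows the kernel's ip-sysctl documentation, where the value 2 (loopback only) is the behaviour the commit above adds.

```python
# Meaning of net.ipv4.tcp_tw_reuse values; 2 is only understood by kernels
# that include the loopback-only extension (4.18 and later per this thread).
TW_REUSE_MODES = {
    0: "disabled",
    1: "enabled globally",
    2: "enabled for loopback traffic only",
}


def ephemeral_port_count(port_range_text: str) -> int:
    """Count the ports in an `ip_local_port_range` value such as '32768\t60999'."""
    low, high = (int(field) for field in port_range_text.split())
    if high < low:
        raise ValueError(f"invalid port range: {low}-{high}")
    return high - low + 1


def describe_tw_reuse(value_text: str) -> str:
    """Translate a `tcp_tw_reuse` value into a human-readable mode."""
    return TW_REUSE_MODES.get(int(value_text.strip()), "unknown")


# Illustrative inputs; on a node these would be read from
# /proc/sys/net/ipv4/ip_local_port_range and /proc/sys/net/ipv4/tcp_tw_reuse.
print(ephemeral_port_count("32768\t60999"))
print(describe_tw_reuse("2\n"))
```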
As per the release notes, Amazon Linux 2, the base image for the EKS/ECS Optimized AMIs, already has kernel 4.19.x available [1]. I upgraded my EKS Optimized AMI to the 4.19.x kernel to confirm the version and the patch.
With that I was able to confirm that the patch you need, net-tcp: extend tcp_tw_reuse sysctl to enable loopback only optimization, is already in the 4.19 kernel provided by AL2. Now it is a question of time until Amazon Linux 2 starts using kernel 4.19 as the default, or you can use https://github.com/awslabs/amazon-eks-ami and build your own AMI with the AWS-supported kernel as the default. Sadly, Dave Chiluk’s patch has not been backported to either version of the kernel so far, although commit 512ac99 is available in both 4.19 and 4.14. Reference |
Amazon Linux Extras provides kernel-ng, which is a 4.19-based kernel. |
Dave Chiluk’s patch is en route to the 5.4 kernel (the next release now that 5.3 is out), and should shortly be available in 5.4-rc1.
|
Amazon Linux 2 has just rolled out the new kernel version 4.14.154, which is now available for use.
|
thanks mate, I've just created a new image based on amazon-eks-ami and that kernel got installed:
amazon-ebs: ---> Package kernel.x86_64 0:4.14.154-128.181.amzn2 will be installed
…On Wed, 27 Nov 2019 at 00:46, Hugo Prudente ***@***.***> wrote:
Amazon Linux 2 has just rolled out the new kernel version 4.14.154, which is now available for use.
***@***.*** ~]$ uname -a
Linux ip-172-31-38-202.eu-west-1.compute.internal 4.14.154-128.181.amzn2.x86_64 #1 SMP Sat Nov 16 21:49:00 UTC 2019 x86_64 x86_64 x86_64 GNU/Linux
|
EKS AMIs have just been released with Kernel
See ami-087a82f6b78a07557 in |
Closing this issue as resolved with the release of the latest EKS AMI. See release notes here https://github.com/awslabs/amazon-eks-ami/releases/tag/v20191213 |
Apologies as this is kind of an Amazon Linux 2 issue, but it directly affects EKS and ECS.
Does anyone know when the Linux kernel version 4.18 will be shipped in Amazon Linux 2?
When it is, can the EKS and ECS images be rebuilt?
torvalds/linux@512ac99#diff-1c5364196d98130348bddabaad0a701f
The patch above should fix the issue with CFS quotas leading to process throttling. This should enable us to use limits everywhere again without setting off Prometheus alerts or degrading performance in latency-sensitive components!
kubernetes/kubernetes#67577
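To see whether a container is actually hitting this, one option is to look at the CFS bandwidth counters the kernel exposes in the cgroup's `cpu.stat` file. Below is a minimal sketch that computes the share of enforcement periods in which the cgroup was throttled. The function name and the sample counters are made up for illustration, and the file location (commonly `/sys/fs/cgroup/cpu/.../cpu.stat` on cgroup v1) is an assumption that depends on how the node mounts cgroups.

```python
def throttle_ratio(cpu_stat_text: str) -> float:
    """Fraction of CFS enforcement periods in which the cgroup was throttled."""
    stats = {}
    for line in cpu_stat_text.splitlines():
        parts = line.split()
        if len(parts) == 2:  # each line is "<counter name> <integer value>"
            stats[parts[0]] = int(parts[1])
    periods = stats.get("nr_periods", 0)
    if periods == 0:
        return 0.0  # no quota configured, or no periods elapsed yet
    return stats.get("nr_throttled", 0) / periods


# Illustrative cpu.stat contents, not measurements from a real node.
sample = "nr_periods 1000\nnr_throttled 250\nthrottled_time 123456789\n"
print(throttle_ratio(sample))
```

A ratio that stays high while the container's CPU usage is well below its limit is the symptom described in the linked Kubernetes issue.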