You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Cloud provider or hardware configuration: gke (machine type n1-highcpu-4)
OS (e.g. from /etc/os-release): Alpine Linux v3.5
Kernel (e.g. uname -a): Linux cc332daac761 4.9.13-moby SMP Sat Mar 25 02:48:44 UTC 2017 x86_64 Linux
What happened:
We have an issue with the cluster-autoscaler where new pods are stuck on Pending and a new node isn't being created. We see these events in the pod:
FirstSeen LastSeen Count From SubObjectPath Type Reason Message
--------- -------- ----- ---- ------------- -------- ------ -------
3m 7s 22 cluster-autoscaler Normal NotTriggerScaleUp pod didn't trigger scale-up (it wouldn't fit if a new node is added)
4m 0s 17 default-scheduler Warning FailedScheduling No nodes are available that match all of the following predicates:: Insufficient cpu (2).
What you expected to happen:
The pods request only 1 CPU resource, so pods would definitely fit on a new node of instance type n1-highcpu-4.
How to reproduce it (as minimally and precisely as possible):
We can reproduce this by creating a new simple cluster with the following command:
With this configuration new nodes are not being created at all with the NotTriggerScaleUp event message returned by the cluster-autoscaler. When we perform the exact same steps except remove the affinity setting from the configuration new nodes are created without a problem. It seems that the AntiAffinity in some way makes the cluster-autoscaler incorrectly think that there wouldn't be any room on a new node.
The text was updated successfully, but these errors were encountered:
Thank you for the bug report. This has been fixed in CA 0.5.3 and will be included in the next 1.6 K8S release. We hope the new version of Kubernetes will be available in GKE late next week or, if unlucky, a couple days later.
Is this a BUG REPORT or FEATURE REQUEST? (choose one): Bug report
Kubernetes version (use kubectl version):
Environment:
What happened:
We have an issue with the cluster-autoscaler where new pods are stuck on Pending and a new node isn't being created. We see these events in the pod:
What you expected to happen:
The pods request only 1 CPU resource, so pods would definitely fit on a new node of instance type n1-highcpu-4.
How to reproduce it (as minimally and precisely as possible):
We can reproduce this by creating a new simple cluster with the following command:
gcloud container clusters create scale-test --cluster-version 1.6.2 --zone us-east1-b --additional-zones us-east1-c --machine-type n1-highcpu-4 --num-nodes 1 --preemptible --enable-autoupgrade --enable-autorepair --enable-autoscaling --min-nodes 1 --max-nodes 10
We then run
kubectl apply -f "deploy.yml"
with the following configuration:With this configuration new nodes are not being created at all with the NotTriggerScaleUp event message returned by the cluster-autoscaler. When we perform the exact same steps except remove the
affinity
setting from the configuration new nodes are created without a problem. It seems that the AntiAffinity in some way makes the cluster-autoscaler incorrectly think that there wouldn't be any room on a new node.The text was updated successfully, but these errors were encountered: