Skip to content

Failing to drain/cordon causes CrashLoopBackOff #545

@liorfranko

Description

@liorfranko

Describe the bug
When running in IMDS mode and aws-node-termination-handler can't drain a node due to PDB the pod crash
Why when running in IMDS mode the pod exit?
https://github.com/aws/aws-node-termination-handler/blob/main/cmd/node-termination-handler.go#L401

Application Logs
2021/12/07 08:02:20 ERR There was a problem while trying to cordon and drain the node error="[error when evicting pods/\"audience-consumer-64cb9bc5d6-h866g\" -n \"delivery-apps\": global timeout reached: 2m0s, error when evicting pods/\"audience-consumer-64cb9bc5d6-x9nq4\" -n \"delivery-apps\": global timeout reached: 2m0s, error when evicting pods/\"audience-consumer-64cb9bc5d6-999ft\" -n \"delivery-apps\": global timeout reached: 2m0s, error when evicting pods/\"events-restore-consumer-86c75c6759-l926p\" -n \"delivery-apps\": global timeout reached: 2m0s, error when evicting pods/\"audience-consumer-64cb9bc5d6-xp4ph\" -n \"delivery-apps\": global timeout reached: 2m0s, error when evicting pods/\"audience-consumer-64cb9bc5d6-ccjkr\" -n \"delivery-apps\": global timeout reached: 2m0s, error when evicting pods/\"audience-consumer-64cb9bc5d6-g79dn\" -n \"delivery-apps\": global timeout reached: 2m0s]"

Environment

  • NTH App Version:
    1.14.0
  • NTH Mode (IMDS/Queue processor):
    IMDS
  • Kubernetes version:
    Client Version: v1.18.9-eks-d1db3c
    Server Version: v1.20.7-eks-d88609
  • Installation method:
    Helm

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions