KOTS installer 5 minute timeout is too low #14444
Labels
component: install
Terraform installation scripts, helm charts, installer images
team: delivery
Issue belongs to the self-hosted team
type: bug
Something isn't working
Bug description
While observing a customer call we saw that an upgrade of a real-world environment was taking over 4 minutes; this is precariously close to the 5 minute
helm upgrade
timeout.Additional debugging pointed the finger at shiftfs-module-loader. This init container compiles the shiftfs kernel module (which is slow) and the version of Kubernetes on some distributions updates daemonsets one node at a time. Combining the slow runtime of shiftfs-module-loader with the linear scaling properties of daemonsets, it's possible for the installation process to be very lengthy on larger clusters.
Steps to reproduce
n/a
Workspace affected
No response
Expected behavior
The helm upgrade process should have a timeout period long enough that it can't accidentally be reached. A value of 10 - 15 minutes is less risky.
Example repository
No response
Anything else?
No response
The text was updated successfully, but these errors were encountered: