[aws-eks] allow specifying timeouts for HelmCharts #8215

gibbster · 2020-05-26T22:50:32Z

For long running helm charts the creation times out.

Reproduction Steps

I had this issue with the following resource (based on a minimal cluster defined through the CDK Cluster component).

new eks.HelmChart(this, 'spinnaker', {
  cluster,
  chart: 'spinnaker',
  repository: 'https://kubernetes-charts.storage.googleapis.com/',
  version: '2.0.0-rc3',
  namespace: 'spinnaker'
});

Error Log

3/9 | 3:05:54 PM | CREATE_FAILED | Custom::AWSCDK-EKS-HelmChart | spinnaker/Resource/Default (spinnakerC4B2DB2C) Failed to create resource. TimeoutError: Connection timed out after 120000ms

Environment

**CLI Version :1.41.0 (build 9e071d2)
**Framework Version: 1.41.0
**OS : macOS 10.14.6
**Language : typescript

Other

The helm command has an option for timeout, and in fact the chart I was trying to run (stable/spinnaker) suggests a timeout of 600s: https://github.com/helm/charts/blob/master/stable/spinnaker/README.md. I believe that the chart should add timeout as a option:

aws-cdk/packages/@aws-cdk/aws-eks/lib/helm-chart.ts

Lines 8 to 50 in 86eac6a

    
           export interface HelmChartOptions { 
        
             /** 
        
              * The name of the chart. 
        
              */ 
        
             readonly chart: string; 
        
             /** 
        
              * The name of the release. 
        
              * @default - If no release name is given, it will use the last 53 characters of the node's unique id. 
        
              */ 
        
             readonly release?: string; 
        
             /** 
        
              * The chart version to install. 
        
              * @default - If this is not specified, the latest version is installed 
        
              */ 
        
             readonly version?: string; 
        
             /** 
        
              * The repository which contains the chart. For example: https://kubernetes-charts.storage.googleapis.com/ 
        
              * @default - No repository will be used, which means that the chart needs to be an absolute URL. 
        
              */ 
        
             readonly repository?: string; 
        
             /** 
        
              * The Kubernetes namespace scope of the requests. 
        
              * @default default 
        
              */ 
        
             readonly namespace?: string; 
        
             /** 
        
              * The values to be used by the chart. 
        
              * @default - No values are provided to the chart. 
        
              */ 
        
             readonly values?: {[key: string]: any}; 
        
             /** 
        
              * Whether or not Helm should wait until all Pods, PVCs, Services, and minimum number of Pods of a 
        
              * Deployment, StatefulSet, or ReplicaSet are in a ready state before marking the release as successful. 
        
              * @default - Helm will not wait before marking release as successful 
        
              */ 
        
             readonly wait?: boolean; 
        
           }

1.41.0 (build 9e071d2)

This is 🐛 Bug Report

The text was updated successfully, but these errors were encountered:

eladb · 2020-05-27T07:21:13Z

It would be possible to allow timeouts of up to 15m. @pahud is this something you'd be interested to pick up?

pahud · 2020-05-27T08:18:30Z

@eladb Sure!

The interesting thing is the Kubelctl Provider default timeout is 15min now.

aws-cdk/packages/@aws-cdk/aws-eks/lib/kubectl-provider.ts

Line 26 in 99e7330

timeout: Duration.minutes(15),

And the error message provided doesn't seem to be a lambda timeout

3/9 | 3:05:54 PM | CREATE_FAILED | Custom::AWSCDK-EKS-HelmChart | spinnaker/Resource/Default (spinnakerC4B2DB2C) Failed to create resource. TimeoutError: Connection timed out after 120000ms

I'll try re-produce it in my environment and check the logs.

eduardomourar · 2020-05-29T15:54:58Z

We are getting the same error while running:

cluster.addChart('Prometheus', {
  chart: 'prometheus-operator',
  repository: 'https://kubernetes-charts.storage.googleapis.com/',
  version: '8.13.8',
  namespace: 'default'
});

eduardomourar · 2020-06-02T23:09:36Z

I can confirm that there is a timeout from the AWS SDK (as specified here) and it can solved by adding to the provider framework code:

lambda.config.update({httpOptions: { timeout: 900000 }});

I will make a PR soon to fix this.

pahud · 2020-06-02T23:49:07Z

It sounds like an underlying http timeout between the provider framework and the kubectl provider which executes the helm command. Is the fix working in your environment? @eduardomourar

eduardomourar · 2020-06-03T00:03:53Z

Yes, so far so good. My helm charts are taking about 3 minutes to deploy and without any issue now.

I believe the issue is that the invokeFunction used by the provider framework has this 2 minutes timeout, so it was not honoring the overall timeout (15 minutes) set for the target lambda.

This creates an additional option called `timeout` that will be passed down whenever deploying helm chart to an EKS cluster. In order to allow the timeout parameter to work while performing helm commands, the provider framework has to honor the maximum timeout of 15 minutes from target process (lambda in this case). closes #8215 ---- *By submitting this pull request, I confirm that my contribution is made under the terms of the Apache-2.0 license*

gibbster added bug This issue is a bug. needs-triage This issue or PR still needs to be triaged. labels May 26, 2020

SomayaB added the @aws-cdk/aws-eks Related to Amazon Elastic Kubernetes Service label May 27, 2020

SomayaB assigned eladb May 27, 2020

eladb added feature-request A feature should be added or improved. and removed bug This issue is a bug. labels May 27, 2020

eladb changed the title ~~HelmChart times out with long running charts~~ eks: allow specifying timeouts for HelmCharts May 27, 2020

eladb added the effort/small Small work item – less than a day of effort label May 27, 2020

SomayaB removed the needs-triage This issue or PR still needs to be triaged. label Jun 2, 2020

eduardomourar mentioned this issue Jun 3, 2020

feat(eks): timeout option helm charts #8338

Merged

SomayaB added the in-progress This issue is being actively worked on. label Jun 5, 2020

mergify bot closed this as completed in #8338 Jun 11, 2020

iliapolo changed the title ~~eks: allow specifying timeouts for HelmCharts~~ [aws-eks] allow specifying timeouts for HelmCharts Aug 16, 2020

iliapolo removed the in-progress This issue is being actively worked on. label Aug 16, 2020

github-actions bot assigned iliapolo Aug 16, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[aws-eks] allow specifying timeouts for HelmCharts #8215

[aws-eks] allow specifying timeouts for HelmCharts #8215

gibbster commented May 26, 2020

eladb commented May 27, 2020

pahud commented May 27, 2020

eduardomourar commented May 29, 2020

eduardomourar commented Jun 2, 2020

pahud commented Jun 2, 2020 •

edited

Loading

eduardomourar commented Jun 3, 2020

[aws-eks] allow specifying timeouts for HelmCharts #8215

[aws-eks] allow specifying timeouts for HelmCharts #8215

Comments

gibbster commented May 26, 2020

Reproduction Steps

Error Log

Environment

Other

eladb commented May 27, 2020

pahud commented May 27, 2020

eduardomourar commented May 29, 2020

eduardomourar commented Jun 2, 2020

pahud commented Jun 2, 2020 • edited Loading

eduardomourar commented Jun 3, 2020

pahud commented Jun 2, 2020 •

edited

Loading