ArgoCD starts to DDoS the related Helm repository #12314

Closed
3 tasks done
fotto1 opened this issue Feb 7, 2023 · 5 comments · Fixed by #19530
Labels
bug Something isn't working

Comments

@fotto1

fotto1 commented Feb 7, 2023

Checklist:

  • I've searched in the docs and FAQ for my answer: https://bit.ly/argocd-faq.
  • I've included steps to reproduce the bug.
  • I've pasted the output of argocd version.

Describe the bug

Our ArgoCD Applications have started to DDoS our Helm repositories.

For our deployment we use GitOps with an app-of-apps approach that creates nearly 64 different ArgoCD Applications. Our GitOps setup has multiple branches, and we run multiple deployments on the same Argo CD environment in one AWS EKS cluster.

With 32 deployments that means 2048 ArgoCD Applications based on the same Helm repository, plus 32 Argo CD Applications running as app of apps via GitOps.

To Reproduce

  1. Configure a GitOps repository with a set of ArgoCD Applications that use the same Helm chart repository and a sync interval of 1 minute.
  2. Create multiples of these ArgoCD Applications via GitOps.
  3. Check the access log of the Helm chart repository (I use JFrog Artifactory) for calls to the index.yaml file.

Expected behavior

Since ArgoCD always requests the same index.yaml file from the Helm chart repository, I would expect it to be cached in ArgoCD and used as a shared resource between ArgoCD Applications, rather than every single ArgoCD Application fetching the index.yaml file itself.

Our index.yaml is 20.26 MB, and 2048 ArgoCD Applications each fetch it once per hour for 12 hours. That makes 20.26 MB × 2048 × 12 = 497,909.76 MB ≈ 486.24 GB per day.

That is also what we see in the access logs of our Helm repository.

Our expectation, with the file cached centrally and checked once per minute, is 20.26 MB × 60 × 24 ≈ 28.49 GB per day.
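
As a quick check, the two figures above follow from simple arithmetic (standalone snippet, not Argo CD code):

package main

import "fmt"

func main() {
	const indexMB = 20.26 // size of index.yaml in MB

	// Observed: 2048 Applications each fetch the index once per hour, for 12 hours a day.
	observed := indexMB * 2048 * 12
	fmt.Printf("observed: %.2f MB/day (~%.2f GB/day)\n", observed, observed/1024)

	// Expected with a central cache: one fetch per minute, 24 hours a day.
	expected := indexMB * 60 * 24
	fmt.Printf("expected: %.2f MB/day (~%.2f GB/day)\n", expected, expected/1024)
}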

Version

argocd: v2.6.0+acc554f
  BuildDate: 2023-02-06T21:44:18Z
  GitCommit: acc554f3d99010e0353b498a595844b30090556f
  GitTreeState: clean
  GoVersion: go1.18.10
  Compiler: gc
  Platform: linux/amd64
argocd-server: v2.5.6+9db2c94
  BuildDate: 2023-01-10T19:30:17Z
  GitCommit: 9db2c9471f6ff599c3f630b446e940d3a065620b
  GitTreeState: clean
  GoVersion: go1.18.9
  Compiler: gc
  Platform: linux/amd64
  Kustomize Version: v4.5.7 2022-08-02T16:35:54Z
  Helm Version: v3.10.3+g835b733
  Kubectl Version: v0.24.2
  Jsonnet Version: v0.18.0
@fotto1 added the bug (Something isn't working) label Feb 7, 2023
@jessesuen
Member

The feature request makes sense. We need the equivalent of what we are currently doing with git ls-remote but for a helm chart repository.

For the implementation, I feel we could cache the index.yaml in Redis and make subsequent requests for index.yaml with the If-Modified-Since header. If the response is HTTP 304, then we can use the cached result.

This all presumes jFrog Artifactory supports the If-Modified-Since header.
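
A rough Go sketch of that conditional-GET idea (illustrative only: fetchIndex is a made-up name, and cachedData/cachedTime stand in for whatever a Redis-backed lookup would return):

package sketch

import (
	"errors"
	"io"
	"net/http"
	"time"
)

// fetchIndex does a conditional GET for index.yaml, reusing the cached copy
// when the server answers 304 Not Modified.
func fetchIndex(indexURL string, cachedData []byte, cachedTime time.Time) ([]byte, error) {
	req, err := http.NewRequest(http.MethodGet, indexURL, nil)
	if err != nil {
		return nil, err
	}
	if len(cachedData) > 0 {
		// Ask the server to reply 304 if the index has not changed since the cached copy.
		req.Header.Set("If-Modified-Since", cachedTime.UTC().Format(http.TimeFormat))
	}

	resp, err := http.DefaultClient.Do(req)
	if err != nil {
		return nil, err
	}
	defer resp.Body.Close()

	switch resp.StatusCode {
	case http.StatusNotModified:
		// Nothing changed on the server: reuse the cached index.
		return cachedData, nil
	case http.StatusOK:
		return io.ReadAll(resp.Body)
	default:
		return nil, errors.New("failed to get index: " + resp.Status)
	}
}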

@fotto1 are you open to submitting a fix for this?

@fotto1
Author

fotto1 commented Feb 9, 2023

@jessesuen sounds like a good proposal. Do you perhaps have a link to the merge request for the similar change on git repositories?

Can you provide some guidance on which parts of the code must be touched to implement the change you proposed?

After that I will check whether I can make the change on my own.

@fotto1
Author

fotto1 commented Oct 5, 2023

This also seems related to the problem described by @hmoravec in #8698.

@able8

able8 commented Dec 22, 2023

We often encounter 'helm dependency build' failing with a timeout after 1m30s, see #3977 (comment).
It would be nice to cache the helm charts.

rpc error: code = Unknown desc = Manifest generation error (cached): 'helm dependency build' failed timeout after 3m0s

@fotto1
Author

fotto1 commented Mar 13, 2024

Based on multiple discussions, also with my colleague @andrei-gavrila, we identified the underlying issue.

It seems Argo CD creates, for every Helm chart, its own client with a repo URL and a cache. That means every configured application has its own cache and stores its own index.yaml file.

The indexCache sits on the nativeHelmChart client:

type nativeHelmChart struct {
	chartCachePaths argoio.TempPaths
	repoURL         string
	creds           Creds
	repoLock        sync.KeyLock
	enableOci       bool
	indexCache      indexCache
	proxy           string
}

This is how the index is retrieved:

if !noCache && c.indexCache != nil {
    if err := c.indexCache.GetHelmIndex(c.repoURL, &data); err != nil && err != cache.ErrCacheMiss {
        log.Warnf("Failed to load index cache for repo: %s: %v", c.repoURL, err)
    }
}

if len(data) == 0 {
    start := time.Now()
    var err error
    data, err = c.loadRepoIndex()
    if err != nil {
        return nil, err
    }
    log.WithFields(log.Fields{"seconds": time.Since(start).Seconds()}).Info("took to get index")

    if c.indexCache != nil {
        if err := c.indexCache.SetHelmIndex(c.repoURL, data); err != nil {
            log.Warnf("Failed to store index cache for repo: %s: %v", c.repoURL, err)
        }
    }
}

Unfortunately, this is how it's really retrieved (no helm involvement):

func (c *nativeHelmChart) loadRepoIndex() ([]byte, error) {
	indexURL, err := getIndexURL(c.repoURL)
	if err != nil {
		return nil, err
	}

	req, err := http.NewRequest(http.MethodGet, indexURL, nil)
	if err != nil {
		return nil, err
	}
	if c.creds.Username != "" || c.creds.Password != "" {
		// only basic supported
		req.SetBasicAuth(c.creds.Username, c.creds.Password)
	}

	tlsConf, err := newTLSConfig(c.creds)
	if err != nil {
		return nil, err
	}

	tr := &http.Transport{
		Proxy:             proxy.GetCallback(c.proxy),
		TLSClientConfig:   tlsConf,
		DisableKeepAlives: true,
	}
	client := http.Client{Transport: tr}
	resp, err := client.Do(req)
	if err != nil {
		return nil, err
	}
	defer func() { _ = resp.Body.Close() }()

	if resp.StatusCode != http.StatusOK {
		return nil, errors.New("failed to get index: " + resp.Status)
	}
	return io.ReadAll(resp.Body)
}

There is no Helm involvement, see the http.NewRequest(http.MethodGet, indexURL, nil) call, and yes, that means every Helm chart client has its own cache.


The original feature commit by @alexmt that enabled caching in Argo CD (it includes an environment variable to configure the cache lifetime, and the cache is set on the client, e.g. 1000 charts, one repo, one client -> one cache):

5889bbb

Not sure why this has been changed, but the change explains our problem.
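
For reference, the fix that eventually landed (see the commits below, "fix: cache helm-index in Redis cluster") shares the cached index per repository URL instead of per client. A simplified illustration of that shared-cache idea in Go, with made-up names rather than the actual implementation:

package sketch

import "sync"

// Illustration only (not the actual Argo CD code): a process-wide index cache
// keyed by repository URL, so every Application that points at the same Helm
// repository shares one copy of index.yaml instead of downloading its own.
type sharedIndexCache struct {
	mu      sync.Mutex
	entries map[string][]byte // repo URL -> raw index.yaml
}

func newSharedIndexCache() *sharedIndexCache {
	return &sharedIndexCache{entries: map[string][]byte{}}
}

// GetOrLoad returns the cached index for repoURL and calls load only on a miss.
// Simplified on purpose: one lock for all repos and no expiry.
func (c *sharedIndexCache) GetOrLoad(repoURL string, load func() ([]byte, error)) ([]byte, error) {
	c.mu.Lock()
	defer c.mu.Unlock()
	if data, ok := c.entries[repoURL]; ok {
		return data, nil
	}
	data, err := load()
	if err != nil {
		return nil, err
	}
	c.entries[repoURL] = data
	return data, nil
}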

todaywasawesome added a commit that referenced this issue Aug 21, 2024
* fix: cache helm-index in Redis cluster

Signed-off-by: JenTing Hsiao <hsiaoairplane@gmail.com>

* Update repository.go

Fix order

Signed-off-by: Dan Garfield <dan@codefresh.io>

---------

Signed-off-by: JenTing Hsiao <hsiaoairplane@gmail.com>
Signed-off-by: Dan Garfield <dan@codefresh.io>
Co-authored-by: Dan Garfield <dan@codefresh.io>
ashutosh16 pushed a commit that referenced this issue Aug 23, 2024
pasha-codefresh pushed a commit to pasha-codefresh/argo-cd that referenced this issue Oct 9, 2024