tmp storage does not get cleaned up when large helm repository index file fails to process #1451

xeruf · 2024-04-18T12:21:47Z

I am fetching the truecharts helmrepository once an hour: https://open.greenhost.net/xeruf/stackspout/-/blob/main/infrastructure/sources/truecharts.yaml?ref_type=heads

but it is fetched more often, and for some reason all old copies are kept in tmp:

After a few hours, this pod occupies a few GB and it already went up to 60GB! This is crashing my whole cluster, and I am clueless what the heck to do to fix this.

The text was updated successfully, but these errors were encountered:

stefanprodan · 2024-04-18T12:32:12Z

We cleanup tmp at the end of each reconciliation, if the files are still there then something blocks the controller from deleting them.

stefanprodan · 2024-04-18T12:35:44Z

Also when reporting issue you need to provide which version of Flux are your running, this may be an old buggy version that we no longer support. Please post flux check.

xeruf · 2024-04-18T15:53:44Z

❯ flux check
► checking prerequisites
✗ flux 2.1.2 <2.2.3 (new version is available, please upgrade)
✔ Kubernetes 1.28.2+k3s1 >=1.25.0-0
► checking controllers
✔ helm-controller: deployment ready
► ghcr.io/fluxcd/helm-controller:v0.36.2
✔ kustomize-controller: deployment ready
► ghcr.io/fluxcd/kustomize-controller:v1.1.1
✔ source-controller: deployment ready
► ghcr.io/fluxcd/source-controller:v1.1.2
► checking crds
✔ helmcharts.source.toolkit.fluxcd.io/v1beta2
✔ buckets.source.toolkit.fluxcd.io/v1beta2
✔ helmreleases.helm.toolkit.fluxcd.io/v2beta1
✔ gitrepositories.source.toolkit.fluxcd.io/v1
✔ helmrepositories.source.toolkit.fluxcd.io/v1beta2
✔ ocirepositories.source.toolkit.fluxcd.io/v1beta2
✔ kustomizations.kustomize.toolkit.fluxcd.io/v1
✔ all checks passed

stefanprodan · 2024-04-18T16:21:44Z

If this was an issue in Flux 2.1 then tones of users would have reported it back in 2023. I think the tmp disk denies cleanup from the host. Try mounting an NFS disk for tmp and see if the issue persists there.

stefanprodan · 2024-04-18T16:23:31Z

Another test that you could so is set tmp to RAM and check if tmp gets cleared. Here is an example of how to mount a ram disk https://fluxcd.io/flux/installation/configuration/vertical-scaling/#enable-in-memory-kustomize-builds

souleb · 2024-04-19T13:46:39Z

do you see any error in the source-controller logs about cleaning up indexes temporary files?

xeruf · 2024-04-20T07:58:21Z

ah, the raising of limits in https://open.greenhost.net/xeruf/stackspout/-/blob/6e645c6abfe378f3ccbcce7f167da9e5133e46c8/overrides/source-controller-patch.yaml did not work:

so upon failure of processing, it leaves the temporary file in place

stefanprodan · 2024-04-20T08:07:14Z

That patch looks wrong to me, there is no name/namespace in a Kustomize config file nor can you apply such a thing with Flux. See here how you can configure Flux at bootstrap time: https://fluxcd.io/flux/installation/configuration/boostrap-customization/

xeruf · 2024-04-20T14:56:15Z

Thanks for the hint!
Either way, if it fails to process a repo file it should not infinitely keep them.

stefanprodan changed the title ~~Unreasonable storage use for large helmrepository~~ tmp storage does not get cleaned up when large helm repository index file fails to process Apr 20, 2024

stefanprodan added bug Something isn't working area/helm Helm related issues and pull requests area/storage Storage related issues and pull requests labels Apr 20, 2024

souleb mentioned this issue Apr 22, 2024

Bind cached helm index to the maximum index size #1457

Merged

souleb closed this as completed in #1457 Apr 22, 2024

stefanprodan mentioned this issue Apr 24, 2024

"failed to load Helm repository" results in data storage leak #1462

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tmp storage does not get cleaned up when large helm repository index file fails to process #1451

tmp storage does not get cleaned up when large helm repository index file fails to process #1451

xeruf commented Apr 18, 2024

stefanprodan commented Apr 18, 2024

stefanprodan commented Apr 18, 2024

xeruf commented Apr 18, 2024

stefanprodan commented Apr 18, 2024

stefanprodan commented Apr 18, 2024

souleb commented Apr 19, 2024 •

edited

Loading

xeruf commented Apr 20, 2024

stefanprodan commented Apr 20, 2024 •

edited

Loading

xeruf commented Apr 20, 2024

tmp storage does not get cleaned up when large helm repository index file fails to process #1451

tmp storage does not get cleaned up when large helm repository index file fails to process #1451

Comments

xeruf commented Apr 18, 2024

stefanprodan commented Apr 18, 2024

stefanprodan commented Apr 18, 2024

xeruf commented Apr 18, 2024

stefanprodan commented Apr 18, 2024

stefanprodan commented Apr 18, 2024

souleb commented Apr 19, 2024 • edited Loading

xeruf commented Apr 20, 2024

stefanprodan commented Apr 20, 2024 • edited Loading

xeruf commented Apr 20, 2024

souleb commented Apr 19, 2024 •

edited

Loading

stefanprodan commented Apr 20, 2024 •

edited

Loading