Flux performance on large repos #3380
Replies: 2 comments 10 replies
-
I suggest you set the number of parallel reconciliations to something like 100 or even higher. Each reconciliation runs in a goroutine, which is a lightweight thread; a single CPU core can handle hundreds of those, so this may considerably speed up the whole reconciliation.
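For reference, a sketch of how this could be done with a kustomize patch in the cluster's `flux-system/kustomization.yaml` (the `--concurrent` flag is the kustomize-controller setting for parallel reconciliations; the value `100` here is just an illustration, tune it for your cluster):

```yaml
apiVersion: kustomize.config.k8s.io/v1beta1
kind: Kustomization
resources:
  - gotk-components.yaml
  - gotk-sync.yaml
patches:
  # Raise the number of concurrent reconciliations on kustomize-controller
  - patch: |
      - op: add
        path: /spec/template/spec/containers/0/args/-
        value: --concurrent=100
    target:
      kind: Deployment
      name: kustomize-controller
```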
Due to the nature of Kustomize, where a change in a base overlay can affect all the other overlays, Flux can't skip changes by looking at files; it needs to ask the server. So we do a server-side apply dry-run and only apply the objects that changed. Also, even if nothing changed in Git, things may have drifted in-cluster (e.g. from a kubectl apply). To correct such drift, we decided to always run drift detection via server-side apply dry-run on Git revision changes. If each tenant had its definition in an OCI artifact, then Flux would only reconcile the tenant that changed.
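A minimal sketch of the per-tenant OCI artifact setup described above, assuming one artifact per tenant pushed to a registry (the registry URL, tenant name, and tag are hypothetical placeholders):

```yaml
apiVersion: source.toolkit.fluxcd.io/v1beta2
kind: OCIRepository
metadata:
  name: tenant-a
  namespace: flux-system
spec:
  interval: 10m
  # Hypothetical registry and path; replace with your own
  url: oci://registry.example.com/tenants/tenant-a
  ref:
    tag: latest
---
apiVersion: kustomize.toolkit.fluxcd.io/v1
kind: Kustomization
metadata:
  name: tenant-a
  namespace: flux-system
spec:
  interval: 60m
  sourceRef:
    kind: OCIRepository
    name: tenant-a
  path: ./
  prune: true
```

With this layout, a push to one tenant's artifact only triggers that tenant's Kustomization, instead of re-queuing all of them on every Git revision change.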
-
Did you ever add that to the cheatsheet @stefanprodan? We've run into this issue, in particular in terms of AWS EBS gp2 burst exhaustion. We've moved to gp3, which has no burst issues, but a memory-backed disk would still be better. We've proved it manually (not using the Helm chart, see below) in our cluster and it works great. Do you still recommend it? FWIW, here is our change that works for the Flux Helm chart v2.14.1 (which I know is community-driven):

```yaml
kustomizeController:
  volumes:
    - name: temp
      emptyDir:
        sizeLimit: 1000Mi
        medium: Memory
  volumeMounts:
    - name: temp
      mountPath: /tmp
```
-
First of all I'd like to say I love working with Flux, thanks for this great piece of software.
We have a single-tenant setup for our solution. At the moment, we run about 1,200 tenants on a single Kubernetes cluster, which works fine.
In our gitops repo that Flux monitors, we have 1,200 directories containing the required Kubernetes manifests (about 20 objects).
Because a full reconciliation run took quite long, we made the following performance improvements to the kustomize-controller Deployment:

- Set `medium: Memory` for the `emptyDir` mount on `/tmp`, to avoid disk throttling.
- Increased the `interval` of the Kustomizations to 60m.

Now, a full run takes about 13 minutes, which is doable.
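For anyone applying the same `/tmp` change without the Helm chart, a sketch of a strategic merge patch in `flux-system/kustomization.yaml` could look like this (assuming the controller's `/tmp` volume is named `temp`, as in the default Flux manifests; the 1000Mi size limit is illustrative):

```yaml
apiVersion: kustomize.config.k8s.io/v1beta1
kind: Kustomization
resources:
  - gotk-components.yaml
  - gotk-sync.yaml
patches:
  # Back the controller's /tmp volume with memory instead of node disk
  - patch: |
      apiVersion: apps/v1
      kind: Deployment
      metadata:
        name: kustomize-controller
        namespace: flux-system
      spec:
        template:
          spec:
            volumes:
              - name: temp
                emptyDir:
                  medium: Memory
                  sizeLimit: 1000Mi
```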
However, we face the following problem. When we update the manifests of a single tenant and the Flux source-controller fetches the change, a full reconciliation run is initiated (meaning all 1,200 Kustomizations). This means we need to wait up to 13 minutes for a simple configuration update (the source-controller fetches the repo every 13 minutes). Second, when we update another configuration, all 1,200 Kustomizations are added to the kustomize-controller's queue again.
Why doesn't it only reconcile the Kustomizations that have changes in the repo? If this is by design, do you have any recommendations on how to improve this setup?