
Add cni-repair-controller to linkerd-cni DaemonSet #11699

Merged
6 commits merged into main from alpeb/linkerd-reinitialize-pods on Jan 5, 2024

Conversation

alpeb (Member) commented Dec 5, 2023

Followup to linkerd/linkerd2-proxy-init#306
Fixes #11073

This adds the `cni-repair-controller` container to the `linkerd-cni` DaemonSet, along with its config in `values.yaml`. Note that it is disabled by default; to enable it, set `repairController.enabled=true`.

The `linkerd-cni` image version is also bumped so that it contains the new binary for this controller.

Finally, `priorityClassName: system-cluster-critical` was added to the DaemonSet, which should signal the scheduler to give it priority over application pods. That has proven not to be reliable, however, hence the need for the new controller.
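
For reference, a minimal Helm values sketch for enabling the new container might look like the following; only the `repairController.enabled` flag is taken from this description, and the surrounding comments are illustrative:

```yaml
# linkerd2-cni chart values override (illustrative sketch)
repairController:
  # Disabled by default; setting this to true adds the cni-repair-controller
  # container to the linkerd-cni DaemonSet.
  enabled: true
```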

Followup to linkerd/linkerd2-proxy-init#306
Fixes #11073

This adds the `reinitialize-pods` container to the `linkerd-cni`
DaemonSet, along with its config in `values.yaml`.

The `linkerd-cni` version is also bumped to include the new binary
for this controller.

## TO-DOs

- Integration test
alpeb force-pushed the alpeb/linkerd-reinitialize-pods branch from 178c9f8 to 1ca137a on December 5, 2023 15:34
alpeb force-pushed the alpeb/linkerd-reinitialize-pods branch from a38b17e to 0e943f4 on December 6, 2023 16:25
@jdinsel-xealth

Could this also be marked with the Fixes #11735 tag?

@alpeb alpeb marked this pull request as draft December 11, 2023 19:53
alpeb added a commit to linkerd/linkerd2-proxy-init that referenced this pull request Dec 11, 2023
(Note this will fail until linkerd/linkerd2#11699 lands)

The `integration-cni-plugin.yml` workflow (formerly known as `cni-plugin-integration.yml`) has been expanded to run the new recipe `reinitialize-pods-integration`, which performs the following steps:

- Rebuilds the `linkerd-reinitialize-pods` crate and `cni-plugin`. The `Dockerfile-cni-plugin` file has been refactored to have two main targets, `runtime` and `runtime-test`, the latter picking up the `linkerd-reinitialize-pods` binary that has just been built locally.
- Creates a new cluster at version `v1.27.6-k3s1` (version required for Calico to work)
- Triggers a new `./reinitialize-pods/integration/run.sh` script which:
  - Installs Calico
  - Installs the latest linkerd-edge CLI
  - Installs `linkerd-cni` and waits for it to become ready
  - Installs the linkerd control plane in CNI mode
  - Installs a `pause` DaemonSet

The `linkerd-cni` instance has been configured to include an extra initContainer that delays its start by 15s. Since we waited for it to become ready, this doesn't affect the initial install. But when a new node is then added to the cluster, the delay lets the new `pause` DaemonSet replica start before the full CNI config is ready, so we can observe its failure to come up. Once the new `linkerd-cni` replica becomes ready, we observe the failed `pause` replica being replaced by a new healthy one.
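
A minimal sketch of such a delaying initContainer, assuming a generic busybox image and a hypothetical container name (the actual manifest used by the test may differ):

```yaml
# Illustrative only: an extra initContainer on the linkerd-cni DaemonSet pod
# template that holds back startup for 15 seconds.
initContainers:
  - name: delay-startup        # hypothetical name
    image: busybox:1.36        # assumed image
    command: ["sleep", "15"]
```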
alpeb (Member, Author) commented Dec 12, 2023

> Could this also be marked with the Fixes #11735 tag?

I've replied to that issue. Given that it's a Talos system, it appears to be hitting #7945.

@ramsateesh

This is perfect. When can I test this?

alpeb (Member, Author) commented Dec 14, 2023

> This is perfect. When can I test this?

This should be shipped with an edge release in the coming weeks 🙂

alpeb added a commit to linkerd/linkerd2-proxy-init that referenced this pull request Jan 2, 2024
Fixes linkerd/linkerd2#11073

This fixes the issue of injected pods that cannot acquire a proper network config because `linkerd-cni` and/or the cluster's network CNI haven't fully started. They are left in a permanent crash loop, and once the CNI is ready they need to be restarted externally, which is what this controller does.

This controller, `linkerd-cni-repair-controller`, watches pod events on its node. When an injected pod is in a terminated state and its `linkerd-network-validator` container exited with code 95, the controller deletes the pod so it can restart with a proper network config.

The controller is to be deployed as an additional container in the `linkerd-cni` DaemonSet (addressed in linkerd/linkerd2#11699).

This exposes two custom counter metrics: `linkerd_cni_repair_controller_queue_overflow` (in the spirit of the destination controller's `endpoint_updates_queue_overflow`) and `linkerd_cni_repair_controller_deleted`.
alpeb changed the title from "Add reinitialize-pods controller to linkerd-cni DaemonSet" to "Add cni-repair-controller to linkerd-cni DaemonSet" on Jan 2, 2024
alpeb added a commit to linkerd/linkerd2-proxy-init that referenced this pull request Jan 2, 2024
(Note this will fail until linkerd/linkerd2#11699 lands)

The `integration-cni-plugin.yml` workflow (formerly known as `cni-plugin-integration.yml`) has been expanded to run the new recipe `cni-repair-controller-integration`, which performs the following steps:

- Rebuilds the `linkerd-cni-repair-controller` crate and `cni-plugin`
- Creates a new cluster at version `v1.27.6-k3s1` (version required for Calico to work)
- Triggers a new `./cni-repair-controller/integration/run.sh` script which:
  - Installs Calico
  - Installs the latest linkerd-edge CLI
  - Installs `linkerd-cni` and waits for it to become ready
  - Installs the linkerd control plane in CNI mode
  - Installs a `pause` DaemonSet (a sketch of this test workload follows below)

The `linkerd-cni` instance has been configured to include an extra initContainer that delays its start by 15s. Since we waited for it to become ready, this doesn't affect the initial install. But when a new node is then added to the cluster, the delay lets the new `pause` DaemonSet replica start before the full CNI config is ready, so we can observe its failure to come up. Once the new `linkerd-cni` replica becomes ready, we observe the failed `pause` replica being replaced by a new healthy one.
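
As a rough illustration, the `pause` test workload could be a DaemonSet along these lines; the exact manifest used by the integration script isn't reproduced here, and the injection annotation is an assumption based on the replicas needing to run `linkerd-network-validator`:

```yaml
# Illustrative sketch of an injected "pause" DaemonSet test workload
apiVersion: apps/v1
kind: DaemonSet
metadata:
  name: pause
spec:
  selector:
    matchLabels:
      app: pause
  template:
    metadata:
      labels:
        app: pause
      annotations:
        linkerd.io/inject: enabled   # assumed: pods must be meshed to hit the race
    spec:
      containers:
        - name: pause
          image: registry.k8s.io/pause:3.9   # assumed image/tag
```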
@alpeb alpeb marked this pull request as ready for review January 2, 2024 22:00
Review thread on charts/linkerd2-cni/templates/cni-plugin.yaml (resolved):
# Defaults to system-cluster-critical so it signals the scheduler to start
# before application pods, but after CNI plugins (whose priorityClassName is
# system-node-critical). This isn't strictly enforced.
priorityClassName: "system-cluster-critical"
Reviewer (Member):

Is this needed for the cni-repair controller specifically? If not, can you pull it out into a separate change?

If CNI plugins should run at system-node-critical, why wouldn't the Linkerd CNI run at system-node-critical? If we don't have that as a default now, is there a reason for that? I.e. are there any downsides to setting this as a default?

If we omit the priorityClassName change from this PR, the PR feels less risky to me.

alpeb (Member, Author) replied:

It's not required for this PR. The reasoning for using system-cluster-critical was to allow for the main CNI plugin in the cluster to run first, lessening the chance to run into the race condition the repair controller attempts to fix. But it appears these class names are either best-effort or the prioritization mechanism is simply not implemented as advertised, according to my testing. I'll remove this for now.

Further review threads on charts/linkerd2-cni/values.yaml (resolved; one marked outdated).
@olix0r olix0r merged commit 55d1049 into main Jan 5, 2024
35 checks passed
@olix0r olix0r deleted the alpeb/linkerd-reinitialize-pods branch January 5, 2024 17:28
mateiidavid added a commit that referenced this pull request Jan 12, 2024
This edge release introduces a number of fixes and improvements. Most notably,
it introduces a new `cni-repair-controller` binary to the CNI plugin image. The
controller will automatically restart pods that have not received their
iptables configuration.

* Removed shortnames from Tap API resources to avoid colliding with existing
  Kubernetes resources ([#11816]; fixes [#11784])
* Introduced a new ExternalWorkload CRD to support upcoming mesh expansion
  feature ([#11805])
* Changed `MeshTLSAuthentication` resource validation to allow SPIFFE URI
  identities ([#11882])
* Introduced a new `cni-repair-controller` to the `linkerd-cni` DaemonSet to
  automatically restart misconfigured pods that are missing iptables rules
  ([#11699]; fixes [#11073])
* Fixed a `"duplicate metrics"` warning in the multicluster service-mirror
  component ([#11875]; fixes [#11839])
* Added metric labels and weights to `linkerd diagnostics endpoints` json
  output ([#11889])
* Changed how `Server` updates are handled in the destination service. The
  change will ensure that during a cluster resync, consumers won't be
  overloaded by redundant updates ([#11907])
* Changed `linkerd install` error output to add a newline when a Kubernetes
  client cannot be successfully initialised ([#11917])

[#11816]: #11816
[#11784]: #11784
[#11805]: #11805
[#11882]: #11882
[#11699]: #11699
[#11073]: #11073
[#11875]: #11875
[#11839]: #11839
[#11889]: #11889
[#11907]: #11907
[#11917]: #11917

Signed-off-by: Matei David <[email protected]>
@mateiidavid mateiidavid mentioned this pull request Jan 12, 2024
adleong pushed a commit that referenced this pull request Jan 18, 2024
@adleong adleong mentioned this pull request Jan 18, 2024
adleong added a commit that referenced this pull request Jan 19, 2024
This stable release adds a cni-repair-controller which fixes the issue of
injected pods that cannot acquire proper network config because linkerd-cni
and/or the cluster's network CNI haven't fully started ([#11699]). It also
fixes a bug in the destination controller where having a large number of
Server resources could cause the destination controller to use an excessive
amount of CPU ([#11907]). Finally, it fixes a conflict with tap resource
shortnames which was causing warnings from kubectl v1.29.0+ ([#11816]).

[#11699]: #11699
[#11907]: #11907
[#11816]: #11816
alpeb added a commit to linkerd/linkerd2-proxy-init that referenced this pull request Jan 22, 2024
@nathanmcgarvey-modopayments

Does this further the fix for #8120 as well? This seems like it was #2 in that issue's proposed fixes.

alpeb (Member, Author) commented Jan 24, 2024

> Does this further the fix for #8120 as well? This seems like it was #2 in that issue's proposed fixes.

That is correct 👍

Successfully merging this pull request may close these issues.

When linkerd-network-validator catches missing iptables config, pod is left in a failure state
5 participants