Configure network-validator and repair-controller to work with IPv6 #12874

alpeb · 2024-07-22T22:35:42Z

The linkerd-cni network-validator container was binding to the IPv4 wildcard and connecting to an IPv4 address. This wasn't breaking things in IPv6 clusters but it was only validating the iptables rules and not the ip6tables ones. This change introduces logic to use addresses according to the value of disableIPv6. If IPv6 is enabled, then the ip6tables rules would get exercised. Note that a more complete change would also exercise both iptables and ip6tables, but for now we're defaulting to ip6tables.

Similarly was the case with repair-controller, but given the IPv4 wildcard was used for the admin server, in IPv6 clusters the kubelet wasn't able to reach the probe endpoints and the container was failing. In this case the fix is just have the admin server bind to [::], which works for IPv4 and IPv6 clusters.

mateiidavid · 2024-07-23T11:38:39Z

charts/partials/templates/_network-validator.tpl

@@ -24,9 +24,13 @@ args:
  - --log-level
  - {{ .Values.networkValidator.logLevel }}
  - --connect-addr
-  - {{ .Values.networkValidator.connectAddr }}
+    {{- if .Values.disableIPv6 }}
+  - "1.1.1.1:{{ .Values.networkValidator.connectPort }}"


So the only problem with this is that users can no longer modify the target now, right? This would probably change the behaviour of the validator. In air gapped environment, for example, if 1.1.1.1 is not accessible, the connection will fail.

Can users still determine whether the failure was caused due to missing iptables rules as opposed to properties of their network? The validator once connected expects a random string to be returned from the server. Obviously with any other host that wouldn't happen, so if the connection succeeded but the actual payload was wrong, we deduce iptables is incorrectly set-up.

Do you think this is a problem? Chances are, the error will be different any way, but I worry that might remove from the strictness of the check.

As for the first point, maybe it is possible to have connectHost: "" in the values.yaml. Which will default to 1.1.1.1 or [fd00::1] if empty and depending on disableIPv6. Otherwise, the address specified would be used. That should provide the flexibility required.

Those are great points. @mateiidavid and @lwj5. I'll add the additional config 👍

@mateiidavid

Fixes #12864 The linkerd-cni network-validator container was binding to the IPv4 wildcard and connecting to an IPv4 address. This wasn't breaking things in IPv6 clusters but it was only validating the iptables rules and not the ip6tables ones. This change introduces logic to use addresses according to the value of `disableIPv6`. If IPv6 is enabled, then the ip6tables rules would get exercised. Note that a more complete change would also exercise both iptables and ip6tables, but for now we're defaulting to ip6tables. This implied changing the helm value `networkValidator.connectAddr` to `connectPort`. @mateiidavid could you please validate if this entry with its simplified doc still makes sense, in light of #12797 ? Similarly was the case with repair-controller, but given the IPv4 wildcard was used for the admin server, in IPv6 clusters the kubelet wasn't able to reach the probe endpoints and the container was failing. In this case the fix is just have the admin server bind to `[::]`, which works for IPv4 and IPv6 clusters.

alpeb · 2024-07-23T17:43:13Z

Ok I ended up putting back the connectAddr config (and not adding connectPort), but defaulting it to empty, to allow the IP family conditionals to operate. If set, it'll just be used. LMKWYT!

mateiidavid

Thanks for the change @alpeb! LGTM

mateiidavid · 2024-07-24T08:54:08Z

charts/linkerd-control-plane/values.yaml

  # for public clusters and a private IP for air-gapped clusters with a port like 20001.
-  connectAddr: "1.1.1.1:20001"
+  # If empty, defaults to 1.1.1.1:20001 and [fd00::1]:20001 for IPv4 and IPv6 respectively.
+  connectAddr:


nit: should we just use "" (in Go it'll be initialised to an empty string anyway)? we use it in some of our config fields, e.g. injectCaFromSecret. You have a better feel than I do for Helm & config surface area in charts than I do though, so I'm ok with whatever decision you make :)

Fair enough, I'll change it to an explicit empty string for consistency, and to not give the impression this could hold something else besides a string.

alpeb requested a review from a team as a code owner July 22, 2024 22:35

alpeb requested a review from mateiidavid July 22, 2024 22:35

alpeb mentioned this pull request Jul 22, 2024

Linkerd CNI repair controller does not listen on IPv6 #12864

Closed

mateiidavid reviewed Jul 23, 2024

View reviewed changes

alpeb added 2 commits July 23, 2024 11:54

Put connectAddr back in, remove connectPort

6d98453

alpeb force-pushed the alpeb/cni-ipv6-wildcard branch from bfeb7b4 to 6d98453 Compare July 23, 2024 17:43

mateiidavid approved these changes Jul 24, 2024

View reviewed changes

Empty string

1e9190a

alpeb merged commit 6603409 into main Jul 24, 2024
41 checks passed

alpeb deleted the alpeb/cni-ipv6-wildcard branch July 24, 2024 14:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Configure network-validator and repair-controller to work with IPv6 #12874

Configure network-validator and repair-controller to work with IPv6 #12874

alpeb commented Jul 22, 2024 •

edited

Loading

mateiidavid Jul 23, 2024

lwj5 Jul 23, 2024 •

edited

Loading

alpeb Jul 23, 2024

alpeb commented Jul 23, 2024

mateiidavid left a comment

mateiidavid Jul 24, 2024

alpeb Jul 24, 2024

Configure network-validator and repair-controller to work with IPv6 #12874

Configure network-validator and repair-controller to work with IPv6 #12874

Conversation

alpeb commented Jul 22, 2024 • edited Loading

mateiidavid Jul 23, 2024

Choose a reason for hiding this comment

lwj5 Jul 23, 2024 • edited Loading

Choose a reason for hiding this comment

alpeb Jul 23, 2024

Choose a reason for hiding this comment

alpeb commented Jul 23, 2024

mateiidavid left a comment

Choose a reason for hiding this comment

mateiidavid Jul 24, 2024

Choose a reason for hiding this comment

alpeb Jul 24, 2024

Choose a reason for hiding this comment

alpeb commented Jul 22, 2024 •

edited

Loading

lwj5 Jul 23, 2024 •

edited

Loading