
k8s: Error out when annotation can not be set #13711

Merged

Conversation

RafalKorepta
Contributor

@RafalKorepta RafalKorepta commented Sep 27, 2023

Given the following operator logic

```
// The following should be at the last part as it requires AdminAPI to be running
if err := r.setPodNodeIDAnnotation(ctx, &vectorizedCluster, log, ar); err != nil {
	return ctrl.Result{}, fmt.Errorf("setting pod node_id annotation: %w", err)
}
if err := r.setPodNodeIDLabel(ctx, &vectorizedCluster, log, ar); err != nil {
	return ctrl.Result{}, fmt.Errorf("setting pod node_id label: %w", err)
}
if err := r.decommissionGhostBrokers(ctx, &vectorizedCluster, log, ar); err != nil {
	return ctrl.Result{}, fmt.Errorf("deleting ghost brokers: %w", err)
}
```

The chain of events that causes wrong decommission logic is:

  • setPodNodeIDAnnotation failed to get the Redpanda Node ID
  • setPodNodeIDAnnotation does not return the error; it only logs it
  • setPodNodeIDLabel was able to get the Redpanda Node ID, so the broker was able to boot up and register its Node ID
  • decommissionGhostBrokers filters based on the Pod annotation, which is now incorrect

Logs from the wrong execution:

```
{"level":"info","ts":"2023-09-27T06:51:41.644Z","logger":"ClusterReconciler.Reconcile.setPodNodeIDLabel","msg":"setting node-id label","controller":"cluster","controllerGroup":"redpanda.vectorized.io","controllerKind":"Cluster","Cluster":{"name":"repanda-cluster","namespace":"redpanda"},"namespace":"redpanda","name":"repanda-cluster","reconcileID":"57bf8b57-60b5-4248-bedd-3ffcaa5c4e07","pod-name":"repanda-cluster-1","new-node-id":3}
{"level":"info","ts":"2023-09-27T06:51:41.718Z","logger":"ClusterReconciler.Reconcile.setPodNodeIDAnnotation","msg":"decommission old node-id","controller":"cluster","controllerGroup":"redpanda.vectorized.io","controllerKind":"Cluster","Cluster":{"name":"repanda-cluster","namespace":"redpanda"},"namespace":"redpanda","name":"repanda-cluster","reconcileID":"fa64d3d8-02cf-4c1f-8a67-13a2dfb339ea","pod-name":"repanda-cluster-1","old-node-id":1}
{"level":"info","ts":"2023-09-27T06:51:41.758Z","logger":"ClusterReconciler.Reconcile.setPodNodeIDAnnotation","msg":"setting node-id annotation","controller":"cluster","controllerGroup":"redpanda.vectorized.io","controllerKind":"Cluster","Cluster":{"name":"repanda-cluster","namespace":"redpanda"},"namespace":"redpanda","name":"repanda-cluster","reconcileID":"fa64d3d8-02cf-4c1f-8a67-13a2dfb339ea","pod-name":"repanda-cluster-1","new-node-id":3}
{"level":"error","ts":"2023-09-27T06:51:42.497Z","msg":"Reconciler error","controller":"cluster","controllerGroup":"redpanda.vectorized.io","controllerKind":"Cluster","Cluster":{"name":"repanda-cluster","namespace":"redpanda"},"namespace":"redpanda","name":"repanda-cluster","reconcileID":"fa64d3d8-02cf-4c1f-8a67-13a2dfb339ea","error":"deleting ghost brokers: failed to decommission ghost broker: request PUT http://repanda-cluster-0.repanda-cluster.redpanda.svc.cluster.local/.:9644/v1/brokers/1/decommission failed: Bad Request, body: \"{\\\"message\\\": \\\"can not update broker 1 state, invalid state transition requested\\\", \\\"code\\\": 400}\"\n"}
```
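The filtering hazard can be sketched as follows. This is a minimal illustration, not the operator's actual code: the annotation key and function shape are assumptions. The point is that any broker whose ID is not reflected in a Pod annotation gets classified as a ghost, so a silently failed annotation update makes a live broker a decommission target.

```go
package main

import (
	"fmt"
	"strconv"
)

// nodeIDAnnotation is a hypothetical annotation key used for illustration.
const nodeIDAnnotation = "operator.redpanda.com/node-id"

// ghostBrokers mimics the hazard in decommissionGhostBrokers: any broker
// whose ID does not appear in a Pod's node-id annotation is treated as a
// ghost. If setPodNodeIDAnnotation silently failed, the annotations are
// stale and a live broker can be queued for decommission.
func ghostBrokers(brokerIDs []int, podAnnotations []map[string]string) []int {
	known := map[int]bool{}
	for _, ann := range podAnnotations {
		if v, ok := ann[nodeIDAnnotation]; ok {
			if id, err := strconv.Atoi(v); err == nil {
				known[id] = true
			}
		}
	}
	var ghosts []int
	for _, id := range brokerIDs {
		if !known[id] {
			ghosts = append(ghosts, id)
		}
	}
	return ghosts
}

func main() {
	// The Pod restarted and registered as node 3, but its annotation
	// still says 1 because the update was only logged, never retried:
	// live node 3 is misclassified as a ghost.
	stale := []map[string]string{{nodeIDAnnotation: "1"}}
	fmt.Println(ghostBrokers([]int{3}, stale)) // prints [3]
}
```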

Backports Required

  • none - not a bug fix
  • none - this is a backport
  • none - issue does not exist in previous branches
  • none - papercut/not impactful enough to backport
  • v23.2.x
  • v23.1.x
  • v22.3.x

Release Notes

Bug Fixes

  • do not allow unsafe decommission procedure if annotation was not changed

@RafalKorepta RafalKorepta requested a review from a team as a code owner September 27, 2023 09:27
@RafalKorepta RafalKorepta force-pushed the rk/fix-decommission-process branch 2 times, most recently from c3f796e to b7a1b25 Compare September 27, 2023 14:16
alejandroEsc
alejandroEsc previously approved these changes Sep 27, 2023
Contributor

@alejandroEsc alejandroEsc left a comment


lgtm, it seems the major change was checking and returning early if the broker's ordinal matches the nodeID? Was this the case you were referring to in your preamble?

@@ -132,6 +132,7 @@ var _ = Describe("Redpanda cluster scale resource", func() {

By("Scaling down only when decommissioning is done")
Expect(testAdminAPI.RemoveBroker(2)).To(BeTrue())
testAdminAPI.AddGhostBroker(admin.Broker{NodeID: 2, MembershipStatus: admin.MembershipStatusDraining})
Contributor


interesting, so this needed to have locks so sequence is important?

Contributor Author


Our reconciliation has changed and we now require a Pod annotation with the Redpanda Node ID. This test was written without that constraint. To mimic a real-world example I added a ghost broker function.
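A ghost-broker helper for a test mock can be sketched roughly like this. The type and method names below are a hypothetical simplification of the test's `testAdminAPI` / `admin.Broker`, not the repository's actual definitions: the idea is to register a broker in the cluster view that has no backing Pod, so the controller's ghost-broker filtering has something to act on.

```go
package main

import "fmt"

// Broker mirrors the fields used in the test snippet above (simplified).
type Broker struct {
	NodeID           int
	MembershipStatus string
}

// MockAdminAPI is a hypothetical stand-in for the test's admin API mock.
type MockAdminAPI struct {
	brokers []Broker
}

// AddGhostBroker registers a broker in the mock cluster membership without
// a backing Pod, mimicking a node that registered a Node ID the operator
// never recorded in a Pod annotation.
func (m *MockAdminAPI) AddGhostBroker(b Broker) {
	m.brokers = append(m.brokers, b)
}

// Brokers returns the mock cluster membership.
func (m *MockAdminAPI) Brokers() []Broker {
	return m.brokers
}

func main() {
	api := &MockAdminAPI{}
	api.AddGhostBroker(Broker{NodeID: 2, MembershipStatus: "draining"})
	fmt.Println(len(api.Brokers())) // prints 1
}
```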

@RafalKorepta
Contributor Author

it seems the major change was checking and returning early if the broker's ordinal matches the nodeID?

I don't fully understand.

was this the case you were referring to in your preamble?

As I described in the cover letter, the problem is with timing. We can not run the unsafe decommission function if the annotation on every Pod doesn't match reality.

@RafalKorepta RafalKorepta force-pushed the rk/fix-decommission-process branch 21 times, most recently from 630c0a1 to 37b618b Compare October 3, 2023 11:22
Contributor

@alejandroEsc alejandroEsc left a comment


Can you move the v2 tests changes in another PR to better isolate problems?

When a broker is removed from the mock admin API, the controller always fails because the Pod annotation with the Node ID can not be set. Decommission is never called, so the StatefulSet is not updated, and the integration tests get stuck.
@RafalKorepta RafalKorepta force-pushed the rk/fix-decommission-process branch from 37b618b to 55835ab Compare October 3, 2023 18:21
@RafalKorepta
Contributor Author

you move the v2 tests changes in another PR to better isolate

Here you go #13898.

I removed all commits that were trying to fix the e2e v2 tests. Please review.

@alejandroEsc alejandroEsc self-requested a review October 3, 2023 19:32
Contributor

@alejandroEsc alejandroEsc left a comment


lgtm, a lot of maintenance changes but good cleaning

@RafalKorepta RafalKorepta merged commit 0eecc50 into redpanda-data:dev Oct 3, 2023
15 checks passed
@vbotbuildovich
Collaborator

/backport v23.2.x
