Decrease security config updater poll interval to 20s #1069

maxlepikhin · 2025-08-20T16:01:38Z

Description

Reduce poll interval from 120s to 20s.

Issues Resolved

Slow cluster start-up time.

Check List

[x] Commits are signed per the DCO using --signoff
Unittest added for the new/changed functionality and all unit tests are successful
Customer-visible features documented
No linter warnings (make lint)

If CRDs are changed:

CRD YAMLs updated (make manifests) and also copied into the helm chart
Changes to CRDs documented

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

Signed-off-by: Max Lepikhin <[email protected]>

synhershko · 2025-09-06T19:06:17Z

LGTM - tested, and it is indeed a necessary fix.

prudhvigodithi · 2025-09-08T21:53:46Z

opensearch-operator/pkg/reconcilers/securityconfig.go

 until curl -k --silent https://%s:%v;
 do
-echo 'Waiting to connect to the cluster'; sleep 120;
+echo 'Waiting to connect to the cluster'; sleep 20;


This was added as part of https://github.com/opensearch-project/opensearch-k8s-operator/pull/198/files#diff-3f3f25087560ff69bb8867115997c9c8a5764ce6eebc95577072ad615051db3bR750. When I initially tested on EKS with EBS, the security config pod failed because the cluster took time to start. This was when OpenSearch 2.0.0 was released.

Is there a better way here, such as polling a cluster health API until the cluster is fully ready with all nodes connected, and only then running the security config? Or should we leave it as it is?
@maxlepikhin @rursprung @rootxrishabh @synhershko

rursprung · 2025-09-09

I'm not using this operator, but if your cluster has the security plugin installed, you could poll the health endpoint at /_plugins/_security/health. It does not need authentication and returns HTTP 200 if the plugin is alive.

synhershko · 2025-09-09

Agreed, this is not ideal, but the 120-second wait doesn't make sense. @prudhvigodithi let's get this merged; can you open an issue to discuss the right way to perform readiness checks for all possible scenarios?
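As a rough sketch of the health-endpoint idea raised in this thread (not what this PR implements), the wait loop could probe /_plugins/_security/health instead of the root URL and stop after a bounded number of attempts. The function names wait_until and security_health_ok, the attempt limit, the curl timeout, and the HOST/PORT variables are hypothetical; only the endpoint path and the 20-second interval come from the discussion.

```shell
#!/bin/sh
# Sketch only: helper names, the attempt limit and the curl timeout are assumptions.

# wait_until INTERVAL MAX_ATTEMPTS PROBE [ARGS...]
# Runs PROBE until it succeeds, sleeping INTERVAL seconds between tries.
# Returns 1 once PROBE has failed MAX_ATTEMPTS times.
wait_until() {
  interval=$1
  max_attempts=$2
  shift 2
  attempt=1
  while ! "$@"; do
    if [ "$attempt" -ge "$max_attempts" ]; then
      echo 'Gave up waiting for the cluster' >&2
      return 1
    fi
    echo 'Waiting to connect to the cluster'
    sleep "$interval"
    attempt=$((attempt + 1))
  done
}

# security_health_ok HOST PORT
# Succeeds only when the security plugin health endpoint answers without an
# HTTP error; --fail makes curl exit non-zero on HTTP status 400 and above.
security_health_ok() {
  curl -k --silent --fail --max-time 5 --output /dev/null \
    "https://$1:$2/_plugins/_security/health"
}

# Example call (HOST and PORT are placeholders):
# wait_until 20 30 security_health_ok "$HOST" "$PORT"
```

Bounding the attempts makes a cluster that never comes up fail visibly instead of looping forever. Whether that behavior is wanted for the security config updater is a question for the follow-up issue.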

Decrease security config updater poll interval to 20s

7ac434c

Signed-off-by: Max Lepikhin <[email protected]>

maxlepikhin requested review from idanl21, swoehrl-mw, prudhvigodithi, jochenkressin, pchmielnik and salyh as code owners August 20, 2025 16:01

github-project-automation bot added this to Engineering Effectiveness Board Aug 20, 2025

prudhvigodithi reviewed Sep 8, 2025

fix test

7e2ae0f

Signed-off-by: Max Lepikhin <[email protected]>

maxlepikhin force-pushed the feat-963 branch from 6e70fd3 to 7e2ae0f September 10, 2025 22:31
