Question for HealthChecks.Redis behavior #2288

cwhsu1984 · 2024-08-20T05:29:17Z

What would you like to be added:

The checking for redis requires all nodes to be availble for it to return healthy.
We recenlt get caught in an issue due to we use the healthcheck for readiness and liveness probe for container, During our redis cluster maintenance, we need to upgrade node by node. We need to take down a node which cause the healthcheck to return unhealthy in the response, and hence, our application is down.

Why is this needed:

The reason that I feel this is unexpected is because redis cluster is still working if just one slave node down. It may be unstable, but still working.
Is this checking behavior is expected?
Is there any risk if we simply connect to one node and make sure cluster_state:ok?

The text was updated successfully, but these errors were encountered:

idkburkes · 2024-08-30T17:02:01Z

I am facing a similar problem statement where a certain number N nodes up still indicates my system is healthy due to only needing a quorum of N Redis nodes to do work. I am exploring a custom health check to implement this logic. However, since the use case is seemingly common for Redis it would be nice to have built-in aggregated healthcheck for multiple nodes with requirement that N nodes up is still healthy.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question for HealthChecks.Redis behavior #2288

Question for HealthChecks.Redis behavior #2288

cwhsu1984 commented Aug 20, 2024

idkburkes commented Aug 30, 2024

Question for HealthChecks.Redis behavior #2288

Question for HealthChecks.Redis behavior #2288

Comments

cwhsu1984 commented Aug 20, 2024

idkburkes commented Aug 30, 2024