What would you like to be added:
The Redis health check requires all nodes to be available before it returns healthy.
We recently got caught by this because we use the health check for our container's readiness and liveness probes. During Redis cluster maintenance we upgrade node by node, so we have to take one node down at a time. Taking a node down causes the health check to return unhealthy, and hence our application goes down.
Why is this needed:
This seems unexpected to me because a Redis cluster keeps working if just one slave node is down. It may be unstable, but it still works.
Is this checking behavior expected?
Is there any risk if we simply connect to one node and check that it reports cluster_state:ok?
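To make the single-node idea concrete, here is a minimal sketch of the check itself. It assumes the raw text reply of the Redis CLUSTER INFO command has already been fetched from one node by whatever client the application uses; the function name `cluster_state_ok` and the sample replies are hypothetical, not part of any health check library.

```python
def cluster_state_ok(cluster_info_reply: str) -> bool:
    """Return True only if the CLUSTER INFO text reports cluster_state:ok."""
    for line in cluster_info_reply.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip() == "cluster_state":
            return value.strip() == "ok"
    # Field missing (for example, the node is not in cluster mode):
    # treat the reply as unhealthy rather than guessing.
    return False


# Sample replies, abbreviated to a couple of fields (illustrative values).
healthy_reply = "cluster_state:ok\r\ncluster_slots_assigned:16384\r\n"
failed_reply = "cluster_state:fail\r\ncluster_slots_assigned:16380\r\n"

print(cluster_state_ok(healthy_reply))
print(cluster_state_ok(failed_reply))
```

Note that this only reflects the view of the one node that was queried, so a probe built this way still fails if that particular node is the one taken down for maintenance.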
I am facing a similar problem: my system is still healthy as long as N nodes are up, because it only needs a quorum of N Redis nodes to do work. I am exploring a custom health check to implement this logic. However, since this use case seems common for Redis, it would be nice to have a built-in aggregated health check for multiple nodes that still reports healthy as long as N nodes are up.
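As a rough sketch of the aggregation logic such a custom check might use, the snippet below counts how many per-node probes succeed and reports healthy once a required number is reached. Everything here is an assumption for illustration: `quorum_healthy`, the probe callables, and the `node_up`/`node_down` stand-ins are hypothetical, and a real probe would ping an actual Redis node.

```python
from typing import Callable, Iterable


def quorum_healthy(probes: Iterable[Callable[[], bool]], required: int) -> bool:
    """Report healthy once at least `required` node probes succeed."""
    up = 0
    for probe in probes:
        try:
            if probe():
                up += 1
        except Exception:
            # A probe that raises (e.g. connection refused) counts as a down node.
            pass
        if up >= required:
            return True  # Quorum reached; no need to probe the remaining nodes.
    return up >= required


# Stand-in probes; a real one would issue PING (or similar) to a single node.
def node_up() -> bool:
    return True


def node_down() -> bool:
    raise ConnectionError("node unreachable")


# Three nodes, one down for maintenance, quorum of two.
print(quorum_healthy([node_up, node_down, node_up], required=2))
```

With a quorum of two, a rolling upgrade that takes down one of three nodes at a time would keep this check healthy, while losing two nodes would not.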