-
Notifications
You must be signed in to change notification settings - Fork 143
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add active alerts for alternator latencies #2402
Comments
There are a few issues with the suggested alert, the main one being that looking at a bucket like that most likely doesn't do what you expect it to do. I suggest using a P99 or P95 and setting some hardcoded limit, similar to what we do with CQL. |
Hi @amnonh, Thanks for the reply. I have some followup questions: I inspected the "P95 Latencies" panel on the "Alternator" dashboard, and I see the following expression:
Would a similar calculation be used in an alert like this? Or would there be a better way to do this? |
Hi @wpaven @ruthea @ManjotS @pdbossman, We should probably be specific about what we want to alert on. Something like this?: Alert if the p95 latency for Or would someone define it differently? Or do we want to define alerts for each operation, not just |
I would rather look for patterns because it will differ per customer and cluster. Here is my latest iteration
|
latest:
|
We don't want this one widely distributed. This should be an SRE issue. |
Please close this issue |
Please make sure that this is a feature request.
System information
Describe the feature and the current behavior/state.
Scylla Monitoring provides great panels showing Alternator latencies, but it does not currently provide alerts when latencies deviate from the average.
Who will benefit with this feature?
Users who use Alternator and depend on low latencies.
Any Other info.
Here is my alert configuration:
The text was updated successfully, but these errors were encountered: