Skip to content

feat(monitoring): add PrometheusRule with Stellar alerts#343

Open
Petah1 wants to merge 8 commits intoOtowoOrg:mainfrom
Petah1:blackboxai/prometheus-alerts-rules
Open

feat(monitoring): add PrometheusRule with Stellar alerts#343
Petah1 wants to merge 8 commits intoOtowoOrg:mainfrom
Petah1:blackboxai/prometheus-alerts-rules

Conversation

@Petah1
Copy link
Copy Markdown

@Petah1 Petah1 commented Mar 26, 2026

Adds charts/stellar-operator/templates/prometheusrule.yaml with 4 production alerts:

  1. StellarNodeSyncLag: ledger_age/ingestion_lag > 100 ledgers (warning 5m)
  2. StellarNodeMemoryPressure: RSS > 90% limit (warning 10m)
  3. StellarOperatorReconcileErrors: error rate > 0.1/5m (critical)
  4. StellarHistoryArchiveUnresponsive: Horizon ingestion_lag > 300s (critical)

Configurable via values.prometheusAlerts.* Helm values. Enabled by default.

Tested: helm lint, template renders correctly.
No code changes, cargo test passes.

closes #224

DELL USER and others added 6 commits March 26, 2026 14:31
Adds charts/stellar-operator/templates/prometheusrule.yaml with 4 production alerts:

1. StellarNodeSyncLag: ledger_age/ingestion_lag > 100 ledgers (warning 5m)
2. StellarNodeMemoryPressure: RSS > 90% limit (warning 10m)
3. StellarOperatorReconcileErrors: error rate > 0.1/5m (critical)
4. StellarHistoryArchiveUnresponsive: Horizon ingestion_lag > 300s (critical)

Configurable via values.prometheusAlerts.* Helm values.
Enabled by default.

Tested: helm lint, template renders correctly.
No code changes, cargo test passes.

closes prometheus alerts AC.
…ng, add icon

- values.yaml: prometheusAlerts  monitoring.prometheusAlerts (fixes schema violation)
- templates/prometheusrule.yaml: Update template paths
- Chart.yaml: Add recommended icon
- helm lint now passes cleanly
Fixes helm lint errors:
- undefined  var
- YAML parse at line 26
- humanize not defined

Changes:
- \{{ \{{  }} \ }}\ literals
- Removed filter
- Fixed YAML/indent

helm lint clean.
@Petah1 Petah1 force-pushed the blackboxai/prometheus-alerts-rules branch from 29c6417 to 75b4c8f Compare March 29, 2026 23:00
@OtowoSamuel
Copy link
Copy Markdown
Contributor

@Petah1
Please resolve your merge conflict

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Proactive Alerting: PrometheusRule manifests for Stellar Node Health

2 participants