Deploy kube-prometheus-stack for Cluster Monitoring #47

mischavandenburg · 2025-01-11T16:19:48Z

Deploy kube-prometheus-stack for Cluster Monitoring

Description

We need to implement the kube-prometheus-stack Helm chart to establish comprehensive monitoring and alerting for our k3s cluster. This stack includes Prometheus, Grafana, AlertManager, and various exporters to provide full observability of our cluster and applications.

Requirements

Deploy kube-prometheus-stack via GitOps using Helm
Configure persistent storage for metrics
Set up proper ingress with TLS
Implement basic alerting rules
Configure Grafana dashboards for key metrics

Technical Considerations

Resource requirements and limits
Retention period for metrics
Integration with cert-manager for TLS
Storage class selection for persistence
Integration with identity provider for authentication

Tasks

Initial Stack Deployment
- Create Helm values configuration
- Configure persistent storage
- Test basic functionality
Grafana Setup
- Configure SSO with identity provider
- Import essential dashboards
- Set up data sources
- Configure persistent storage
- Make proposal for user management
AlertManager Configuration
- Set up basic alert rules
- Configure notification channels (Slack)
- Test alert delivery

Acceptance Criteria

Stack is successfully deployed and operational
Persistent storage is properly configured
Ingress with TLS is working
Basic alerts are functional
Key dashboards are available
Authentication is working
Metrics retention is configured
Documentation is complete

Important Dashboards to Include

Node metrics
Kubernetes cluster overview
Persistent volumes
API server metrics
etcd metrics
cert-manager status
CloudNativePG databases
ArgoCD status

Basic Alerts to Implement

Node status
Pod status
Storage capacity
Certificate expiration
Backup status
High CPU/Memory usage
Persistent volume status

Additional Notes

Document all the setup steps for new deployments in the future

vikramreddym · 2025-01-12T22:49:05Z

I am interested in working on this.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Deploy kube-prometheus-stack for Cluster Monitoring #47

Deploy kube-prometheus-stack for Cluster Monitoring #47

mischavandenburg commented Jan 11, 2025

vikramreddym commented Jan 12, 2025

Deploy kube-prometheus-stack for Cluster Monitoring #47

Deploy kube-prometheus-stack for Cluster Monitoring #47

Comments

mischavandenburg commented Jan 11, 2025

Deploy kube-prometheus-stack for Cluster Monitoring

Description

Requirements

Technical Considerations

Tasks

Acceptance Criteria

Important Dashboards to Include

Basic Alerts to Implement

Additional Notes

vikramreddym commented Jan 12, 2025