Predefined PrometheusRules
This page lists the PrometheusRules that KubeStack+ ships out of the box. You only need to route their alerts to your channels; there is no need to write the rules yourself.
How it works
The rules below are deployed and evaluated by the platform's Mimir ruler against every workload on the cluster. When a rule fires, the alert is routed through the same Alertmanager that handles your custom alerts. Your AlertmanagerConfig (see Configure application alerting) picks these alerts up automatically based on namespace.
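As a rough illustration of the routing described above, the following is a minimal sketch of an AlertmanagerConfig that forwards the predefined alerts for one namespace to a webhook. It assumes the prometheus-operator `monitoring.coreos.com/v1alpha1` schema. The resource name, namespace, receiver name, and URL are hypothetical placeholders, and the configuration KubeStack+ actually expects may differ (see Configure application alerting).

```yaml
# Sketch only: every name and the URL below are hypothetical placeholders.
apiVersion: monitoring.coreos.com/v1alpha1
kind: AlertmanagerConfig
metadata:
  name: team-alerts            # hypothetical resource name
  namespace: my-app            # hypothetical; alerts are picked up based on this namespace
spec:
  route:
    receiver: team-webhook     # default receiver for alerts from this namespace
    groupBy: ["alertname", "namespace"]
    routes:
      # Optional child route: select a subset of the predefined alerts by name.
      - receiver: team-webhook
        matchers:
          - name: alertname
            matchType: "=~"
            value: "KubeHpa.*"
  receivers:
    - name: team-webhook
      webhookConfigs:
        - url: https://example.invalid/alert-hook   # placeholder endpoint
```

Because matching is namespace-based, a configuration like this would presumably receive both your custom alerts and the predefined ones listed below without naming each alert explicitly.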
What you get
Kubernetes apps
| Name | Description |
|---|---|
| KubePodCrashLooping | Pod Namespace/Pod is restarting N times per 5 minutes. |
| KubePodNotReady | Pod Namespace/Pod has been in a non-ready state for longer than 15 minutes. |
| KubeDeploymentGenerationMismatch | Deployment generation for Namespace/Deployment does not match; this indicates that the Deployment has failed but has not been rolled back. |
| KubeDeploymentReplicasMismatch | Deployment Namespace/Deployment has not matched the expected number of replicas for longer than 15 minutes. |
| KubeStatefulSetReplicasMismatch | StatefulSet Namespace/StatefulSet has not matched the expected number of replicas for longer than 15 minutes. |
| KubeStatefulSetGenerationMismatch | StatefulSet generation for Namespace/StatefulSet does not match; this indicates that the StatefulSet has failed but has not been rolled back. |
| KubeStatefulSetUpdateNotRolledOut | StatefulSet Namespace/StatefulSet update has not been rolled out. |
| KubeDaemonsetRolloutStuck | DaemonSet Namespace/DaemonSet has not finished or progressed for at least 15 minutes. |
| KubeContainerWaiting | Pod Namespace/Pod container Container has been in waiting state for longer than 1 hour. |
| KubeDaemonsetNotScheduled | Pods of DaemonSet Namespace/DaemonSet are not scheduled. |
| KubeJobCompletion | Job Namespace/Job is taking more than 12 hours to complete. |
| KubeJobFailed | Job Namespace/Job failed to complete. Removing the failed Job after investigation should clear this alert. |
| KubeHpaReplicasMismatch | HPA (Horizontal Pod Autoscaler) Namespace/HPA has not matched the desired number of replicas for longer than 15 minutes. |
| KubeHpaMaxedOut | HPA (Horizontal Pod Autoscaler) Namespace/HPA has been running at max replicas for longer than 15 minutes. |
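To help read the table, here is a sketch of what one of these rules might look like as a PrometheusRule resource. It is modelled on the upstream kubernetes-mixin definition of KubeJobFailed. The exact expression, `for` duration, severity, and labels that KubeStack+ ships are not stated on this page and may differ, and the metadata names are hypothetical. The capitalised placeholders in the descriptions above (Namespace, Job, and so on) correspond to label templates like the ones in the annotation.

```yaml
# Sketch based on the upstream kubernetes-mixin; not necessarily what KubeStack+ deploys.
apiVersion: monitoring.coreos.com/v1
kind: PrometheusRule
metadata:
  name: example-kubernetes-apps   # hypothetical name
  namespace: monitoring           # hypothetical namespace
spec:
  groups:
    - name: kubernetes-apps
      rules:
        - alert: KubeJobFailed
          # Fires when kube-state-metrics reports a Job in the failed state.
          expr: kube_job_failed{job="kube-state-metrics"} > 0
          for: 15m                # assumed duration, taken from upstream
          labels:
            severity: warning     # assumed severity, taken from upstream
          annotations:
            description: >-
              Job {{ $labels.namespace }}/{{ $labels.job_name }} failed to complete.
              Removing failed job after investigation should clear this alert.
```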
Kubernetes storage
| Name | Description |
|---|---|
| KubePersistentVolumeFillingUp | The PersistentVolume claimed by PersistentVolumeClaim in namespace Namespace is only Percentage free. |
| KubePersistentVolumeErrors | The persistent volume PersistentVolume has status Failed/Pending. |
Next step
Continue to Log alerts to alert on log content as well as metrics.