KubeNodeNotReady¶
Meaning¶
KubeNodeNotReady alert is fired when a Kubernetes node is not in Ready
state for a certain period. In this case, the node is not able to host any new
pods as described [here][KubeNode].
Impact¶
The performance of the cluster deployments is affected, depending on the overall workload and the type of the node.
Diagnosis¶
The notification details should list the node that's not ready. For Example:
- alertname = KubeNodeNotReady
...
- node = node1.example.com
...
Login to the cluster. Check the status of that node:
$ kubectl get node $NODE -o yaml
The output should describe why the node isn't ready (e.g.: timeouts reaching the API or kubelet).
Mitigation¶
Once, the problem was resolved that prevented node from being replaced, the instance should be terminated.
See KubeNode See node problem detector