NodeRAIDDegraded¶
Meaning¶
RAID Array is degraded.
This alert is triggered when a node has a storage configuration with RAID array, and the array is reporting as being in a degraded state due to one or more disk failures.
Impact¶
The affected node could go offline at any moment if the RAID array fully fails due to further issues with disks.
Diagnosis¶
You can open a shell on the node and use the standard Linux utilities to diagnose the issue, but you may need to install additional software in the debug container:
$ NODE_NAME='<value of instance label from alert>'
$ oc debug "node/$NODE_NAME"
$ cat /proc/mdstat
Mitigation¶
Cordon and drain node if possible, proceed to RAID recovery.
See the Red Hat Enterprise Linux [documentation][1] for potential steps.