[Pkg-nagios-devel] Bug#575382: nagios-plugins-standard: check_linux_raid does not warn if resync is in process

Petter Reinholdtsen pere at hungry.com
Tue Feb 14 20:04:34 UTC 2012


[Christoph Martin]
> check_linux_raid should also warn if a md device is in resync mode and
> not only if in recovory mode.

I suspect this is the wrong conclusion to the problem.  I ran into a
similar issue with my Nagios monitored raid, where one of the disks
failed and the spare were automatically resynced into the RAID.  But I
do not want a warning because of the resync.  I want a warning because
there is a failing disk.

So in this case:

[Jan Wagner]
> md3 : active raid10 sdd4[4](F) sdc4[1] sdb4[5](F) sda4[0]
>       1887974656 blocks 64K chunks 2 near-copies [4/2] [UU__]
>       [==========>..........]  recovery = 50.3% (474987648/943987328) 
> finish=4363639.0min speed=1K/sec

I believe the module should report the devices listed with '(F)' as at
least a warning and preferably a critical issue, and ignore the fact
that a sync/recovery is in progress.

It would also be nice if it would report the disk serial number of the
failing disk, to make it easier to locate the correct disk when
replacing disks.  The serial number can either be discovered using
'hdparm -I /dev/sdd4' (in the example above), or by looking in /sys/.
-- 
Happy hacking
Petter Reinholdtsen





More information about the Pkg-nagios-devel mailing list