[colug-432] md (linux drive mirroring)

Judd Montgomery judd at engineer.com
Wed Aug 21 18:13:02 EDT 2019


Just for you I pulled a powered up drive out of the USB dock and risked
my data.

I got emailed instantly.

This is from /var/log/kern.log

Aug 21 17:41:02 mach1 kernel: [255336.724544] sd 6:0:0:0: [sde]
Synchronizing SCSI cache
Aug 21 17:41:02 mach1 kernel: [255336.725244] md: super_written gets
error=10
Aug 21 17:41:02 mach1 kernel: [255336.725246] md/raid10:md125: Disk
failure on sde1, disabling device.
Aug 21 17:41:02 mach1 kernel: [255336.725246] md/raid10:md125: Operation
continuing on 3 devices.
Aug 21 17:41:02 mach1 kernel: [255336.725332] md: super_written gets
error=10
Aug 21 17:41:02 mach1 kernel: [255336.725333] md/raid10:md124: Disk
failure on sde2, disabling device.
Aug 21 17:41:02 mach1 kernel: [255336.725333] md/raid10:md124: Operation
continuing on 3 devices.
Aug 21 17:41:02 mach1 kernel: [255336.812856] sd 6:0:0:1: [sdf] tag#25
FAILED Result: hostbyte=DID_ERROR driverbyte=DRIVER_OK
Aug 21 17:41:02 mach1 kernel: [255336.812858] sd 6:0:0:1: [sdf] tag#25
CDB: Synchronize Cache(10) 35 00 00 00 00 00 00 00 00 00
Aug 21 17:41:02 mach1 kernel: [255336.812875] print_req_error: I/O
error, dev sdf, sector 2064 flags 20801
Aug 21 17:41:02 mach1 kernel: [255336.812878] md: super_written gets
error=10
Aug 21 17:41:02 mach1 kernel: [255336.812880] md/raid10:md125: Disk
failure on sdf1, disabling device.
Aug 21 17:41:02 mach1 kernel: [255336.812880] md/raid10:md125: Operation
continuing on 2 devices.
Aug 21 17:41:02 mach1 kernel: [255336.912850] sd 6:0:0:1: [sdf] tag#27
FAILED Result: hostbyte=DID_ERROR driverbyte=DRIVER_OK
Aug 21 17:41:02 mach1 kernel: [255336.912852] sd 6:0:0:1: [sdf] tag#27
CDB: Synchronize Cache(10) 35 00 00 00 00 00 00 00 00 00
Aug 21 17:41:02 mach1 kernel: [255336.912855] print_req_error: I/O
error, dev sdf, sector 1953515536 flags 20801
Aug 21 17:41:02 mach1 kernel: [255336.912857] md: super_written gets
error=10
Aug 21 17:41:02 mach1 kernel: [255336.912859] md/raid10:md124: Disk
failure on sdf2, disabling device.
Aug 21 17:41:02 mach1 kernel: [255336.912859] md/raid10:md124: Operation
continuing on 2 devices.

This is from syslog

Aug 21 17:41:02 mach1 kernel: [255336.720913] usb 2-7: USB disconnect,
device number 14
Aug 21 17:41:02 mach1 kernel: [255336.724544] sd 6:0:0:0: [sde]
Synchronizing SCSI cache
Aug 21 17:41:02 mach1 kernel: [255336.725244] md: super_written gets
error=10
Aug 21 17:41:02 mach1 kernel: [255336.725246] md/raid10:md125: Disk
failure on sde1, disabling device.
Aug 21 17:41:02 mach1 kernel: [255336.725246] md/raid10:md125: Operation
continuing on 3 devices.
Aug 21 17:41:02 mach1 kernel: [255336.725332] md: super_written gets
error=10
Aug 21 17:41:02 mach1 kernel: [255336.725333] md/raid10:md124: Disk
failure on sde2, disabling device.
Aug 21 17:41:02 mach1 kernel: [255336.725333] md/raid10:md124: Operation
continuing on 3 devices.
Aug 21 17:41:02 mach1 kernel: [255336.812856] sd 6:0:0:1: [sdf] tag#25
FAILED Result: hostbyte=DID_ERROR driverbyte=DRIVER_OK
Aug 21 17:41:02 mach1 kernel: [255336.812858] sd 6:0:0:1: [sdf] tag#25
CDB: Synchronize Cache(10) 35 00 00 00 00 00 00 00 00 00
Aug 21 17:41:02 mach1 kernel: [255336.812875] print_req_error: I/O
error, dev sdf, sector 2064 flags 20801
Aug 21 17:41:02 mach1 kernel: [255336.812878] md: super_written gets
error=10
Aug 21 17:41:02 mach1 kernel: [255336.812880] md/raid10:md125: Disk
failure on sdf1, disabling device.
Aug 21 17:41:02 mach1 kernel: [255336.812880] md/raid10:md125: Operation
continuing on 2 devices.
Aug 21 17:41:02 mach1 kernel: [255336.912850] sd 6:0:0:1: [sdf] tag#27
FAILED Result: hostbyte=DID_ERROR driverbyte=DRIVER_OK
Aug 21 17:41:02 mach1 kernel: [255336.912852] sd 6:0:0:1: [sdf] tag#27
CDB: Synchronize Cache(10) 35 00 00 00 00 00 00 00 00 00
Aug 21 17:41:02 mach1 kernel: [255336.912855] print_req_error: I/O
error, dev sdf, sector 1953515536 flags 20801
Aug 21 17:41:02 mach1 kernel: [255336.912857] md: super_written gets
error=10
Aug 21 17:41:02 mach1 kernel: [255336.912859] md/raid10:md124: Disk
failure on sdf2, disabling device.
Aug 21 17:41:02 mach1 kernel: [255336.912859] md/raid10:md124: Operation
continuing on 2 devices.

Before failure:

$ cat /proc/mdstat
Personalities : [raid0] [linear] [multipath] [raid1] [raid6] [raid5]
[raid4] [raid10]
md124 : active raid10 sdf2[5] sde2[4] sdc2[6] sdd2[7]
       1953251328 blocks super 1.2 512K chunks 2 near-copies [4/4] [UUUU]
       bitmap: 0/15 pages [0KB], 65536KB chunk

md125 : active raid10 sdf1[3] sde1[1] sdd1[5] sdc1[4]
       1953251328 blocks super 1.2 512K chunks 2 near-copies [4/4] [UUUU]
       bitmap: 0/15 pages [0KB], 65536KB chunk

unused devices: <none>

After failure

$ cat /proc/mdstat
Personalities : [raid0] [linear] [multipath] [raid1] [raid6] [raid5]
[raid4] [raid10]
md124 : active (auto-read-only) raid10 sdc2[6] sdd2[7]
       1953251328 blocks super 1.2 512K chunks 2 near-copies [4/2] [U_U_]
       bitmap: 0/15 pages [0KB], 65536KB chunk

md125 : active (auto-read-only) raid10 sdd1[5] sdc1[4]
       1953251328 blocks super 1.2 512K chunks 2 near-copies [4/2] [U_U_]
       bitmap: 0/15 pages [0KB], 65536KB chunk

unused devices: <none>


On 8/21/19 4:25 PM, Jeff Frontz wrote:
>
> Thanks, Judd -- when the raid failed, were there any syslog/journald
> messages indicating such?
>
>
> On Wed, Aug 21, 2019 at 2:41 PM Judd Montgomery <judd at engineer.com
> <mailto:judd at engineer.com>> wrote:
>
>      I setup md with raid-0, 5 and 10 (tried them all, just playing)
>     with 4 drives.  It would occasionally get corrupted.  I tracked it
>     down to when the 2 docks were turned on in a certain order the
>     kernel would bounce one of them and cause the raid to fail.
>
>
> _______________________________________________
> colug-432 mailing list
> colug-432 at colug.net
> http://lists.colug.net/mailman/listinfo/colug-432
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.colug.net/pipermail/colug-432/attachments/20190821/a0949eb5/attachment.html 


More information about the colug-432 mailing list