Overeager head parking? On Feb 3, 2015 6:39 PM, "Mark Mitchell" <mark.russel.mitchell at gmail.com> wrote: > I'm running my first RAID array in a machine I built just short of a > year ago. I'm getting repeated messages in kern.log about ata resets > on 2 ata channels. > > I took one of the affected drives out of the array, and ran a smart > long test on them (smart.sdd.txt, attached). It shows a head flying > time of 6912h+43m+51.802s (around 288 days). > > All of the drives on the system are showing pre-fail and OldAge in the > smart reports. I'm finding this difficult to believe, all of them > except sda are only about a year old. > > Do I really have to go out and buy a bunch of new 3TB drives? > > Here are some representative errors from kern.log; > > ==> /var/log/kern.log <== > Feb 3 18:31:46 home-desktop kernel: [611894.092255] ata5.00: > exception Emask 0x10 SAct 0x40000001 SErr 0x10200 action 0xe frozen > Feb 3 18:31:46 home-desktop kernel: [611894.092259] ata5.00: irq_stat > 0x00400000, PHY RDY changed > Feb 3 18:31:46 home-desktop kernel: [611894.092262] ata5: SError: { > Persist PHYRdyChg } > Feb 3 18:31:46 home-desktop kernel: [611894.092265] ata5.00: failed > command: READ FPDMA QUEUED > Feb 3 18:31:46 home-desktop kernel: [611894.092269] ata5.00: cmd > 60/a0:00:22:c0:0a/00:00:09:00:00/40 tag 0 ncq 81920 in > Feb 3 18:31:46 home-desktop kernel: [611894.092269] res > 40/00:00:22:c0:0a/00:00:09:00:00/40 Emask 0x10 (ATA bus error) > Feb 3 18:31:46 home-desktop kernel: [611894.092272] ata5.00: status: { > DRDY } > Feb 3 18:31:46 home-desktop kernel: [611894.092274] ata5.00: failed > command: READ FPDMA QUEUED > Feb 3 18:31:46 home-desktop kernel: [611894.092278] ata5.00: cmd > 60/08:f0:72:f9:66/02:00:08:00:00/40 tag 30 ncq 266240 in > Feb 3 18:31:46 home-desktop kernel: [611894.092278] res > 40/00:00:22:c0:0a/00:00:09:00:00/40 Emask 0x10 (ATA bus error) > Feb 3 18:31:46 home-desktop kernel: [611894.092281] ata5.00: status: { > DRDY } > Feb 3 18:31:46 home-desktop kernel: [611894.092285] ata5: hard resetting > link > Feb 3 18:31:51 home-desktop kernel: [611899.409269] ata5: SATA link > up 1.5 Gbps (SStatus 113 SControl 310) > Feb 3 18:31:51 home-desktop kernel: [611899.435209] ata5.00: > configured for UDMA/33 > Feb 3 18:31:51 home-desktop kernel: [611899.449242] ata5: EH complete > Feb 3 18:32:17 home-desktop kernel: [611925.496050] ata6: exception > Emask 0x10 SAct 0x0 SErr 0x10002 action 0xe frozen > Feb 3 18:32:17 home-desktop kernel: [611925.496054] ata6: irq_stat > 0x00400000, PHY RDY changed > Feb 3 18:32:17 home-desktop kernel: [611925.496057] ata6: SError: { > RecovComm PHYRdyChg } > Feb 3 18:32:17 home-desktop kernel: [611925.496061] ata6: hard resetting > link > Feb 3 18:32:22 home-desktop kernel: [611930.406105] ata5: exception > Emask 0x10 SAct 0x0 SErr 0x10200 action 0xe frozen > Feb 3 18:32:22 home-desktop kernel: [611930.406109] ata5: irq_stat > 0x00400000, PHY RDY changed > Feb 3 18:32:22 home-desktop kernel: [611930.406111] ata5: SError: { > Persist PHYRdyChg } > Feb 3 18:32:22 home-desktop kernel: [611930.406116] ata5: hard resetting > link > Feb 3 18:32:24 home-desktop kernel: [611932.038938] ata6: SATA link > up 1.5 Gbps (SStatus 113 SControl 310) > Feb 3 18:32:28 home-desktop kernel: [611935.720865] ata5: SATA link > up 1.5 Gbps (SStatus 113 SControl 310) > Feb 3 18:32:28 home-desktop kernel: [611935.739014] ata5.00: > configured for UDMA/33 > Feb 3 18:32:28 home-desktop kernel: [611935.752837] ata5: EH complete > Feb 3 18:32:29 home-desktop kernel: [611937.036124] ata6.00: qc > timeout (cmd 0xec) > Feb 3 18:32:29 home-desktop kernel: [611937.036135] ata6.00: failed > to IDENTIFY (I/O error, err_mask=0x4) > Feb 3 18:32:29 home-desktop kernel: [611937.036137] ata6.00: > revalidation failed (errno=-5) > Feb 3 18:32:29 home-desktop kernel: [611937.036141] ata6: hard resetting > link > Feb 3 18:32:30 home-desktop kernel: [611937.527854] ata6: SATA link > up 1.5 Gbps (SStatus 113 SControl 310) > Feb 3 18:32:30 home-desktop kernel: [611937.528629] ata6.00: supports > DRM functions and may not be fully accessible > Feb 3 18:32:30 home-desktop kernel: [611937.529644] ata6.00: supports > DRM functions and may not be fully accessible > Feb 3 18:32:30 home-desktop kernel: [611937.529824] ata6.00: > configured for UDMA/33 > Feb 3 18:32:30 home-desktop kernel: [611937.529997] ata6: EH complete > > Here's my drive layout; > mark at home-desktop:~$ sudo lsblk > NAME MAJ:MIN RM SIZE RO TYPE MOUNTPOINT > sda 8:0 0 931.5G 0 disk > ├─sda1 8:1 0 37M 0 part /boot/efi > ├─sda2 8:2 0 37.3G 0 part [SWAP] > ├─sda3 8:3 0 860.8G 0 part /home > └─sda4 8:4 0 33.5G 0 part / > sdb 8:16 0 2.7T 0 disk > └─sdb1 8:17 0 2.7T 0 part > └─md0 9:0 0 8.2T 0 raid5 > └─md0p1 259:0 0 8.2T 0 md /srv/media > sdc 8:32 0 2.7T 0 disk > └─sdc1 8:33 0 2.7T 0 part > └─md0 9:0 0 8.2T 0 raid5 > └─md0p1 259:0 0 8.2T 0 md /srv/media > sdd 8:48 0 2.7T 0 disk > └─sdd1 8:49 0 2.7T 0 part > └─md0 9:0 0 8.2T 0 raid5 > └─md0p1 259:0 0 8.2T 0 md /srv/media > sde 8:64 0 2.7T 0 disk > └─sde1 8:65 0 2.7T 0 part > └─md0 9:0 0 8.2T 0 raid5 > └─md0p1 259:0 0 8.2T 0 md /srv/media > sr0 11:0 1 4.3G 0 rom > > _______________________________________________ > TCLUG Mailing List - Minneapolis/St. Paul, Minnesota > tclug-list at mn-linux.org > http://mailman.mn-linux.org/mailman/listinfo/tclug-list > > -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://mailman.mn-linux.org/pipermail/tclug-list/attachments/20150203/9afa6dd2/attachment-0001.html>