Critical Hard Drive Failures … ?

The sound you just heard was me wailing in disbelief.

S.M.A.R.T Errors on /dev/hda
From Command: /usr/sbin/smartctl -q errorsonly -H -l selftest -l error /dev/hda
ATA Error Count: 302 (device log contains only the most recent five errors)
Error 302 occurred at disk power-on lifetime: 10418 hours (434 days + 2 hours)
Error 301 occurred at disk power-on lifetime: 10418 hours (434 days + 2 hours)
Error 300 occurred at disk power-on lifetime: 10294 hours (428 days + 22 hours)
Error 299 occurred at disk power-on lifetime: 10294 hours (428 days + 22 hours)
Error 298 occurred at disk power-on lifetime: 10150 hours (422 days + 22 hours)

Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Extended offline Completed: read failure 80% 19242 48683947
—-END /dev/hda–

S.M.A.R.T Errors on /dev/hdb
From Command: /usr/sbin/smartctl -q errorsonly -H -l selftest -l error /dev/hdb
ATA Error Count: 28 (device log contains only the most recent five errors)
Error 28 occurred at disk power-on lifetime: 29912 hours (1246 days + 8 hours)
Error 27 occurred at disk power-on lifetime: 29912 hours (1246 days + 8 hours)
Error 26 occurred at disk power-on lifetime: 29912 hours (1246 days + 8 hours)
Error 25 occurred at disk power-on lifetime: 10444 hours (435 days + 4 hours)
Error 24 occurred at disk power-on lifetime: 10444 hours (435 days + 4 hours)

Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Extended offline Completed: read failure 80% 19274 52155480
—-END /dev/hdb–

That could be nothing. It could be everything. It just … stunned me. The datacenter is going to run some diagnostics. We’ll see. Hopefully it’s nothing. But … gah.

5 Responses to “Critical Hard Drive Failures … ?”

  1. Rae says:

    DO NOT WANT!

  2. Jason says:

    One of the most common times for drive errors is when it’s first starting to be used.

    –Jason (sorry dude)

  3. Hard Drive Failure Update…

    [Note: Where I used to update the Weblog entries, now I will do a new one to push out updates via Twitter.]
    The investigation of this morning’s apparent hard drive failures was that there are 21 blocks bad on /dev/hda [our main drive] and 41 bad …

Leave a Reply