This post is more than 5 years old

6 Posts

158230

November 27th, 2013 09:00

MD1000 Array disk warning. Command timeout on physical disk

Hi All,

I am getting numerous "Array disk warning. Command timeout on physical disk" alerts on a disk in our MD1000, although the disk reports failure predicted as NO...

Full Error:
Date:11/27/2013, Time:14:17:42, Severity:Warning, Message:Alert message ID: 2405, Array disk warning. Command timeout on physical disk, Controller 0, Connector 0, Physical Disk 0:0:8 [Event Code:2405]

Disk Details:

ID 0:0:8
Status OK
Name Physical Disk 0:0:8
State Online
Bus Protocol SAS
Media HDD
Revision MS0C
Failure Predicted No
Capacity 931.00GB
Used RAID Disk Space 931.00GB
Available RAID Disk Space 0.00GB
Hot Spare No
Vendor ID DELL(tm)
Product ID ST31000640SS
Serial No. 9QJ52QM2
Part Number TH0CP4642123398O00VZA00
Manufacture Day 02
Manufacture Week 35
Manufacture Year 2009
SAS Address 5000C5000D7E6059

Now for the noob questions - I've looked but can't find a satisfactory answer...

I presume my best option is to replace the disk (I have a replacement available), however as its part of a RAID 5 virtual disk and reporting as OK will I need to make the disk offline first before replacing?

If so, will this then trigger the use of the global hot spare or will it just allow me to replace the "faulty" drive with the new one and rebuild that in to the virtual disk?

Thanks for your help.

685 Posts

November 28th, 2013 05:00

Yes that is the correct process. When you offline the disk, the hotspare in the RAID array will start to rebuild the rebuild process. Once the drive completes the rebuild and you put the new drive in the array the new drive will then become hotspare. Is this directly connected to server or is it behind an MD3000?

685 Posts

November 27th, 2013 10:00

kipper75,

As soon a drive goes into a Failed state the hotspare will then start to rebuild into the array. Then when the replacement drive is put into the server it will take over as the hotspare. Please let me know if you have any other questions.

6 Posts

November 28th, 2013 00:00

Thanks Kenny K, however...

The disk has been like this for a while now and hasn't failed and failure isn't predicted. The alerts are coming in at least hourly and I'm concerned this will be affecting performance. I'd like to deal with the situation proactively before the drive fails and wondered what the right procedure for this would be?

I presume the right process would be: Offline disk --> remove from array --> replace with new --> online disk?? --> RAID rebuild (automatically??) but want to be sure so as not to lose data...

Thanks again for the help.

6 Posts

November 28th, 2013 06:00

Thanks again Kenny,

Its directly connected to a server. There are three separate virtual disks created from the 15 physical disks. Two 3 disk RAID 5 and an 8 Disk RAID 5 with a hot spare. The failed drive is in the 8 disk array

last question; if I do the swap straight away will that prevent the current hot spare being used (not that it matters, just wondered?)

685 Posts

November 28th, 2013 14:00

As soon as you pull out the drive in question the hotspare will start the rebuild. Are you looking for a way to make the hotspare not rebuild? Only way to do that would be to remove the hotspare so it can't rebuild and then swap the drive in question. Let the rebuild complete then add the hotspare back in to the array.

No Events found!

Top