MD Series: MD34xx/MD38xx: How to replace a hard drive in predictive failure
Summary: This tutorial explains how to replace a disk in predictive (impending) failure in an MD34xx or MD38xx storage array.
Instructions
Definition:
A disk always has slightly more physical capacity than specified because it reserves spare space for relocating bad blocks. A small number of read/write errors is tolerated. Once a threshold is reached, however, the controller changes the status of the disk to "predictive failure" because the relocation table is full.
The disk continues to work, but the probability that it fails soon is high. Because the disk is still working and is still a full member of the RAID, it has to be taken offline before replacement. This tutorial covers the steps to take the hard disk offline before replacing it.
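As a rough illustration of the definition above, the following Python sketch models a disk whose reported status flips once its spare relocation area is used up. It is a simplified teaching model, not the controller's actual firmware logic: the `Disk` class, the `spare_blocks` and `relocated_blocks` fields, and the status strings are all hypothetical names chosen for this example.

```python
from dataclasses import dataclass


@dataclass
class Disk:
    """Hypothetical model of a disk with a reserved relocation area."""
    spare_blocks: int          # assumed size of the reserved relocation area
    relocated_blocks: int = 0  # bad blocks already remapped into that area

    def record_bad_block(self) -> None:
        """Remap one bad block into the spare area, if room remains."""
        if self.relocated_blocks < self.spare_blocks:
            self.relocated_blocks += 1

    @property
    def status(self) -> str:
        # The disk keeps serving I/O in both states; only the status changes.
        if self.relocated_blocks >= self.spare_blocks:
            return "Predictive failure"
        return "Optimal"


disk = Disk(spare_blocks=3)
for _ in range(3):
    disk.record_bad_block()
print(disk.status)
```

The point of the model is that "predictive failure" describes a disk that still works but has no relocation headroom left, which is why it must be failed manually before it is pulled.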
Prerequisite:
Modular Disk Storage Manager (MDSM) must be installed for this procedure. The management station must be able to access the storage array.
Steps:
Launch MDSM. If there are several PowerVault arrays, select the storage array that reports the predictive failure.
Double-click the array to open the Array Management Window (AMW).
Click the Hardware tab and then select the disk to be taken offline.
Figure 1: MDSM Hardware tab
Right-click the disk and select Advanced, then Fail.
Figure 2: MDSM Hardware menu showing where to manually fail a drive.
A dialog box opens asking you to confirm that the disk should be failed manually. Confirm by typing "Yes".
- If there is a hot spare disk, leave the box "Copy contents of hard drive before failing" checked
- The data on the failing disk is copied to the spare disk, which avoids degrading the RAID
- If there are no hot spare disks, clear the "Copy contents of hard drive before failing" box
The disk icon changes to a red X, and the status changes to "Failed." The last step is to physically replace the drive.
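The manual-fail procedure, including the checkbox choice that depends on whether a hot spare exists, can be summarized as a short checklist generator. This is only a sketch for documentation purposes: `manual_fail_plan` is a hypothetical helper that returns text, and it does not talk to MDSM or to the array.

```python
from typing import List


def manual_fail_plan(hot_spare_available: bool) -> List[str]:
    """Return the operator steps for manually failing a disk (illustrative only)."""
    steps = ["Right-click the disk, select Advanced, then Fail"]
    if hot_spare_available:
        # Data is copied to the hot spare first, so the RAID is not degraded.
        steps.append('Leave "Copy contents of hard drive before failing" checked')
    else:
        # With no spare there is nowhere to copy the data to.
        steps.append('Clear "Copy contents of hard drive before failing"')
    steps.append('Type "Yes" to confirm')
    steps.append('Wait for status "Failed", then physically replace the drive')
    return steps


for number, step in enumerate(manual_fail_plan(hot_spare_available=True), start=1):
    print(f"{number}. {step}")
```

Calling the helper with `hot_spare_available=False` yields the same checklist with the second step changed to clearing the box.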