MD Series: Md3xxx How To replace a disk in predictive or impending failure

Summary: This article covers how to safely replace a predictive failure drive in a MD3xxx storage using Modular Disk Storage Management (MDSM).

This article applies to This article does not apply to This article is not tied to any specific product. Not all product versions are identified in this article.

Instructions

Introduction  

This tutorial explains how to replace a disk in predictive or impending failure. A predictive failure is a feature of modern Hard Disc Drives (hard drive) designed to improve RAID reliability. A predictive failure indicates that a hard drive must be replaced before failure occurs. 

 


Cause

During normal read/write operations, an error may occasionally occur on a hard drive. The controller identifies this error and repairs it. These errors are also known as "Bad Blocks". This is why the memory space on a hard drive is usually, slightly larger than specified. This space is used to relocate or repair any bad blocks that occur during normal operations. A predetermined threshold of bad blocks is assigned to an individual hard drive. When this threshold is reached, the controller changes the status of the hard drive to "Predictive Fail". The hard drive remains operational, however, the probability that the hard drive will fail soon.

It is recommended that a hard drive in "Predictive Fail" status should be replaced promptly to maintain the integrity of the RAID volume. To replace the hard drive, it can be removed safely from the RAID volume before physical replacement. Follow the process outlined below to change the hard drive status to offline and safely remove it from the RAID volume.

 


Solution

Note : Before proceeding, Modular Disc Storage Manager (MDSM) must be installed. MDSM can be downloaded from the support site. The system must have access to the storage array.
Follow the below process to offline and safely remove the hard drive from the RAID volume.

  1. Launch MDSM and select the corresponding PowerVault array.
    1. Verify the correct array using the status of the enclosure
    2. If there are no issues with an array, it shows as "Optimal", as seen in Figure 1 below
      1. MDSM Devices view showing Optimal state
      2. Figure 1: MDSM Devices view showing Optimal state
    3. If the array has a hard drive in predictive failure, the Status changes to "Need attention"
  2. Double-click the array to access to the array manager
  3. Verify that there are no other missing or failed drives in the same RAID set
  4. Click Hardware, and then select the hard drive in predictive fail. The status shows as "Need attention"
    1. Hardware section of MDSM
    2. Figure 2: Hardware section of MDSM
  5. Right-click the hard drive and select Advanced, then Fail
    1. Right-click menu showing the Fail option
    2. Figure 3: Right-click menu showing the Fail option
  6. Acknowledge the drive failure operation by typing "Yes"
    1. If there is a spare disk in the array, also known as a "Hot Spare", leave the box "Copy contents of hard drive before failing" checked
    2. The data of the predictive failure disk is copied to the Hot Spare, to avoid any degradation of a RAID
    3. This is shown below in Figure 4
    4. If there is no Hot Spare, clear the "Copy contents of physical disk before failing" box
      1. Do not attempt to copy contents unless there is a Hot Spare available in the array
      2. Attempting this may cause data loss or corruption
    5. Confirm Fail Physical Disk dialogue
    6. Figure 4: Confirm Fail Physical Disk dialogue
  7. If the option to copy the contents is used,
    1. It may take some time before the drive fails and the state changes to "Failed"
  8. The status of the hard drive changes to "Failed" and has a red "X" next to it
  9. It is now safe to physically replace the hard drive

 


Affected Products

MD Series, Dell PowerVault OEM Ready MD34XX and MD38XX, PowerVault MD3200, PowerVault MD3200i, PowerVault MD3220, PowerVault MD3220i, PowerVault MD3260, PowerVault MD3260i, PowerVault MD3400, PowerVault MD3420, PowerVault MD3460, PowerVault MD3600F , PowerVault MD3600i, PowerVault MD3620F, PowerVault MD3620i, PowerVault MD3660f, PowerVault MD3660i, PowerVault MD3800f, PowerVault MD3800i, PowerVault MD3820f, PowerVault MD3820i, PowerVault MD3860f, PowerVault MD3860i ...
Article Properties
Article Number: 000132906
Article Type: How To
Last Modified: 21 Mar 2025
Version:  6
Find answers to your questions from other Dell users
Support Services
Check if your device is covered by Support Services.