Unsolved

Closed

1 Rookie

 • 

37 Posts

433

June 28th, 2023 13:00

A disk has predictive failure, from a raid 5

hi,
My name is Emiliano, I'm from Argentina.

I have a productive R720, in which the Physical Disk 0:1:2 a few months ago had a predictive failure.
The disk was changed for a new one, from Dell, and in the raid console, it was marked as hot spare, so that it would synchronize with the raid. Once it synchronized, it marked failed. Could this be a bug? The disc is new.

Captura de pantalla de 2023-06-28 17-26-27.png

Thank you so much

 

Regards

Moderator

 • 

2.9K Posts

June 29th, 2023 00:00

Hello, it seems that your disk failed after synchronization due to old PERC firmware or a bad block transferred from the previous disk. You can try to assign the disk as a global hot spare again and see if it synchronizes properly this time. If still fails, you can check your PERC firmware version and update it if needed. You can also run a consistency check on the virtual disk to clear the predicted failure flag. (my previous similar thread Solved: R720 iDrac still shows HDD drive error after replacement - Dell Community)

Another possible reason is that you need to have the latest Lifecycle controller OS driver pack installed in your system to configure the RAID properly. You can download the OS driver pack from  support site https://dell.to/3prQCuR and update it through iDRAC GUI or Lifecycle controller GUI.

1 Rookie

 • 

37 Posts

June 29th, 2023 05:00

Hi Erman,

Thank you for your answer. Consistency check can cause data loss?

I'm not a big fan of doing raid tasks on the idrac. Can I do this from the RAID controller console? How would it be?

Regards

 

Moderator

 • 

2.9K Posts

June 29th, 2023 05:00

Consistency checks are performed by RAID controllers in the background to ensure that the data on the disks in a RAID array is correct and has not been corrupted. These checks are also known as patrol reads or scrubbing. Consistency checks themselves should not cause data loss.

1 Rookie

 • 

37 Posts

June 29th, 2023 10:00

Hi,
Is it necessary to update the firmware?
From what I saw, it has the latest PERC firmware, and IDRAC's is one of the latest.

PERC H710 Mini Firmware 21.3.5-0002
Lifecycle Controller Firmware 2.63.60.62
BIOS Version 2.8.0

Do you want me to upload the file support assistance?

 

Regards

Moderator

 • 

4.7K Posts

June 29th, 2023 11:00

Hello emi87,

 

Have you verified you have a valid backup?

 

Sometimes a controller holds on to a Pred Fail marking even if it has rebuilt into the array.

Have you performed a flea power drain and check if it still marked Pred Fail?

drain flea power (shut down, disconnect power cables and Network cables, hold in power button 20 seconds with cords removed).  After flea power drain, system has to set for 3 minutes for DRAC to reset without any power plugged in, then plug in NIC and power but wait 2 minutes before power on to give DRAC time to initialize.

 

 

Have you run the Consistency check yet? 

You can do it with OpenManage Server Administrator (OMSA) if you have it installed or in the Controller BIOS.

If the check consistency completes without errors, you can safely assume that the array is now healthy.

Consistency check does not cause data loss. It may reveal data loss if it cannot repair the inconsistency.

 

If it remains Pred Fail, it will need to be replaced.

When we have a predictive fail drive that is still an online member of the array, there are 2 things that need to be done, after a file level backup and before pulling and replacing: using OpenManage Server Administrator (OMSA)or Controller BIOS

 

1.Consistency check:

In OMSA: Expand Storage>expand controller. Select ‘Virtual Disk’. Choose ‘Check Consistency’ from dropdown and Execute.

 

2.When completed consistency check then Put drive offline before replacing:

In OMSA: Expand Storage>expand controller>expand connector>enclosure>select physical disks (or array disks). Select the drop down next to hard drive , choose offline and execute.

 

Your firmware:

PERC H710 Mini Firmware 21.3.5-0002  (is current)

 

You are a little behind on these so you can update them in a maintenance window.

Lifecycle Controller Firmware 2.63.60.62

iDRAC,LifeCycle Controller v. 2.65.65.65

https://dell.to/46lepNA

BIOS Version 2.8.0

BIOS 2.9.0

https://dell.to/46v4hlC

 

Top