1 Rookie

 • 

3 Posts

November 21st, 2024 11:49

7 ssds in predictive fail

Hi,

I have 8 SSDs in RAID 10.

The system event log recorded one as failed. A few minutes later there was another log entry for a predictive failure on an adjacent SSD. However, the iDRAC GUI and racadm both report all of the other SSDs as predictive fail. I replaced the failed SSD; it rebuilt and is now green. The others are all still in predictive fail. The PERC controller is green.

Any ideas please? I'm thinking this must be a controller problem, as 7 SSDs can't all go into predictive fail simultaneously?

Thanks

Moderator

 • 

4.1K Posts

November 21st, 2024 20:31

Hello,

 

If they all went pred fail that long ago and one finally failed, I would suspect the others may start to fail.

 

Be sure to keep good backups.

 

Are the drives Dell drives? Non-Dell drives may not report correctly.

 

If they are Dell drives then update the firmware from the web page:

https://dell.to/3AUt9s4

 

Then run a Consistency check:

In OpenManage Server Administrator (OMSA): expand Storage, then expand the controller. Select ‘Virtual Disk’, choose ‘Check Consistency’ from the dropdown, and click Execute.

 

If you don't have OMSA, you can start the Consistency check from the PERC BIOS (press <Ctrl+R> during POST).

 

If a drive still shows predictive failure after the firmware updates and the consistency check, that drive needs to be replaced.

 

I'd recommend taking this time to start replacing the predictive-fail drives before a puncture occurs or all the drives fail.

Moderator

 • 

4.1K Posts

November 21st, 2024 16:06

Hello,

 

That is not typical but we could see that if there is a puncture in the array.

 

First make sure you have a good backup.

 

Check the LifeCycle Log and System Event Log for any drive errors, and look especially for punctures.
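If the logs have been exported to plain text, a small script can help pick out the relevant entries. The following is only a minimal sketch: the keyword list, the `find_drive_events` helper, and the sample lines are all hypothetical and made up for illustration; they are not real Dell log messages or a confirmed list of message texts.

```python
import re

# Assumed keywords; adjust to whatever wording your exported logs actually use.
KEYWORDS = ("puncture", "predictive failure", "failed")


def find_drive_events(lines, keywords=KEYWORDS):
    """Return (line_number, text) pairs for lines mentioning any keyword."""
    pattern = re.compile("|".join(re.escape(k) for k in keywords), re.IGNORECASE)
    return [
        (number, line.strip())
        for number, line in enumerate(lines, start=1)
        if pattern.search(line)
    ]


# Hypothetical log lines, invented for this example only.
sample = [
    "entry A: drive in slot 3 reported predictive failure",
    "entry B: fan speed normal",
    "entry C: Puncture detected on virtual disk",
]

for number, text in find_drive_events(sample):
    print(number, text)
```

With a real export, you would read the file's lines and pass them in instead of `sample`. The match is case-insensitive, so "Puncture" and "puncture" are both caught.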

 

Make sure the PERC controller firmware is up to date.

 

Run the built-in diagnostics: press <F10> at boot to enter the LifeCycle Controller, go to Diagnostics, and run them.

 

Let me know how that goes, along with the server model, the PERC controller you have, and any drive errors you see in the logs.

1 Rookie

 • 

3 Posts

November 21st, 2024 17:33

Hi Charles, thanks for your reply.

No punctures in the logs.

On further investigation, all 8 SSDs entered predictive fail in late March/early April, within 23 days of each other. Then, 7 to 8 months later, one failed.

The PERC H730 (Mini) controller firmware is up to date at 25.5.9.0001, as are the BIOS at 2.19.0 and the iDRAC at 2.86.86.86. It's an R630.

In diagnostics, the 7 remaining SSDs all show predictive failure, "incorrect status = 800000000000005D".

Thanks
