Start a Conversation

Unsolved

C

7 Posts

242

February 23rd, 2023 07:00

How to determine what is causing a non-critical status on R900?

I had posted about a new SAS drive not working late last week, and so I replaced the drive and its currently Rebuilt, and Online. But the status is showing as non-critical; How can I determine why the drive showing a non-critical status?

The only thing that stands out in the logs is: Correctable memory error rate exceeded for Card D DIMM, and CMOS Battery is dead....

Here are some screenshots of the drive from OpenManage: 

MicrosoftTeams-image.pngMicrosoftTeams-image (1).pngMicrosoftTeams-image (2).png

7 Posts

February 23rd, 2023 11:00

Yeah, this is a new drive unfortunately. Replaced the original drive with the exact same Dell drive...part, model number, size and rpms.

1 Rookie

 • 

52 Posts

February 23rd, 2023 11:00

"Failure Predicted ... Yes".

Look/get/order a replacement drive.

Moderator

 • 

4.2K Posts

February 23rd, 2023 12:00

Hello CTKL,

 

pjwelsh  is correct.  From you images PD1:0:6 shows Failure Predicted=Yes and needs to be replaced.

 

You can see if it is just a reporting error with OMSA

Shut down OMSA

Stop the  services in order listed

Then start services in order listed

 

DSM services:

DSM SA Connection Service

DSM SA Shared Services

DSM SA Data Manager

DSM SA Event Manager

 

View the drive status again

 

 

If it still shows Pred Fail then it will need to be replaced.

 

Make sure you have a validated backup.

 

When we have a predictive fail drive that is still an online member of the array, there are 2 things that should be done before pulling and replacing: Using OpenManage Server Administrator (OMSA) or the controller BIOS.

 

1.Consistency check:

In OMSA: Expand Storage>expand controller. Select ‘Virtual Disk’. Choose ‘Check Consistency’ from dropdown and Execute.

 

2.When completed consistency check then Put drive offline before replacing:

In OMSA: Expand Storage>expand controller>expand connector>enclosure>select physical disks (or array disks). Select the drop down next to hard drive , choose offline and execute.

 

The rebuild should start automatically once the drive is replaced.  If it does not then you can use OMSA or PERC BIOS to set the replacement drive as a hot spare that will make the rebuild begin.

 

3 Apprentice

 • 

482 Posts

February 23rd, 2023 20:00

its a predictive failure, simple solution replace the drive no need to worry.

7 Posts

February 24th, 2023 09:00

Thank you for the insight, I really appreciate it as I'm learning as I go. I'm concerned though if I purchase another hard drive that drive will also generate a predictive fail. The drive that I replace last week, was brand new. I was told to buy the exact same drive and it should work which I did. But I guess that's not the case. 

 

View All

No Events found!

Top