Start a Conversation

Unsolved

K

1 Rookie

 • 

3 Posts

23

November 6th, 2023 19:35

Failing PERC H710 Mini?

Hello,

I have a PowerEdge R620 with a PERC H710 Mini configured for one RAID 10 array (4 2TB non-Dell SATA + 1 spare).  I originally noticed an issue when multiple esxi VMs would have repeated chkdsk or fsck failures, sometimes back-to-back.

I did one run of memtest that passed.  I deleted and disabled CACHECADE, still errors.  I set No Read Ahead and Write Through, still errors.  Long history of patrol reads are all passes.  I haven't done any consistency checks in the past, but I did one yesterday that found 10 inconsistencies and resulted in "The Check Consistency made corrections and completed".  I started another consistency check today that is about 25% done, and already 3x "The Check Consistency found inconsistent parity data. Data redundancy may be lost".

Am I correct that this looks like a failing PERC?  Could it be anything else?  Any suggestions that could narrow down the issue?

Moderator

 • 

3.5K Posts

November 7th, 2023 05:51

Hello thanks for choosing Dell. I wouldn't jump into the controller fault- I would first suspect the OS or the disk issue.
If your system is still in warrenty, it's best you send the TTY log to the team
https://dell.to/40oARD7
You can also try contacting the OS provider or a VM ware team.
Respectfully,

1 Rookie

 • 

3 Posts

November 7th, 2023 14:43

@DELL-Young E​ Hello and thanks for your reply.  I do not have a warranty from Dell or VMWare.

The firmware on the PERC is the latest version from Dell.  The OS is the latest DellEMC Customized version of VMware ESXi Image certified to run on the hardware.

The disks are Seagate Enterprise drives.  All historical scheduled patrol reads passed.  Additionally, there are no errors reported for these drives in lsiexport.log.

Can you suggest any other logging to review or diagnostics to run to help pinpoint the issue? 

Moderator

 • 

3.3K Posts

07-11-2023 03:49 PM

If there is some issue on the hard drives then PERC normally put the hard drive as failed.

Anyway you can run a diagnostic tool on the hard drives.

It could be a logical problem of the array, i would start again some check consistency but if it still errors, you can try to create a new virtual disk as test and check if there is error on it.

If not then it is probably a logical issue on the volume and i suggest to backup and recreate raid volume.

Thanks

DELL- Marco B

Social Media and Communities Professional

Dell Technologies | Enterprise Support Services

#IWork4Dell

Did I answer your query? Please click on ‘Mark as Accepted Answer’. ‘Thumbs up’ the posts you like!

1 Rookie

 • 

3 Posts

November 7th, 2023 16:06

Hello Marco, thanks for your reply.

you can try to create a new virtual disk as test and check if there is error on it.

Just to make sure I understand, you're recommending I create a separate and new virtual disk on separate and new drives as a way to remove my existing drives from the equation and potentially isolate the problem to the PERC?

Moderator

 • 

3.1K Posts

07-11-2023 04:58 PM

Hello,

 

I think Marco may be saying make a RAID0 of the failed drive, if there is a failed drive, and test. I'm not sure and he is off shift at the moment.

 

Next I would recommend enter the LifeCycle Controller <F10>

 

*Review the LifeCycle Log for any storage errors.

 

*Then run the Hardware Diagnostics in the LCC.   Note any error messages and continue testing.

 

 

You indicated the controller is up to date. I would also recommend the hard drive firmware be brought up to date if they aren't already.   R620 support page : https://dell.to/476Vru7

 

Check that you have a good backup in case you there is logical corruption so you can recreate your array and restore data if necessary.

DELL-Charles R

Social Media and Communities Professional
Dell Technologies | Enterprise Support Services
#IWork4Dell

Did I answer your query? Please click on ‘Accept as Solution’. ‘Thumbs up’ the posts you like!

No Events found!

Top