Highlighted
2 Bronze

PE2950 drive suddenly goes missed

Pe2950 running two RAID1 array (disk 0-1 and 2-3) on perc6-i controller.

Suddenly, while running , disk 2 goes into missed state (into array) and is set as foreign.

Restoring disk into array (via bios or perccli) it lasts few days then goes missed again.

Same identical issue if disk is replaced with an identical one ( ST3146356SS Seagate 146Gb SAS)

Any known issue ?

Thank you

 

0 Kudos
8 Replies
Highlighted
Moderator
Moderator

Re: PE2950 drive suddenly goes missed

Hello @Oknet,

 

What firmware version do you have for the PERC controller and for the HDDs? The latest version for HDDs is

HS11.

 

What you mean by "restoring disk into array"? Do you mean to import the foreign configuration?

 

Regards.

logo
 Diego López
 
 Social Media and Communities Professional
 Comunidad Dell | @DellAyudaPro | Eventos en línea
0 Kudos
Highlighted
2 Bronze

Re: PE2950 drive suddenly goes missed

Hi, 

from perccli:

 

FW Package Build = 6.3.1-0003
FW Version = 1.22.32-1371
BIOS Version = 2.04.00
Driver Name = megaraid_sas
Driver Version = 06.805.56.0

 

How to check HDD firmware version and eventually update it ?

 

Once I have find the missed disk marked as foreign, I have delete it from foreign config and readded to array with :

./perccli /c0 /e32 /s2 insert dg=1 array=0 row=0

then started rebuild

Thanks

 

0 Kudos
Highlighted
Moderator
Moderator

Re: PE2950 drive suddenly goes missed

Hello what OS do you have installed in the server?

 

Regards.

logo
 Diego López
 
 Social Media and Communities Professional
 Comunidad Dell | @DellAyudaPro | Eventos en línea
0 Kudos
Highlighted
2 Bronze

Re: PE2950 drive suddenly goes missed

Hi

Sorry for late reply

I'm running ESXi5.5 on that host.

The strange thing is first disk of second raid 1  goes missing after few days :

-----------------------------------------------------------------------------
DG Arr Row EID:Slot DID Type State BT Size PDC PI SED DS3 FSpace TR
-----------------------------------------------------------------------------
1 - - - - RAID1 Dgrd N 136.125 GB dflt N N dflt N N
1 0 - - - RAID1 Dgrd N 136.125 GB dflt N N dflt N N
1 0 0 - - DRIVE Msng - 136.125 GB - - - - - N
1 0 1 32:3 3 DRIVE Onln N 136.125 GB dflt N N dflt - N
-----------------------------------------------------------------------------

 

Other than supposed faulty drive , I've tried other FOUR (!) identical disks

The behavior is always the same, disks missed after few days (2-3-4...)

All are used disks but something make me think about controller issue....

How can I troubleshoot this ?

Can I check each single disk for integrity/functionality ?

Thank you

0 Kudos
Highlighted
Moderator
Moderator

Re: PE2950 drive suddenly goes missed

I would recommend exporting the controller log from the PERC and reviewing it for errors. If multiple drives in that slot are randomly showing as missing, I would suspect the backplane or the cabling, and reviewing the PERC log may help confirm that. I'd go ahead and reseat the backplane cabling, as well as export the log from perccli and see what it says.

 

./perccli /c0 show logfile=log.txt

#Iwork4Dell
0 Kudos
Highlighted
2 Bronze

Re: PE2950 drive suddenly goes missed

In this moment I'm ended in a situation where each of five disks are not seen by controller.

The controller still sees three phisycal devices no matter which one of five spares are inserted, still after power cycling host each time a spare disk is inserted in E32:2 (third slot of six available on my PE2950)....

 

0 Kudos
Highlighted
Moderator
Moderator

Re: PE2950 drive suddenly goes missed

Without that log, there isn't any recommendation I could make beyond creating a new array. There isn't enough information for me to provide further steps.

#Iwork4Dell
0 Kudos
Highlighted
2 Bronze

Re: PE2950 drive suddenly goes missed

Not able to get log in this moment, however it tells no more than a drive is missed but nothing about the cause.

I have opened PE2950 re-seated all connectors relative to controller , disks backplane and sideplane the controller is connected to , nothing seemed faulty however.

Drive 2 is now detected, reinserted into disk group and rebuilt.

It lasts ok from three days, too early to say it's fixed.....

 

0 Kudos