Start a Conversation

Unsolved

This post is more than 5 years old

113898

July 15th, 2013 05:00

MD3220 RAID Controller Module wide port has gone from degraded to failed

The MD3220 hosts the shared storage of a Hyper-V-Failover-Cluster.

At weekand in the night the MD3230 suddenly shows following errors:

Datum/Uhrzeit: 13.07.13 23:46:53
Sequenznummer: 4053
Ereignistyp: 282B
Ereigniskategorie: Intern
Priorität: Kritisch
Beschreibung: Expansion enclosure path redundancy lost
Ereignisspezifische Kodes: 0/0/0
Komponententyp: Gehäuse
Komponentenstandort: Gehäuse, 0
Protokolliert von: RAID-Controller-Modul in Steckplatz 1

Rohdaten:
4d 45 4c 48 03 00 00 00 d5 0f 00 00 00 00 00 00
2b 28 4a 01 cd ca e1 51 00 00 00 00 00 00 00 00
00 00 00 00 04 00 00 00 22 00 00 00 22 00 00 00
0a 00 00 00 00 00 00 00 01 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
01 00 00 00 36 00 01 01 08 00 00 00 04 00 00 08
00 00 00 00

Datum/Uhrzeit: 13.07.13 23:46:37
Sequenznummer: 4049
Ereignistyp: 1710
Ereigniskategorie: Fehler
Priorität: Kritisch
Beschreibung: RAID Controller Module wide port has gone from degraded to failed
Ereignisspezifische Kodes: 0/0/0
Komponententyp: RAID-Controller-Modul
Komponentenstandort: Gehäuse, 0, Steckplatz 0
Protokolliert von: RAID-Controller-Modul in Steckplatz 1

Rohdaten:
4d 45 4c 48 03 00 00 00 d1 0f 00 00 00 00 00 00
10 17 18 01 bd ca e1 51 14 00 00 00 00 01 00 00
00 00 00 00 01 00 00 00 22 00 00 00 22 00 00 00
08 00 00 00 00 00 00 00 01 00 00 00 01 00 00 00
0a 00 00 00 01 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
01 00 00 00 00 00 01 02 10 00 00 00 04 00 00 08
00 00 00 00 04 00 26 08 01 00 00 00

Datum/Uhrzeit: 13.07.13 23:46:37
Sequenznummer: 4048
Ereignistyp: 1710
Ereigniskategorie: Fehler
Priorität: Kritisch
Beschreibung: RAID Controller Module wide port has gone from degraded to failed
Ereignisspezifische Kodes: 0/0/0
Komponententyp: RAID-Controller-Modul
Komponentenstandort: Gehäuse, 0, Steckplatz 1
Protokolliert von: RAID-Controller-Modul in Steckplatz 1

Rohdaten:
4d 45 4c 48 03 00 00 00 d0 0f 00 00 00 00 00 00
10 17 18 01 bd ca e1 51 14 00 00 00 00 01 01 00
00 00 00 00 01 00 00 00 22 00 00 00 22 00 00 00
08 00 00 00 00 00 00 00 02 00 00 00 01 00 00 00
0a 00 00 00 01 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
01 00 00 00 00 00 01 02 10 00 00 00 04 00 00 08
00 00 00 00 04 00 26 08 02 00 00 00

But the Modular Disk Storage Manager shows all disks ok.

The eventlog of the cluster nodes shows at the same time a series of errors, warnings and informations like

The driver detected a controller error on \Device\RaidPort1.

admd1:0:0 Path Failed.

A fail-over on \Device\MPIODisk1 occurred.

Device path information for admd1:0:0:0 has been removed per MPIO notification (object count 1).

What happens with the Storage?

(Sorry for the mix of german and english, but the Storage Manager is in german, the server OS is english)

 

685 Posts

July 15th, 2013 11:00

We would need to pull a support bundle from the MDSM Software. Being that the MDSM is German Once I get that from you I may need to tie in a rep from our Germany Support to assist with the language barrier. But let's go ahead and start by getting that support bundle so we can get the ball rolling. If you have any questions feel free to let me know as I would be happy to assist. I will also be sending you a direct email so when you get the support bundle you can send it directly to me.

July 17th, 2013 01:00

I got an answer from the Dell Support. I hope I can translate it correctly.

The error message was caused by the firmware. The controller cache will partly not kept clean synchronous, this caused a reboot of the synchronous controller.

The solution is a firmware update of the controller firmware.

I will do that at the next maintanance day this month and than we will see.

No Events found!

Top