March 31st, 2017 01:00

MD3420 Error: RAID Controller Module Removed or Replaced

The MD3420 hosts the shared storage of a VMware failover cluster.

The hosts frequently lose their paths to the storage. At the same time, the event log of the cluster nodes shows a series of errors (storage paths lost).

Date / Time: 30/03/17 14:27:53
Sequence number: 4294
Event Type: 1712
Event Category: Internal
Priority: Informative
An event should be checked: false
Event Send Alert: False
Visibility of the event: true
Description: The RAID controller module wide port is in the optimum state
Event-specific codes: 0/0/0
Component Type: RAID Controller Module
Component slot: Housing 0, Housing 1
Logged by: RAID Controller Module in Slot 1

Raw data:
4d 45 4c 48 03 00 00 00 c6 10 00 00 00 00 00 00
12 17 48 00 a9 dd dc 58 14 00 00 00 00 01 01 00
00 00 00 00 04 00 00 00 22 00 00 00 22 00 00 00
08 00 00 00 00 00 00 00 02 00 00 00 01 00 00 00
0a 00 00 00 01 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
01 00 00 00 00 00 01 03 30 00 00 00 04 00 00 08
00 00 00 00 04 00 26 88 02 00 00 00 1c 00 61 87
50 0a 09 84 ac d1 75 bf 50 0a 09 84 ac d1 75 00
18 04 04 01 01 00 00 00 03 00 00 00


Date / Time: 30/03/17 14:27:31
Sequence number: 4292
Event Type: 400B
Event Category: Internal
Priority: Informative
An event should be checked: false
Event Send Alert: False
Visibility of the event: true
Description: Removed or Replaced RAID Controller Module
Event-specific codes: 0/0/0
Component Type: RAID Controller Module
Component slot: Housing 0, Housing 1
Logged by: RAID Controller Module in Slot 1

Raw data:
4d 45 4c 48 03 00 00 00 c4 10 00 00 00 00 00 00
0b 40 48 00 93 dd dc 58 00 00 00 00 00 00 00 00
00 00 00 00 04 00 00 00 22 00 00 00 22 00 00 00
08 00 00 00 00 00 00 00 02 00 00 00 01 00 00 00
0a 00 00 00 01 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
01 00 00 00 00 00 01 00 00 00 00 00


Date / Time: 30/03/17 14:27:29
Sequence number: 4290
Event Type: 2606
Event Category: Internal
Priority: Informative
An event should be checked: false
Event Send Alert: False
Visibility of the event: true
Description: Start-of-day routine started
Event-specific codes: 0/0/0
Component Type: RAID Controller Module
Component slot: Housing 0, Housing 1
Logged by: RAID Controller Module in Slot 1

Raw data:
4d 45 4c 48 03 00 00 00 c2 10 00 00 00 00 00 00
06 26 48 10 91 dd dc 58 00 00 00 00 00 00 00 00
00 00 00 00 04 00 00 00 22 00 00 00 22 00 00 00
08 00 00 00 00 00 00 00 02 00 00 00 01 00 00 00
0a 00 00 00 01 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
01 00 00 00 00 00 01 01 08 00 00 00 04 00 23 08
01 00 00 00
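
The raw data blocks seem to repeat the header fields of each event in binary form. Here is a minimal Python sketch for decoding them. The layout is an assumption inferred only from the three records in this post, not from Dell documentation: an ASCII signature at offset 0, a little-endian sequence number at offset 8, the event type in the low 16 bits of the word at offset 16, and a Unix timestamp at offset 20. The function name `decode_mel_header` is hypothetical.

```python
import struct
from datetime import datetime, timezone


def decode_mel_header(raw_hex: str) -> dict:
    """Decode the leading fields of one raw event record.

    The offsets used here are inferred from the records in this thread,
    not taken from vendor documentation.
    """
    # Drop all whitespace so multi-line hex dumps can be pasted directly.
    data = bytes.fromhex("".join(raw_hex.split()))
    signature = data[0:4].decode("ascii")
    (sequence,) = struct.unpack_from("<I", data, 8)     # little-endian uint32
    (event_word,) = struct.unpack_from("<I", data, 16)
    (timestamp,) = struct.unpack_from("<I", data, 20)   # seconds since the Unix epoch
    return {
        "signature": signature,
        "sequence": sequence,
        "event_type": f"{event_word & 0xFFFF:04X}",     # low 16 bits only
        "utc_time": datetime.fromtimestamp(timestamp, tz=timezone.utc),
    }


# First two rows of the raw data for sequence number 4294.
header = decode_mel_header(
    "4d 45 4c 48 03 00 00 00 c6 10 00 00 00 00 00 00 "
    "12 17 48 00 a9 dd dc 58 14 00 00 00 00 01 01 00"
)
print(header["sequence"])    # 4294
print(header["event_type"])  # 1712
print(header["utc_time"])    # 2017-03-30 10:27:53+00:00
```

Under these assumptions the decoded values agree with the text fields of the same event. The timestamp comes out four hours earlier than the logged 14:27:53, which would fit an array clock set to UTC+4.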


However, Modular Disk Storage Manager shows all disks as OK.

I updated the controller firmware and NVSRAM to Dell PowerVault MD 34/38 Series Storage Controller Firmware version 08.25.09.61.
What is happening with the storage?

Moderator

8.5K Posts

April 3rd, 2017 12:00

Hi,

Are you still having the connection issues after the firmware update? How long are the cables? 

15 Posts

April 5th, 2017 05:00

Hi,

The cables are new; they are 4 months old.
I updated the firmware a month ago, and the problems have occurred again.

15 Posts

April 5th, 2017 05:00

We are still facing the problem after the firmware update. New SAS cables (2 m) are in place.

The hosts frequently lose their paths to the storage. The logs do not show us where this problem comes from.

We frequently see the event "Removed or Replaced RAID Controller Module" (for all controllers) without any intervention on our part.

The ESXi and MD3420 firmware are up to date.

15 Posts

April 11th, 2017 07:00

How can you help us? We are afraid that the problem will recur, and our infrastructure is in production.

Moderator

6.9K Posts

April 11th, 2017 10:00

Hello tcrescence,

Wide port errors are generally associated with a controller reset. It may be that the controller reset during its boot sequence after the battery replacement. This is not unusual when changes are made. The controller will see the changes, commit them to running memory, then reboot to commit them to permanent memory.

What you need to do is look at the event log to see if there are corresponding wide port optimal messages. You might also look for Start of Day Routine start and completed messages. If those are all completing, then you are good to go, as there is no error. If you have a support bundle, we can review it to see what the errors are.
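
As a rough illustration of that check, here is a minimal Python sketch that pairs each "Removed or Replaced" event with a later wide port optimum message. It assumes the events have already been copied out of the log as (time, description) pairs. The function names and the five-minute window are hypothetical choices for this example, not anything defined by the storage manager.

```python
from datetime import datetime, timedelta


def parse_time(text: str) -> datetime:
    """Parse the day/month/year timestamps used in the event log above."""
    return datetime.strptime(text, "%d/%m/%y %H:%M:%S")


def check_resets(events, window=timedelta(minutes=5)):
    """Return (reset_time, recovered) for each 'Removed or Replaced' event.

    events: iterable of (datetime, description) pairs, in any order.
    recovered is True when a wide port optimum message was logged
    within `window` after the reset (the window length is an assumption).
    """
    events = sorted(events)
    results = []
    for when, description in events:
        if "Removed or Replaced" not in description:
            continue
        recovered = any(
            "wide port" in d and "optimum" in d and when <= t <= when + window
            for t, d in events
        )
        results.append((when, recovered))
    return results


# The three events quoted in the first post.
log = [
    (parse_time("30/03/17 14:27:53"),
     "The RAID controller module wide port is in the optimum state"),
    (parse_time("30/03/17 14:27:31"),
     "Removed or Replaced RAID Controller Module"),
    (parse_time("30/03/17 14:27:29"),
     "Routine Beginning of day launched"),
]
for when, recovered in check_resets(log):
    print(when, "recovered" if recovered else "no wide port optimum message found")
```

For the three events quoted in the first post, the single reset at 14:27:31 is followed by a wide port optimum message 22 seconds later, so this sketch would report it as recovered. That only confirms the controller came back up; it does not explain why it reset.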

Please let us know if you have any other questions.

15 Posts

April 12th, 2017 00:00

I am sending you the logs for your analysis.
Sorry, the MD3420 storage array is configured in French.
The document should be read starting from the bottom.

[View:/cfs-file/__key/communityserver-discussions-components-files/3412/log_5F00_md3420.log:0:0]

Moderator

6.9K Posts

April 12th, 2017 14:00

Hello tcrescence,

What I need is a support bundle from your MD3420. You can gather a support bundle by going to the tools tab and selecting gather support logs. Once you have the logs, it would be great if you could email them to me so that I can review them. I will send an email to the address in your profile, and you can reply to it with the logs.

Please let us know if you have any other questions.

15 Posts

April 13th, 2017 02:00

Hello,

Please find the MD3420 support data attached.

Could you send me an email at tesis-exploitation@nextiraone.eu?

Thanks.

1 Attachment

Moderator

6.9K Posts

April 13th, 2017 11:00

Hello tcrescence,

I sent you an email. If you could reply to it with the logs, that would be great.

Please let us know if you have any other questions.
