Unsolved
This post is more than 5 years old
1 Message
0
1127
February 20th, 2018 12:00
Dell MD3000i SAN Controller Issue
We have been receiving periodic errors every couple hours regarding loss of communication to one of our controllers - Dell MD3000i dual controller. It appears to go offline briefly (I can't ping the iscsi port) but come back up and the errors clear. The errors started after replacing a failed controller with a new one. I have now tried replacing the controller with another 'new' one but the errors are still happening. All VD's are on controller 0 as there primary path but we have also been getting degraded port errors on controller 0. I attached the snippets from the event log:
vent type: 1707
Event category: Error
Priority: Critical
Description: Degraded wide port becomes failed
Event specific codes: 0/0/0
Component type: Enclosure Component (EMM, GBIC/SFP, Power Supply, or Fan)
Component location: Enclosure 0, Slot 0Logged by: RAID Controller Module in slot 0
Date/Time: 2/20/18 12:06:38 PM
Sequence number: 36897
Event type: 1706
Event category: Error
Priority: Critical
Description: Optimal wide port becomes degraded
Event specific codes: 0/0/0
Component type: Enclosure Component (EMM, GBIC/SFP, Power Supply, or Fan)
Component location: Enclosure 0, Slot 0
Logged by: RAID Controller Module in slot 0
We don't have many servers running on this SAN but would like to try to at least get it stable. If anyone has any thoughts or can take a look at our logs I would appreciate it.



DELL-Sam L
Moderator
•
7.8K Posts
0
February 21st, 2018 07:00
Hello pkelly5573,
Do you have any MD1000 expansion enclosures attached to your MD3000? Wide port errors are generally associated with a controller reset. Now what may have happened, is that your controller could have reset during its boot sequence. This is not unusual when changes are made. The controller will see the changes, commit them to running memory, then reboot to commit them to permanent memory.
What is need is to look at the event log. You want to see if there are corresponding wide port optimal messages. You also might look for Start of Day Routine start and completed messages. If that’s all completing then you are good to go as there is no error.
When you replaced the controller did the replacement controller update its firmware to match your secondary controller?
Please let us know if you have any other questions.