Start a Conversation

Unsolved

This post is more than 5 years old

V

1017

December 29th, 2016 15:00

MD3000 Storage Array

We have an MD3000 storage array that did something very weird about a month ago.  We have two servers attached to it via HBA adapters.   The servers lost connectivity to the array and our servers disconnected.    Once we rebooted our servers, everything appeared to be fine.  The only errors showing were sector error on a disk.  But when pulling the support information it was referring to disks that had failed 4 months prior.  

Then last week the MD3000 just crashed.   Totally down.  Could not get onto the unit at all.  Flashing Red lights...  the works...   It was in a constant reboot.  One controller would attempt to come up, crash, and then the other controller would do the same.   If we pulled the drives all out of the array we were then able to consistently ping the controllers.   But could never get access to the MDSM.

Our vendor shipped us a replacement unit and we moved all the drives over 1 for 1 along with the controllers.   The result was the same.  Constant reboots.   We were initially thinking that the backplane was the issue.   Next we tried plugging the drives into another unit with new controllers.    The unit did the same thing that time as well.  

We then found an article that talked about putting 2 new drives into the array and powering it up.   We were then able to see the drives in the array and also access MDSM.   From there we put in the remaining drives 1 by 1.   System stayed up the entire time.   We then installed Disk 1 and finally Disk 0.  The system was able to see all 10 drives we had in the system.   However, it did NOT see our Disk Groups!   

How can we get the controllers to read the "foreign config" off of the drives and import it to the controllers so we can access our data?

Is there some type of recovery command we can run?   I see recovery data in the MDSM folder structure on my server...   Is there a way to import that original config in?

HELP!!!!

Thanks,

RIch

Moderator

 • 

7.1K Posts

January 5th, 2017 08:00

Hello Rich,

When you was not able to access MDSM on your MD3000 did you by chance happen to capture the boot message that you was getting using the serial cable on your MD3000 or no? Also if you look in the Recovery Guru is there any error messages listed?

Please let us know if you have any other questions.

2 Posts

January 5th, 2017 14:00

Hi Sam,

I believe we were getting PANIC:  dbm:: versionConvert fail to commit.

And then I think it would try to start the other controller and do the same thing.   

Could not get into Recovery Guru at all when the issue was happening with the constant reboots.

Does that help?    Any advice?  

Moderator

 • 

7.1K Posts

January 11th, 2017 11:00

Hello Rich,

Do you have your serial cable that came with your MD3000? If so then can you capture the boot process of the controllers so that we can see how the controllers are booting & where they are getting stuck at? Here are the setting so that you can connect to the serial port on the controllers for your MD3000.

Startup a terminal emulation program like putty, teraterm, minicom or hyperterminal using these terminal settings (115200-8-n-1).

Please let us know if you have any other questions.

No Events found!

Top