Unsolved
7 Posts
0
32483
August 28th, 2005 22:00
mega_HAM_Timeout : ERROR : timeout on cloneMbox
Hi all,
I have a PE 1600SC that began reporting a failed drive in the PERC4 RAID array today. Upon reviewing the console log (NW 6.5 SP3-CPR), I am finding the following errors:
mega_HAM_Timeout : ERROR : timeout on cloneMbox 0x926460, hacb 0xB6E1B520 for instance [0]
mega_HAM_ISR : ERROR : cloneMbox with cmd 6F is serviced already.
Aug 26, 2005 7:49:05 PM org.apache.jk.server.JkCoyoteHandler action
SEVERE: Error in action code
java.io.IOException: Operation would block
and
mega_HAM_Timeout : ERROR : timeout on cloneMbox 0x938960, hacb 0xB6E1BE20 for instance [0]
mega_HAM_ISR : ERROR : cloneMbox with cmd 16 is serviced already.
This appeared in the log Friday evening (during tape backups), and the drive failure apparently occurred early this morning (Sunday).
I see from other posts about this message that it may mean the PERC controller is failing and needs to be replaced. Since this error occurred 2 days before the drive failure, could this indicate that the drive hasn't actually failed, but rather that the controller believes it has? Any recommendations?
Also, how do I disable the PERC's beeping alarm?
By the way, the server and PERC have the latest firmware, and the PERC's HAM is ver 7.02.03 (C)
Thanks much!
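For anyone triaging the same messages, here is a minimal Python sketch that tallies the two error shapes quoted above from a saved copy of the console log. It assumes the log has been copied off the server as plain text; the function name `summarize_ham_errors` and the sample text are illustrative only, and the patterns are modeled solely on the lines in this post, not on any documented driver message format.

```python
import re
from collections import Counter

# Patterns modeled on the two message shapes quoted in this thread (an assumption,
# not a documented format). They stop before "for instance [0]" so that the
# 80-column console wrap ("for in" / "stance [0]") does not break matching.
TIMEOUT_RE = re.compile(
    r"mega_HAM_Timeout : ERROR : timeout on cloneMbox (0x[0-9A-Fa-f]+), hacb (0x[0-9A-Fa-f]+)"
)
ISR_RE = re.compile(
    r"mega_HAM_ISR : ERROR : cloneMbox with cmd ([0-9A-Fa-f]+) is serviced already"
)


def summarize_ham_errors(log_text):
    """Count timeout events and tally the command codes reported by the ISR."""
    timeouts = TIMEOUT_RE.findall(log_text)
    cmds = Counter(code.upper() for code in ISR_RE.findall(log_text))
    return {
        "timeouts": len(timeouts),
        "hacbs": sorted({hacb for _mbox, hacb in timeouts}),
        "cmds": dict(cmds),
    }


# Sample text copied from the excerpts above, including the console line wrap.
sample = """mega_HAM_Timeout : ERROR : timeout on cloneMbox 0x926460, hacb 0xB6E1B520 for in
stance [0]
mega_HAM_ISR : ERROR : cloneMbox with cmd 6F is serviced already.
mega_HAM_Timeout : ERROR : timeout on cloneMbox 0x938960, hacb 0xB6E1BE20 for in
stance [0]
mega_HAM_ISR : ERROR : cloneMbox with cmd 16 is serviced already.
"""

if __name__ == "__main__":
    print(summarize_ham_errors(sample))
```

Running it against a full log gives a quick count of how many timeouts occurred and which command codes were involved, which may be useful to have on hand when talking to support.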


Gman05
7 Posts
0
August 29th, 2005 12:00
Thanks,
I've been into Dellmgr and disabled the alarm. The system ID is B4JT931. The server has had all the recent Novell patches installed as well as firmware updates from Dell. The server had been abending and/or hanging until the installation of NW6.5 SP3 and related patches about 30 days ago, so there is a lengthy abend.log.
I'm not sure about Array Manager. I only recently began working with this server and am unfamiliar with Dell-specific tools, as I have primarily used HP in the past. I do see a dellmon.nlm. What is Array Manager, and can I download it if it's not installed?
Thanks
Carrie_1
2 Intern
•
188 Posts
0
August 29th, 2005 12:00
If you loaded the Dell PEDGE3.HAM, it should have copied DELLMGR.NLM into SYS:SYSTEM. When you run this module from the console, it looks like the CTRL+M PERC BIOS. You can disable the alarm there.
As for the mega_HAM_Timeout error, this is going to take some troubleshooting. It is possible that it is a PERC failure, but I think at this point it is unlikely.
I would start by making sure all software has the latest patches installed. Check to see whether an ABEND.LOG exists in SYS:SYSTEM.
Also, you can pull a log from the PERC card itself. It can be pulled using Array Manager if you have it installed and configured, or Dell technical support can provide you with a tool called TTY that will allow you to pull the PERC log from DOS.
Please post your service tag number and let me know if your server is abending and if you have Array Manager installed.
Carrie
Carrie_1
2 Intern
•
188 Posts
0
August 29th, 2005 12:00
Looks like your server has a PERC4dc. Check to see what firmware you have installed. You should be able to find it in DELLMGR in Object - Adapter - Other Adapter information.
Also, all the third party software needs to be patched, which includes your backup software.
As for Open Manage, take a look in the autoexec.ncf towards the bottom and see if you see any comments for Dell software. If you do, copy the lines here and I'll tell you what you have installed. I would also like to know if you have Symantec Antivirus installed.
Carrie
Message Edited by DELL-Carrie on 08-29-2005 09:14 AM
Gman05
7 Posts
0
August 29th, 2005 13:00
Carrie_1
2 Intern
•
188 Posts
0
August 29th, 2005 17:00
I am about to email you a file to pull the PERC log. You will need to down the server to DOS.
Carrie
Gman05
7 Posts
0
August 30th, 2005 13:00
Carrie_1
2 Intern
•
188 Posts
0
August 30th, 2005 14:00
The tty log looks fine. I hadn't seen the Novell TID you posted.
Please call Dell technical support and ask them to replace the RAID card. Give the Dell tech a link to this forum post. Please let me know if replacing the PERC resolves the error.
Carrie
Carrie_1
2 Intern
•
188 Posts
0
August 30th, 2005 15:00
You should get the replacement PERC card today. Let me know if this fixes the error.
Carrie
Gman05
7 Posts
0
August 30th, 2005 15:00
I have. After 3 hrs of telephone time, they tell me this error is a software error, not a hardware problem. After much wrangling, they agreed to send me a card, which I just received. I'll let you know the outcome.
Any idea why the log didn't have any of the more recent entries in it?
Thanks again for your assistance. I really do appreciate it.
Kayak64
12 Posts
0
September 21st, 2005 11:00
cac4
10 Posts
0
September 21st, 2005 20:00
I've had it on 2 servers myself, and I posted about it here before. The tech I got (lucky me) knew what he was doing and, from those logs, was able to determine "without a doubt", according to him, that this error indicates bad RAM on the RAID controller. It's not the controller itself. But both times, they just replaced the controller.
Must have been a bad production run, as these 2 servers were identical and purchased at the same time. Both had the problem.
Steven Goh
1 Message
0
September 22nd, 2005 07:00
Hi Gman05, has your issue been resolved after replacing the PERC controller? I currently have a customer who is facing the same issue as yours.
Kayak64
12 Posts
0
September 22nd, 2005 08:00
Gman05
7 Posts
0
September 22nd, 2005 12:00