Unsolved
This post is more than 5 years old
232 Posts
0
13090
May 29th, 2004 14:00
Error message about Library
I have two PE2650 in a 2-node Win2K3 cluster running two virtual servers Active/Active. The nodes are connected using QLA 2310 through a Brocade 3200 Fibre Switch to a XioTech Magnitude SAN. Also connected to the Brocade switch is a PV136T-LTO Library with two drives.. I'm trying to install BUE 9.1 with OFO, Lib Expansion Option, SSO, and Admin Plus Pack. I'm installing BUE on each node and a third standalone server is running BUE and serves as the Primary SSO Database.
I continue to have a LOT of problems with either Drive 1 (the second drive) in the Library going offline (I have backed up to it, so, don't think that there's really anything wrong), or in getting this error message:
>>Event Type: Error
Event Source: Removable Storage Service
Event Category: None
Event ID: 153
Date: 5/29/2004
Time: 3:14:56 AM
User: N/A
Computer: VHADENFCLUA
Description:
Did not match serial number "IE72G03863" provided by "Dell (TM) PowerVault (TM) 136T Tape Library (Changer0)" among the list of drives.
For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp.<<
Doing a Config Dump from the Library, I get this:
>>
LIBRARY INVENTORY REPORT ====================================================================== Library Vendor Id: DELL Library Product Id: PV-136T Library Product Rev: 3.21 Library Full Firmware: 3.21.0002 Library Serial Number: DELL13402934001 Remote Management Version: 180D.00002 RMU Serial Number: 00308C013642 SCSI address: 0 Service Tag: Mailbox slots: 12 DRIVES: 2 ---------------------------------------------------------------------- Drive 0 Element Address 256 [0x0100] Vendor Id: HP Product Id: Ultrium 1-SCSI Firmware Level: E34A Alternate Vendor Id: DELL Alternate Product Id: PV-136T-LTO Alternate Firmware Level: E34A Serial Number: IE72G03863 SCSI address: 1 Drive 1 Element Address 257 [0x0101] Vendor Id: HP Product Id: Ultrium 1-SCSI Firmware Level: E34A Alternate Vendor Id: DELL Alternate Product Id: PV-136T-LTO Alternate Firmware Level: E34A Serial Number: IE72H03112 SCSI address: 2 ====================================================================== Library Inventory Number storage slots: 60 Number drive slots: 2 Number mailbox slots: 12<<
So, I know that the drive serial number is right (and, I NEVER had any problems with drive 0, anyway).
Anything I need to do in my config to fix this?
Thanks!


dell-richard g
605 Posts
0
May 31st, 2004 22:00
1. Where are drives displaying as offline?? (in backup exec, the OS, or the LCD panel..etc)
2. Do they go offline in the middle of a backup and do the backup jobs fail?
3. You may want to disable RSM since they are generating the "serial number" messages. RSM is not needed.
4. Are the Backup Exec servers operating as a cluster pair? In other words, are you using the backup exec cluster components?
Bill Bradley
232 Posts
0
June 1st, 2004 12:00
1. They'll display as offline on the Library front panel display and BUE.
2. Usually what will happen is that I try a backup (or an inventory, or, just about anything) from BUE, and, it will fail, and the drive will be reported to be offline.
3. Not sure on how to disable the RSM, and, is that really necessary. Could I just have a bad drive? I have changed cables and terminators around, and, never have problems with the robotics, or other drive.
4. With the three BUE servers, one is standalone, used to backup other network servers, while the other two are setup in a cluster, with a virtual BUE server, used only to backup that Active/Active cluster.
Thanks!
dell-richard g
605 Posts
0
June 2nd, 2004 02:00
RSM can be disabled via the OS Services utility. (it is a service and can be set to disabled). RSM can easily interfere with the tape device, thus conflicting with Backup Exec. Once the drive goes offline, you will need to reset the library first, than the server. It is hard to say that the drive is faulty, but to further isolate:
1. Make sure the library has the latest firmware.
2. Disable the RSM service.
3. Power Down all servers that have access to the library
4. Power down the library (power it down from the rear switch of the library). Keep off for 30 seconds, than power on
5. Wait until the library is fully initialized. (about 5 minutes)
6. Power on each server one at a time. Start Backup Exec and make sure that all device are detected.
In your fourth statement, you mentioned that the two cluster nodes are setup as a cluster and a virtual BUE. I assume that this means the two node cluster has Backup Exec cluster components installed, correct? If you look in your MS cluster administrator, is there a Backup Exec group?
When the drive goes offline, do you see any SAC error codes on the LCD panel? If so, what are the error codes?
Bill Bradley
232 Posts
0
June 2nd, 2004 06:00
1. It does--just updated it this weekend with the very newest.
2. Will try.
3, 4, 5, 6. I've done this a few times, and, sometimes things work, and, sometimes they don't.
I tried swapping the two drives, to see if the error moves to the other one, or, stays with the same one.
I do see some codes on the panel, but, wouldn't they also be in the logs?
Thanks.
meeep
4 Posts
0
June 2nd, 2004 12:00
I have a question for you do you have the device enabled or diabled in device manger thru the os ?
I use BAB v901 I had to disable it in the os for it to work properly I have a 130 t and 136 t in a SAN emc16bs and 2 56f and pv 35f connected together
The only thing I can tell you In brightstore I was getting a lot of scsi errors I disable it now it works fine .
I dont know if that will help good luck .
'
Bill Bradley
232 Posts
0
June 2nd, 2004 13:00
It's enabled. I've never read/heard to do it any other way, but, will check into it.
Thanks!