Unsolved

This post is more than 5 years old

232 Posts

13090

May 29th, 2004 14:00

Error message about Library

I have two PE2650 in a 2-node Win2K3 cluster running two virtual servers Active/Active.  The nodes are connected using QLA 2310 through a Brocade 3200 Fibre Switch to a XioTech Magnitude SAN.  Also connected to the Brocade switch is a PV136T-LTO Library with two drives..  I'm trying to install BUE 9.1 with OFO, Lib Expansion Option, SSO, and Admin Plus Pack.  I'm installing BUE on each node and a third standalone server is running BUE and serves as the Primary SSO Database.

I continue to have a LOT of problems with either Drive 1 (the second drive) in the Library going offline (I have backed up to it, so, don't think that there's really anything wrong), or in getting this error message:

>>Event Type: Error
Event Source: Removable Storage Service
Event Category: None
Event ID: 153
Date:  5/29/2004
Time:  3:14:56 AM
User:  N/A
Computer: VHADENFCLUA
Description:
Did not match serial number "IE72G03863" provided by "Dell (TM) PowerVault (TM) 136T Tape Library (Changer0)" among the list of drives.

For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp.<<

Doing a Config Dump from the Library, I get this:

>>

LIBRARY INVENTORY REPORT

======================================================================


Library Vendor Id: DELL    
Library Product Id: PV-136T         
Library Product Rev: 3.21
Library Full Firmware: 3.21.0002          
Library Serial Number: DELL13402934001 
Remote Management Version: 180D.00002
RMU Serial Number: 00308C013642
SCSI address: 0
Service Tag: 
Mailbox slots: 12
DRIVES: 2
----------------------------------------------------------------------
Drive 0
  Element Address 256 [0x0100]
  Vendor Id: HP      
  Product Id: Ultrium 1-SCSI  
  Firmware Level: E34A
  Alternate Vendor Id: DELL    
  Alternate Product Id: PV-136T-LTO     
  Alternate Firmware Level: E34A
  Serial Number: IE72G03863
  SCSI address: 1
Drive 1
  Element Address 257 [0x0101]
  Vendor Id: HP      
  Product Id: Ultrium 1-SCSI  
  Firmware Level: E34A
  Alternate Vendor Id: DELL    
  Alternate Product Id: PV-136T-LTO     
  Alternate Firmware Level: E34A
  Serial Number: IE72H03112
  SCSI address: 2
======================================================================




Library Inventory

Number storage slots: 60

Number drive slots: 2

Number mailbox slots: 12<<
So, I know that the drive serial number is right (and, I NEVER had any problems with drive 0, anyway).

Anything I need to do in my config to fix this?

Thanks!

May 31st, 2004 22:00

1.  Where are drives displaying as offline?? (in backup exec, the OS, or the LCD panel..etc)

2.  Do they go offline in the middle of a backup and do the backup jobs fail?

3.  You may want to disable RSM since they are generating the "serial number" messages.  RSM is not needed.

4.  Are the Backup Exec servers operating as a cluster pair? In other words, are you using the backup exec cluster components?

 

232 Posts

June 1st, 2004 12:00

1.  They'll display as offline on the Library front panel display and BUE.

2.  Usually what will happen is that I try a backup (or an inventory, or, just about anything) from BUE, and, it will fail, and the drive will be reported to be offline.

3.  Not sure on how to disable the RSM, and, is that really necessary.  Could I just have a bad drive?  I have changed cables and terminators around, and, never have problems with the robotics, or other drive.

4.  With the three BUE servers, one is standalone, used to backup other network servers, while the other two are setup in a cluster, with a virtual BUE server, used only to backup that Active/Active cluster.

Thanks!

June 2nd, 2004 02:00

RSM can be disabled via the OS Services utility.  (it is a service and can be set to disabled).  RSM can easily interfere with the tape device, thus conflicting with Backup Exec.  Once the drive goes offline, you will need to reset the library first, than the server.  It is hard to say that the drive is faulty, but to further isolate:

1.  Make sure the library has the latest firmware.

2.  Disable the RSM service.

3.  Power Down all servers that have access to the library

4.  Power down the library (power it down from the rear switch of the library).  Keep off for 30 seconds, than power on

5.  Wait until the library is fully initialized.  (about 5 minutes)

6.  Power on each server one at a time. Start Backup Exec and make sure that all device are detected.

In your fourth statement, you mentioned that the two cluster nodes are setup as a cluster and a virtual BUE.    I assume that this means the two node cluster has Backup Exec cluster components installed, correct?  If you look in your MS cluster administrator, is there a Backup Exec group?

When the drive goes offline, do you see any SAC error codes on the LCD panel?  If so, what are the error codes?

232 Posts

June 2nd, 2004 06:00

1.  It does--just updated it this weekend with the very newest.

2.  Will try.

3, 4, 5, 6.  I've done this a few times, and, sometimes things work, and, sometimes they don't.

I tried swapping the two drives, to see if the error moves to the other one, or, stays with the same one.

I do see some codes on the panel, but, wouldn't they also be in the logs?

Thanks.

4 Posts

June 2nd, 2004 12:00

I have a question for you do you have the device enabled or diabled in device manger thru the os ?

I use BAB v901 I had to disable it in the os for it to work properly I have a 130 t and 136 t in a SAN emc16bs and 2 56f and pv 35f connected together

The only thing I can tell you In brightstore I was getting a lot of scsi errors I disable it now it works fine .

I dont know if that will help good luck .

'

232 Posts

June 2nd, 2004 13:00

It's enabled.  I've never read/heard to do it any other way, but, will check into it.

Thanks!

Top