Unsolved

This post is more than 5 years old

2 Intern

 • 

127 Posts

529

March 1st, 2010 00:00

HBA failed

Hi

I have a Solaris 8 server attached to a EMC Symmetrix 8430 array. Having 2 HBAs with Powerpath.

root@CSCUSDMYSPDB1 # powermt display
Symmetrix logical device count=21
CLARiiON logical device count=0
==============================================================================
----- Host Bus Adapters ---------  ------ I/O Paths -----  ------ Stats ------
### HW Path                        Summary   Total   Dead  IO/Sec Q-IOs Errors
==============================================================================
  0 pci@8,700000/fibre-channel@2   failed       21     21       -     0     21
  1 pci@9,700000/fibre-channel@4   optimal      21      0       -     8      0

root@CSCUSDMYSPDB1 # powermt version
EMC powermt for PowerPath (c) Version 3.0.4
root@CSCUSDMYSPDB1 # uname -a
SunOS CSCUSDMYSPDB1 5.8 Generic_108528-27 sun4u sparc SUNW,Sun-Fire-880
root@CSCUSDMYSPDB1 #

This server is located remotely so I have to rely on the local hands and legs guy to check.

He said both HBAs and SAN ports connecting to the HBAs are green and blinking continously.

What could be the real problem? Server has been up for 499 days. I don't see any scsi errors on the messages file.


But in format, I notice 1 drive is down.

6. c2t16d0
          /pci@8,700000/fibre-channel@2/sd@10,0
       7. c2t16d23
          /pci@8,700000/fibre-channel@2/sd@10,17
       8. c2t16d24
          /pci@8,700000/fibre-channel@2/sd@10,18
       9. c2t16d25
          /pci@8,700000/fibre-channel@2/sd@10,19
      10. c2t16d26
          /pci@8,700000/fibre-channel@2/sd@10,1a

47 Posts

March 1st, 2010 03:00

Hi,

since you are talking about just one device (out of multiple) on the same target, I would find it very unlikely that the HBA is defective. If that were the case, you would be seeing all paths to/from that HBA showing up with an error message. To check more precisely you could perform a:

powermt display dev=all

to see if it's just for that one device.

Have you by any chance checked to see if any changes were made in the mapping/masking/zoning? Also, you could try to check if anything changed in your sd.conf that might have caused this.

Top