Unsolved
This post is more than 5 years old
2 Intern
•
127 Posts
0
529
March 1st, 2010 00:00
HBA failed
Hi
I have a Solaris 8 server attached to a EMC Symmetrix 8430 array. Having 2 HBAs with Powerpath.
root@CSCUSDMYSPDB1 # powermt display
Symmetrix logical device count=21
CLARiiON logical device count=0
==============================================================================
----- Host Bus Adapters --------- ------ I/O Paths ----- ------ Stats ------
### HW Path Summary Total Dead IO/Sec Q-IOs Errors
==============================================================================
0 pci@8,700000/fibre-channel@2 failed 21 21 - 0 21
1 pci@9,700000/fibre-channel@4 optimal 21 0 - 8 0
root@CSCUSDMYSPDB1 # powermt version
EMC powermt for PowerPath (c) Version 3.0.4
root@CSCUSDMYSPDB1 # uname -a
SunOS CSCUSDMYSPDB1 5.8 Generic_108528-27 sun4u sparc SUNW,Sun-Fire-880
root@CSCUSDMYSPDB1 #
This server is located remotely so I have to rely on the local hands and legs guy to check.
He said both HBAs and SAN ports connecting to the HBAs are green and blinking continously.
What could be the real problem? Server has been up for 499 days. I don't see any scsi errors on the messages file.
But in format, I notice 1 drive is down.
6. c2t16d0
/pci@8,700000/fibre-channel@2/sd@10,0
7. c2t16d23
/pci@8,700000/fibre-channel@2/sd@10,17
8. c2t16d24
/pci@8,700000/fibre-channel@2/sd@10,18
9. c2t16d25
/pci@8,700000/fibre-channel@2/sd@10,19
10. c2t16d26
/pci@8,700000/fibre-channel@2/sd@10,1a


Raayman
47 Posts
0
March 1st, 2010 03:00
Hi,
since you are talking about just one device (out of multiple) on the same target, I would find it very unlikely that the HBA is defective. If that were the case, you would be seeing all paths to/from that HBA showing up with an error message. To check more precisely you could perform a:
powermt display dev=all
to see if it's just for that one device.
Have you by any chance checked to see if any changes were made in the mapping/masking/zoning? Also, you could try to check if anything changed in your sd.conf that might have caused this.