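VPLEX: Storage volume in critical-failure state due to SCSI check condition B/44/00 from the storage array
Summary: VPLEX marks the disk dead because of SCSI sense code B/44/00 from the underlying storage array.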
Symptoms
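The VPLEX storage volume is in the critical-failure state, while the underlying volume is healthy on the array side.
Sample output of the CLI command ll /clusters/cluster-1/storage-elements/storage-volumes/<storage-volume name>: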
VPlexcli:/> ll /clusters/cluster-1/storage-elements/storage-volumes/Symm1234_0000

/clusters/cluster-1/storage-elements/storage-volumes/Symm1234_0000:
Name                           Value
-----------------------------  ------------------------------------------------
application-consistent         false
block-count                    65798400
block-size                     4K
capacity                       251G
description                    -
free-chunks                    []
health-indications             [hardware dead]
health-state                   critical-failure   <<--
io-status                      dead
itls                           0x50001442607de710/0x5000097378028844/80,
                               0x50001442607de710/0x50000973780288c4/80,
                               0x50001442607de711/0x5000097378028844/80,
                               0x50001442607de711/0x50000973780288c4/80,
                               0x50001442707de710/0x5000097378028844/80,
                               0x50001442707de710/0x50000973780288c4/80,
                               0x50001442707de711/0x5000097378028844/80,
                               0x50001442707de711/0x50000973780288c4/80,
                               0x50001442a0100410/0x5000097378028844/80,
                               0x50001442a0100410/0x50000973780288c4/80,
                               ... (16 total)
largest-free-chunk             0B
locality                       -
operational-status             error
provision-type                 legacy
storage-array-name             EMC-SYMMETRIX-XXXXXXXXX
storage-volumetype             normal
system-id                      VPD83T3:60000097000000000000000000000000
thin-capable                   false
thin-rebuild                   true
total-free-space               0B
underlying-storage-block-size  512
use                            unusable
used-by                        [extent_Symm1234_0000_1]
vendor-specific-name           EMC
The storage volume summary also reports the critical-failure state:
VPlexcli:/> storage-volume summary
SUMMARY (cluster-1)
StorageVolume Name                        IO Status  Operational Status  Health State
----------------------------------------  ---------  ------------------  ----------------
Symm1234_0000                             dead       error               critical-failure  <<--
Sample excerpt from the Dial Home event, highlighting the dead logical unit:
<EventData><![CDATA[disk VPD83T3:60000970000000000000000000000000: disk died, no mirror exists
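If this storage volume is not part of a distributed device, hosts lose access to the virtual volume, resulting in a data unavailability situation.
The VPLEX firmware log shows a stream of scsi/27 events with SCSI sense data key 0xb asc 0x44 ascq 0x0 (B/44/00), which indicates an Internal Target Failure.
Sample scsi/27 entries from firmware.log: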
128.221.253.37/cpu0/log:5988:W/"0060166fcbfe192045-1":3298782:<6>2016/11/11 06:50:57.94: scsi/27 tgt VPD83T3:60000097000000000000000000000000 cmd 0x2a status 0x2 valid 0 resp 0x70 seg 0x0 bits 0x0 key 0xb info 0x0 alen 10 csi 0x0 asc 0x44 ascq 0x0 fru 0x0 sks 0x0
128.221.253.37/cpu0/log:5988:W/"0060166fcbfe192045-1":3298783:<6>2016/11/11 06:50:58.00: scsi/27 tgt VPD83T3:60000097000000000000000000000000 cmd 0x2a status 0x2 valid 0 resp 0x70 seg 0x0 bits 0x0 key 0xb info 0x0 alen 10 csi 0x0 asc 0x44 ascq 0x0 fru 0x0 sks 0x0
128.221.253.37/cpu0/log:5988:W/"0060166fcbfe192045-1":3298784:<6>2016/11/11 06:50:58.16: scsi/27 tgt VPD83T3:60000097000000000000000000000000 cmd 0x2a status 0x2 valid 0 resp 0x70 seg 0x0 bits 0x0 key 0xb info 0x0 alen 10 csi 0x0 asc 0x44 ascq 0x0 fru 0x0 sks 0x0
128.221.253.37/cpu0/log:5988:W/"0060166fcbfe192045-1":3298785:<6>2016/11/11 06:50:58.30: scsi/27 tgt VPD83T3:60000097000000000000000000000000 cmd 0x2a status 0x2 valid 0 resp 0x70 seg 0x0 bits 0x0 key 0xb info 0x0 alen 10 csi 0x0 asc 0x44 ascq 0x0 fru 0x0 sks 0x0
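The B/44/00 shorthand is simply the key, asc and ascq fields of a scsi/27 entry written as hex digits. As a minimal sketch, assuming the entry layout shown in the excerpt above, the following shell function turns each scsi/27 line into that notation. The function name sense_triplet is hypothetical and is not part of any VPLEX tooling.

```shell
# Hypothetical helper: print "<target> <KEY>/<ASC>/<ASCQ>" for each scsi/27 line on stdin.
sense_triplet() {
  awk '
    /scsi\/27/ {
      tgt = key = asc = ascq = ""
      # Look fields up by label so the result does not depend on column position.
      for (i = 1; i < NF; i++) {
        if ($i == "tgt")  tgt  = $(i + 1)
        if ($i == "key")  key  = $(i + 1)
        if ($i == "asc")  asc  = $(i + 1)
        if ($i == "ascq") ascq = $(i + 1)
      }
      if (key == "") next
      sub(/^0x/, "", key); sub(/^0x/, "", asc); sub(/^0x/, "", ascq)
      # Pad ASC and ASCQ to two hex digits, then upper-case the triplet.
      if (length(asc) < 2)  asc  = "0" asc
      if (length(ascq) < 2) ascq = "0" ascq
      print tgt, toupper(key "/" asc "/" ascq)
    }'
}
```

For example, grep "scsi/27" firmware.log | sense_triplet | sort | uniq -c prints one row per target and sense triplet; an entry with key 0xb asc 0x44 ascq 0x0 is reported as B/44/00.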
After 21 retries, the storage volume was marked dead because of the consecutive write failures:
128.221.253.37/cpu0/log:5988:W/"0060166fcbfe192045-1":3298797:<4>2016/11/11 06:51:01.42: amf/45 disk VPD83T3:60000097000000000000000000000000: write failure: marking this in-use disk dead
128.221.253.37/cpu0/log:5988:W/"0060166fcbfe192045-1":3298798:<2>2016/11/11 06:51:01.42: amf/97 disk VPD83T3:60000097000000000000000000000000: disk died, no mirror exists
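To find every disk that was marked dead this way, the amf/45 and amf/97 entries can be searched directly. The function below is a sketch that assumes the two message formats shown in the excerpt above; the name dead_disks is hypothetical.

```shell
# Hypothetical helper: list the unique disk IDs named in amf/45 or amf/97 entries.
# Reads the files given as arguments, or stdin when none are given.
dead_disks() {
  grep -E 'amf/(45|97) disk ' "$@" |
    # Keep only the ID between "disk " and the colon that ends it.
    sed -E 's/.*amf\/(45|97) disk ([^ ]+):.*/\2/' |
    sort -u
}
```

Running dead_disks firmware.log on the management server would print one VPD83 identifier per line, which can then be matched against the system-id attribute of the storage volumes.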
Cause
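The storage array returned B/44/00 for write I/O requests (0x2a): the array could not process the requests and responded with check condition B/44/00 (Internal Target Failure).
If the underlying storage volume is a VMAX volume, note the following about the I/O failure:
The VMAX does not report to VPLEX why it aborted the I/O. An Internal Target Failure can have many causes and can originate from a very wide range of sources.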
Resolution
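The storage array that sends SCSI check condition B/44/00 to VPLEX must be investigated by the respective array vendor. The issue is triggered by an "Internal Target Failure" on the storage array that prevents the array from processing write I/O requests.
If the underlying array is a VMAX, the array can send SCSI check condition B/44/00 for a number of reasons.
Run the following CLI command on the VPLEX management server to list the logical units affected by the B/44/00 check condition: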
grep "scsi/27" firmware.log* | grep 'key 0xb' |awk '{print $3, $5, $18, $19, $26,$27,$28,$29}'| sort | uniq -c | sort -nr
Example:
service@ManagementServer:~> grep "scsi/27" firmware.log* | grep 'key 0xb' | awk '{print $3, $5, $18, $19, $26,$27,$28,$29}'| sort | uniq -c | sort -nr
103474 scsi/27 VPD83T3:60000097000000000000000000000000 key 0xb asc 0x44 ascq 0x0
45254 scsi/27 VPD83T3:60000097000000000000000000000001 key 0xb asc 0x44 ascq 0x0
32908 scsi/27 VPD83T3:60000097000000000000000000000002 key 0xb asc 0x44 ascq 0x0
25581 scsi/27 VPD83T3:60000097000000000000000000000003 key 0xb asc 0x44 ascq 0x0
21043 scsi/27 VPD83T3:60000097000000000000000000000004 key 0xb asc 0x44 ascq 0x0
10809 scsi/27 VPD83T3:60000097000000000000000000000005 key 0xb asc 0x44 ascq 0x0
10651 scsi/27 VPD83T3:60000097000000000000000000000006 key 0xb asc 0x44 ascq 0x0
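When handing the case to the array vendor, it can also help to know when each logical unit first and last reported the condition. The following is a sketch only: it assumes the firmware.log layout shown in the Symptoms section (date at the end of the first field, time in the second, target in the fifth), and the name sense_window is hypothetical.

```shell
# Hypothetical helper: per target, print the count plus the first and last
# timestamp of scsi/27 entries carrying sense key 0xb.
sense_window() {
  grep -h 'scsi/27' "$@" | grep 'key 0xb' | awk '
    {
      ts = $1; sub(/.*>/, "", ts)   # date part follows the <N> severity tag
      tm = $2; sub(/:$/, "", tm)    # drop the trailing colon from the time
      stamp = ts " " tm
      t = $5
      n[t]++
      # Timestamps in this format sort correctly as plain strings.
      if (!(t in first) || stamp < first[t]) first[t] = stamp
      if (!(t in last)  || stamp > last[t])  last[t]  = stamp
    }
    END { for (t in n) print n[t], t, first[t], last[t] }' | sort -nr
}
```

Called as sense_window firmware.log*, it prints one row per target, highest count first. The timestamps are compared as strings, so the input files do not need to be in chronological order.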
If this is a non-EMC array, contact the respective array vendor to resolve the issue on the storage array.
Additional Information
This is not a VPLEX issue; it may be a back-end array or fabric issue.
Engage the storage array team, the connectivity team, or both.