PowerFlex:SDC“无法访问卷”

Summary: 当 SVM 的本地数据存储区无法在给定时间内响应时,PowerFlex SDC 会记录“失去对卷的访问”。

This article applies to This article does not apply to This article is not tied to any specific product. Not all product versions are identified in this article.

Symptoms

 

  • 在有问题的 SVM 的 ESXi 主机上,RAID 控制器驱动程序lsi_mr3报告本地数据存储的底层磁盘上中止,并且 ESXi 报告丢失对卷的访问权限。

在 VMkernel 日志中:

2017-12-03T17:47:01.634Z cpu54:33648)ScsiDeviceIO: 2636: Cmd(0x43be59ec8a00) 0x1a, CmdSN 0x1f6f4 from world 0 to dev "naa.6800733259adcc4f214574350619b91a" failed H:0x0 D:0x2 P:0x0 Valid sense data: 0x5 0x24 0x0.
2017-12-03T17:47:44.125Z cpu1:171607)lsi_mr3: mfi_TaskMgmt:254: Processing taskMgmt abort for device: vmhba2:C2:T0:L0
2017-12-03T17:47:44.125Z cpu1:171607)lsi_mr3: mfi_TaskMgmt:262: ABORT
2017-12-03T17:47:45.125Z cpu34:32905)lsi_mr3: mfi_TaskMgmt:254: Processing taskMgmt virt reset for device: vmhba2:C2:T0:L0
2017-12-03T17:47:45.125Z cpu34:32905)lsi_mr3: mfi_TaskMgmt:258: VIRT_RESET cmd # 273733296
2017-12-03T17:47:45.125Z cpu34:32905)lsi_mr3: mfi_TaskMgmt:262: ABORT
2017-12-03T17:47:45.126Z cpu1:171607)lsi_mr3: fusionWaitForOutstanding:2531: megasas: [ 0]waiting for 1 commands to complete
2017-12-03T17:47:46.877Z cpu29:35817)HBX: 2851: 'datastore3': HB at offset 3691008 - Waiting for timed out HB:

在 hostd 日志中:
2017-12-03T17:47:45.126Z info hostd[41B40B70] [Originator@6876 sub=Vimsvc.ha-eventmgr] Event 219 : Lost access to volume 59b2c23a-98396dd8-aa53-84a9c4b71ca1 (datastore3) due to connectivity issues. Recovery attempt is in progress and outcome will be reported shortly.

  • SDS 设备可以报告任务中止(可在 var/log/messages 和 VMware 日志中找到),因此此 SVM 上的 SDS 进程将经历较长的飞行 IO,这会进一步影响 scaleio 系统的稳定性

 

  • SDC 随后在 VMkernel 日志中记录 IO 错误,因为有问题的 SDS 由于本地数据存储的响应缓慢而遇到一些网络套接字问题,并且驻留在 ScaleIO 卷上的应用程序数据存储可能会报告访问丢失:
在 VMkernel 日志中:
2017-12-03T17:47:52.060Z cpu39:33682)scini: netSock_RcvIntrn:1903: ScaleIO R2_0:Error: Failed Success to receive 128 data PTR 0x4306d2923de4 socket 0x4306d2924200
2017-12-03T17:47:54.061Z cpu1:33476)scini: mapVolIO_ReportIOErrorIfNeeded:361: ScaleIO R2_0:[201590843] IO-ERROR comb: 32ba80000015. offsetInComb 11387944. SizeInLB 1. SDS_ID de31ad4800000001. Comb Gen 39. Head Gen 10199.
2017-12-03T17:47:54.061Z cpu1:33476)scini: mapVolIO_ReportIOErrorIfNeeded:374: ScaleIO R2_0:Vol ID 0x756be73300000017. Last fault Status IO_HARD_ERROR(20).Last error Status NOT_CONN(4) Reason (ABORTED) Retry count (2) chan (4)

在 hostd 日志中:
2017-12-03T17:47:54.125Z info hostd[3FAAFB70] [Originator@6876 sub=Vimsvc.ha-eventmgr] Event 220 : Lost access to volume 59cb2f80-40ad26ac-cf4f-84a9c4b71ce1 (OS_windows_01) due to connectivity issues. Recovery attempt is in progress and outcome will be reported shortly.
2017-12-03T17:47:54.125Z info hostd[3FAAFB70] [Originator@6876 sub=Vimsvc.ha-eventmgr] Event 221 : Lost access to volume 59cb2f9e-984e3ff8-63e1-84a9c4b71ce1 (OS_Linux_01) due to connectivity issues. Recovery attempt is in progress and outcome will be reported shortly.
Impact

 

影响

SDC 可能无法访问驻留在 PowerFlex 卷上的数据存储区,并且这些数据存储区上的应用程序或虚拟机可能会受到影响,例如,文件系统变为只读状态。
 

Cause

  • VMFS 数据存储区通过大约每 3 秒从主机向 VMFS 卷发出一次写入作形式的心跳进行监视。当 SVM 的本地数据存储响应缓慢时,检测信号 I/O 的总时间未在 16 秒窗口内完成,数据存储标记为离线,并且 hostd 会生成“失去对卷的访问”日志消息以反映此行为。有关更多详细信息,请参阅 VMware 知识库 文章了解 ESXi 中对卷的丢失访问消息 
  • 在这种情况下,无法预测 SDS 的确切行为,例如,哪些其他 PowerFlex 组件会丢失 keepalive 消息。必须由此 SDS 提供服务的某些 SDC IO 可能会超过作系统或应用程序的超时时间,从而造成影响。

Resolution

联系 VMware 和硬件供应商以修复 RAID 控制器或其固件和驱动程序上的问题。

Additional Information

临时解决方法是删除有问题的 SDS,或将其迁移到另一个良好的本地数据存储。

Affected Products

PowerFlex appliance Intelligent Catalog Software, VxFlex Product Family

Products

PowerFlex rack, VxFlex Ready Nodes, PowerFlex Appliance, PowerFlex custom node, PowerFlex appliance R650, PowerFlex appliance R6525, PowerFlex appliance R660, PowerFlex appliance R6625, Powerflex appliance R750, PowerFlex appliance R760 , PowerFlex appliance R7625, PowerFlex custom node, PowerFlex custom node R650, PowerFlex custom node R6525, PowerFlex custom node R660, PowerFlex custom node R6625, PowerFlex custom node R750, PowerFlex custom node R760, PowerFlex custom node R7625, PowerFlex rack connectivity, PowerFlex rack HW, PowerFlex rack RCM Software, VxFlex Product Family, VxFlex Ready Node, VxFlex Ready Node R640, VxFlex Ready Node R740xd, PowerFlex appliance R640, PowerFlex appliance R740XD, PowerFlex appliance R7525, PowerFlex appliance R840, VxFlex Ready Node R840 ...
Article Properties
Article Number: 000027267
Article Type: Solution
Last Modified: 22 Sep 2025
Version:  4
Find answers to your questions from other Dell users
Support Services
Check if your device is covered by Support Services.