ScaleIO:MDM_Disconnect错误故障排除
Summary: 主元数据管理器 (MDM) 所有权频繁地在 MDM 服务器之间移动。
This article applies to
This article does not apply to
This article is not tied to any specific product.
Not all product versions are identified in this article.
Symptoms
使用 showevents.py 工具时,将显示以下事件:
6956 2017-07-06 18:21:05.803 MDM_CLUSTER_LOST_CONNECTION WARNING The MDM, ID 27fea9a11c073e82, lost connection
辅助 MDM 服务器的 trc 日志中显示以下内容:
06/07 18:21:05.486947 0x7ffbc89feeb0:netPath_IsKaNeeded:01858: :: Connected Live CLIENT path 0x7ffb9400a060 of portal 0x7ffb94003780 net 0x7ffbac0044b0 socket 17 inflights 0 didn't receive message for 3 iterations from 10.xxx.xxx.xxx:9011. Marking as down
Cause
当辅助 MDM 或仲裁在 500 毫秒的超时期限内未看到保持活动状态时,通常会发生 MDM 断开连接。
Resolution
检查 MDM 和 TB 服务器上的网络接口卡 (NIC) 是否丢弃了数据包:
[root@scaleio-1 ~]# ifconfig ens192 ens192: flags=4163 mtu 1500 inet 10.xxx.xxx.xxx netmask 255.xxx.xxx.0 broadcast 10.xxx.xxx.xxx inet6 fe80::250:56ff:feb7:2a06 prefixlen 64 scopeid 0x20 ether 00:50:56:b7:2a:06 txqueuelen 1000 (Ethernet) RX packets 311779767 bytes 53460032583 (49.7 GiB) RX errors 0 dropped 41 overruns 0 frame 0 TX packets 312147963 bytes 45970694962 (42.8 GiB) TX errors 0 dropped 0 overruns 0 carrier 0 collisions 0
此外,使用 ping 命令检查 MDM 节点与 TB 之间的连接延迟:
[root@scaleio-1 ~]# ping 10.xxx.xxx.xxx PING 10.xxx.xxx.xxx (10.xxx.xxx.xxx) 56(84) bytes of data. 64 bytes from 10.xxx.xxx.xxx: icmp_seq=1 ttl=64 time=0.414 ms 64 bytes from 10.xxx.xxx.xxx: icmp_seq=2 ttl=64 time=0.395 ms 64 bytes from 10.xxx.xxx.xxx: icmp_seq=3 ttl=64 time=0.370 ms 64 bytes from 10.xxx.xxx.xxx: icmp_seq=4 ttl=64 time=0.399 ms 64 bytes from 10.xxx.xxx.xxx: icmp_seq=5 ttl=64 time=0.497 ms 64 bytes from 10.xxx.xxx.xxx: icmp_seq=6 ttl=64 time=0.534 ms
如果延迟发生变化或接近 500 毫秒,则可能是断开连接的问题。
MDM 断开连接也有非网络原因。如果进程挂起或未收到足够的 CPU 资源,则无法及时发送 keepalive 数据包。使用 top 命令检查系统的 CPU 利用率。
在 VMware 系统上,如果系统超额订阅,虚拟机 (VM) 可能无法获得足够的资源。您可以通过检查虚拟机的 CPU 就绪时间来检查是否是这种情况。
Affected Products
VxFlex Product FamilyProducts
PowerFlex Software, VxFlex Product FamilyArticle Properties
Article Number: 000064168
Article Type: Solution
Last Modified: 20 May 2025
Version: 3
Find answers to your questions from other Dell users
Support Services
Check if your device is covered by Support Services.