PowerStore:嵌入式 ESXi 节点无法在 PowerStore X 上显示任何协议端点

This article applies to This article does not apply to This article is not tied to any specific product. Not all product versions are identified in this article.

Symptoms

当以太网交换机上配置的最大传输单元 (MTU) 不等于或大于 PowerStore 管理网络上配置的 MTU 时,可能会导致间歇性管理问题。
在此特定示例中,PowerStore X VASA 连接仅对一个嵌入式 ESXi 主机失败(安装在节点 B 上的主机)
 

 

SLN322011_zh_CN__1icon 启用巨型帧(通常 MTU 大小为 9000 字节)时,必须对它们进行一致的端到端设置。配置错误的巨型帧可能会导致连接故障或 IO 性能下降。

  
 

目录

  1. 问题
  2. 解决方案
 

1.问题

在受影响 ESXi 主机上的 /var/log/vvold.log 中观察到这些连接错误:
 
2020-06-24T09:33:08.114Z info vvold[2104948] [Originator@6876 sub=Default] VasaSession::Initialize url is empty 2020-06-24T09:33:08.114Z warning vvold[2104948] [Originator@6876 sub=Default] VasaSession::DoSetContext: Empty VP URL for VP (PowerStore)! 2020-06-24T09:33:08.114Z info vvold[2104948] [Originator@6876 sub=Default] Initialize: Failed to establish connection https://xx.xx.xx.xx:8443/version.xml 2020-06-24T09:33:08.114Z error vvold[2104948] [Originator@6876 sub=Default] Initialize: Unable to init session to VP PowerStore state: 0 2020-06-24T09:33:08.117Z info vvold[2104947] [Originator@6876 sub=Default] VasaSession::GetEndPoint: with url https://xx.xx.xx.xx:8443/version.xml 2020-06-24T09:34:28.895Z warning vvold[2104947] [Originator@6876 sub=Default] VasaSession::GetEndPoint: failed to get endpoint, err=Connection timed out, using default 2020-06-24T09:34:28.896Z info vvold[2104947] [Originator@6876 sub=Default] VasaSession::Initialize url is empty
 

 
这是与以下来自不同系统的日志不同的示例。以下日志是证书故障,是一个完全不同的问题。但是,在上面的示例中,虽然日志的大部分内容相同,但它是一个连接错误。

  
以下是在不同的 ESXi 主机上的 /var/log/vvold.log 中观察到的不同问题的证书错误:

2019-12-26T16:57:03.396Z info vvold[2139844] [Originator@6876 sub=Default] VasaSession::GetEndPoint: with url https://xxxxxxxx.com:8443/version.xml 2019-12-26T16:57:03.401Z warning vvold[2139844] [Originator@6876 sub=Default] VasaSession::GetEndPoint: failed to get endpoint, err=SSL Exception: Verification parameters: --> PeerThumbprint: 0B:01:C4:F2:16:E0:10:C9:63:B5:F2:92:D3:36:B5:65:5C:59:DB:17 --> ExpectedThumbprint: --> ExpectedPeerName: xxxxxxxx.com --> The remote host certificate has these problems: --> --> * Host name does not match the subject name(s) in certificate., using default 2019-12-26T16:57:03.401Z info vvold[2139844] [Originator@6876 sub=Default] VasaSession::Initialize url is empty 2019-12-26T16:57:03.401Z warning vvold[2139844] [Originator@6876 sub=Default] VasaSession::DoSetContext: Empty VP URL for VP (xxxxxxxxx)! 2019-12-26T16:57:03.401Z info vvold[2139844] [Originator@6876 sub=Default] Initialize: Failed to establish connection https://xxxxxxxx.com:8443/version.xml 2019-12-26T16:57:03.401Z error vvold[2139844] [Originator@6876 sub=Default] Initialize: Unable to init session to VP xxxxxxxxx state: 0
 

  

SLN322011_zh_CN__1icon 确保 vvold.log 中的错误是 err=Connection timed out 而不是err=SSL Exception。如果错误是 SSL 异常,请遵循 Vmware KB 67744

  
 

SLN322011_zh_CN__1icon 测试与巨型帧的连接时,请减去 8 个字节的 ICMP 标头以及 20 字节的最小 IP 标头。9000 - 28 = 8972。这两个标头将自动添加,从而增加帧大小。

  
 

对于某些路径,检查从 ESXi 主机的连接失败。在下面的示例中,测试从节点 B 上的一个嵌入式 ESXi 主机到节点 A 上的另一个嵌入式主机的连接: 
 
[root@Powerstore1000X-host-2:~] vmkping -I vmk1 1.2.3.4 -s 8972 -c 2 PING 1.2.3.4 (1.2.3.4): 8972 data bytes 8980 bytes from 1.2.3.4: icmp_seq=0 ttl=64 time=0.327 ms 8980 bytes from 1.2.3.4: icmp_seq=1 ttl=64 time=0.376 ms --- 1.2.3.4 ping statistics --- 2 packets transmitted, 2 packets received, 0% packet loss round-trip min/avg/max = 0.327/0.352/0.376 ms [root@Powerstore1000X-host-2:~] vmkping -I vmk1 1.2.3.5 -s 8972 -c 2 PING 1.2.3.5 (1.2.3.5): 8972 data bytes --- 1.2.3.5 ping statistics --- 2 packets transmitted, 0 packets received, 100% packet loss[root@Powerstore1000X-host-2:~] [root@Powerstore1000X-host-2:~] vmkping -I vmk2 1.2.3.5 -s 8972 -c 2 PING 1.2.3.5 (1.2.3.5): 8972 data bytes 8980 bytes from 1.2.3.5: icmp_seq=0 ttl=64 time=0.303 ms 8980 bytes from 1.2.3.5: icmp_seq=1 ttl=64 time=0.411 ms --- 1.2.3.5 ping statistics --- 2 packets transmitted, 2 packets received, 0% packet loss round-trip min/avg/max = 0.303/0.357/0.411 ms [root@Powerstore1000X-host-2:~] vmkping -I vmk2 1.2.3.4 -s 8972 -c 2 PING 1.2.3.4 (1.2.3.4): 8972 data bytes --- 1.2.3.4 ping statistics --- 2 packets transmitted, 0 packets received, 100% packet loss[root@Powerstore1000X-host-2:~]

 

  
SLN322011_zh_CN__1icon 这些 ping 测试从 ssh 会话向 ESXi 主机运行,有关 vmkping 的详细信息,请参阅 VMWARE KB 1003728

  
 

但是,当使用标准有效负载进行测试时,ping 将会成功(将 -s 8972 更改为 -s 1472):
 
[root@Powerstore1000X-host-2:~] vmkping -I vmk1 1.2.3.4 -s 8972 -c 2 PING 1.2.3.4 (1.2.3.4): 8972 data bytes 8980 bytes from 1.2.3.4: icmp_seq=0 ttl=64 time=0.327 ms 8980 bytes from 1.2.3.4: icmp_seq=1 ttl=64 time=0.376 ms --- 1.2.3.4 ping statistics --- 2 packets transmitted, 2 packets received, 0% packet loss round-trip min/avg/max = 0.327/0.352/0.376 ms [root@Powerstore1000X-host-2:~] vmkping -I vmk1 1.2.3.5 -s 8972 -c 2 PING 1.2.3.5 (1.2.3.5): 8972 data bytes --- 1.2.3.5 ping statistics --- 2 packets transmitted, 0 packets received, 100% packet loss[root@Powerstore1000X-host-2:~] [root@Powerstore1000X-host-2:~] vmkping -I vmk2 1.2.3.5 -s 8972 -c 2 PING 1.2.3.5 (1.2.3.5): 8972 data bytes 8980 bytes from 1.2.3.5: icmp_seq=0 ttl=64 time=0.303 ms 8980 bytes from 1.2.3.5: icmp_seq=1 ttl=64 time=0.411 ms --- 1.2.3.5 ping statistics --- 2 packets transmitted, 2 packets received, 0% packet loss round-trip min/avg/max = 0.303/0.357/0.411 ms [root@Powerstore1000X-host-2:~] vmkping -I vmk2 1.2.3.4 -s 8972 -c 2 PING 1.2.3.4 (1.2.3.4): 8972 data bytes --- 1.2.3.4 ping statistics --- 2 packets transmitted, 0 packets received, 100% packet loss[root@Powerstore1000X-host-2:~]

 

SLN322011_zh_CN__1icon 我们似乎无法 ping 通 VLTi 端口通道。上述示例中的 ping 是否成功取决于所选的源接口,因为每个源接口连接到不同的交换机。

  
 


2.解决方案

 

SLN322011_zh_CN__1icon 在 Dell Networking OS10 或 OS9 交换机上,连接到 PowerStore 的所有接口的 MTU 应设置为 9216。配置错误将会导致此问题。

  
 

SLN322011_zh_CN__1icon 在 10.5.0(2019 年 8 月)之前的 OS10 版本中存在一个问题,在不进行分割的情况下,VLTi 接口端口信道 1000 不传递 MTU 大于 1500 的帧。默认情况下,VLTi 应传递最大 9216 MTU 的帧。

  
 

为了从 OS10 交换机的 CLI 验证我们是否可以传递特定 MTU,命令的格式为ping -M do -s 8972 aaa.bbb.ccc.ddd -c 3. 例如:
 
SWITCH# ping -M do -s 8972 1.2.3.6 -c 3 PING 1.2.3.6 (1.2.3.6) 8972(9000) bytes of data. ping: local error: Message too long, mtu=1500 ping: local error: Message too long, mtu=1500 ping: local error: Message too long, mtu=1500 --- 1.2.3.6 ping statistics --- 3 packets transmitted, 0 received, +3 errors, 100% packet loss, time 2042ms SWITCH# ping -M do -s 2472 1.2.3.6 -c 3 PING 1.2.3.6 (1.2.3.6) 2472(2500) bytes of data. ping: local error: Message too long, mtu=1500 ping: local error: Message too long, mtu=1500 ping: local error: Message too long, mtu=1500 --- 1.2.3.6 ping statistics --- 3 packets transmitted, 0 received, +3 errors, 100% packet loss, time 2039ms SWITCH# ping -M do -s 1472 1.2.3.6 -c 3 PING 1.2.3.6 (1.2.3.6) 1472(1500) bytes of data. 1480 bytes from 1.2.3.6: icmp_seq=1 ttl=64 time=1.05 ms 1480 bytes from 1.2.3.6: icmp_seq=2 ttl=64 time=0.966 ms 1480 bytes from 1.2.3.6: icmp_seq=3 ttl=64 time=1.00 ms --- 1.2.3.6 ping statistics --- 3 packets transmitted, 3 received, 0% packet loss, time 2001ms rtt min/avg/max/mdev = 0.966/1.008/1.059/0.046 ms SWITCH#

 

  • MS Surface PRO -s 交换机用于定义帧的有效负载大小
  • 在上面的输出中,有效负载为 8972(对应于 9000 MTU)的帧发送失败。
  • 在这之后,2472 的有效负载(对应于 2500 MTU)也发送失败
  • 最后,1472 的有效负载(对应于 1500 MTU)成功
  • 在这种情况下,可确认网络路径无法接受大于 1500 MTU 的帧
  • 在此特定的示例中,由于先前介绍的 OS10 缺陷,问题在于 2 x S4148U 之间的 VLTi 端口通道 1000。
 

 

SLN322011_zh_CN__1icon 纠正 MTU 不匹配并重新扫描 ESXi 主机上的存储后,我们可以看到所有预期的协议端点。

 



 

Affected Products

PowerStore
Article Properties
Article Number: 000125860
Article Type: Solution
Last Modified: 19 Apr 2021
Version:  5
Find answers to your questions from other Dell users
Support Services
Check if your device is covered by Support Services.