Isilon:在提交 OneFS 8.2 升级后,所有已配置的 VLAN 接口上的 Smartconnect 服务 IP 地址 (SSIP) 丢失

摘要: 本文介绍在升级到 8.2 时,在使用 VLAN 配置的任何接口上 SSIP 丢失的事件。

本文适用于 本文不适用于 本文并非针对某种特定的产品。 本文并非包含所有产品版本。

症状

在提交到 OneFS 8.2 的升级后,对于配置了 VLAN 标记的任何接口,Smartconnect 服务 IP 地址 (SSIP) 将不再可用于分区名称查询和负载平衡。
提醒:对于配置了 SSIP 和 VLAN 标记的任何子网,此问题不适用。  此外,这不适用于任何其他升级类型或版本。  仅在升级到 8.2 时才适用。
为了说明问题,内部复制反映如何触发问题以及需要什么标准 -
>>我们可以看到所有接口上都存在 SSIP(169.168.1.9、169.168.10.9、169.168.20.9):
MN-X410-CLUS-1# ifconfig
bxe0: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> metric 0 mtu 1500
options=507bb<RXCSUM,TXCSUM,VLAN_MTU,VLAN_HWTAGGING,JUMBO_MTU,VLAN_HWCSUM,TSO4,TSO6,LRO,VLAN_HWFILTER,VLAN_HWTSO>
       ether 00:0a:f7:
       inet 169.168.1.20 netmask 0xffffff00 broadcast 169.168.1.255 zone 1
       inet 169.168.1.9 netmask 0xffffff00 broadcast 169.168.1.255 zone 1
       nd6 options=29<PERFORMNUD,IFDISABLED,AUTO_LINKLOCAL>
       media: Ethernet autoselect (10Gbase-SR <full-duplex>)
       status: active
.
.
vlan0: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> metric 0 mtu 1500
       options=303<RXCSUM,TXCSUM,TSO4,TSO6>
       ether 00:0a:f7:
       inet6 fe80::20a:f7: %vlan0 prefixlen 64 scopeid 0x8 zone 1
       inet 169.168.10.20 netmask 0xffffff00 broadcast 169.168.10.255 zone 1
       inet 169.168.10.9 netmask 0xffffff00 broadcast 169.168.10.255 zone 1
       nd6 options=21<PERFORMNUD,AUTO_LINKLOCAL>
       media: Ethernet autoselect (10Gbase-SR <full-duplex>)
       status: active
       vlan: 10 parent interface: bxe0
vlan1: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> metric 0 mtu 1500
       options=303<RXCSUM,TXCSUM,TSO4,TSO6>
       ether 00:0a:f7:
       inet6 fe80::20a:f7: %vlan1 prefixlen 64 scopeid 0x9 zone 1
       inet 169.168.20.9 netmask 0xffffff00 broadcast 169.168.20.255 zone 1
       inet 169.168.20.20 netmask 0xffffff00 broadcast 169.168.20.255 zone 1
       nd6 options=21<PERFORMNUD,AUTO_LINKLOCAL>
       media: Ethernet autoselect (10Gbase-SR <full-duplex>)
       status: active
       vlan: 20 parent interface: bxe0        
>> 开始升级到 8.2.0.0:
MN-X410-CLUS-1# uname -a
Isilon OneFS MN-X410-CLUS-1 v8.2.0.0 Isilon OneFS v8.2.0.0 B_8_2_0_0_011(RELEASE): 0x80200500000000B:Thu Jun 20 10:29:21 PDT 2019
    root@sea-build11-01:/b/mnt/obj/b/mnt/src/amd64.amd64/sys/IQ.amd64.release   FreeBSD clang version 3.9.1
(tags/RELEASE_391/final 289601) (based on LLVM 3.9.1) amd64


MN-X410-CLUS-1# ifconfig
bxe0: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> metric 0 mtu 1500
options=527bb<RXCSUM,TXCSUM,VLAN_MTU,VLAN_HWTAGGING,JUMBO_MTU,VLAN_HWCSUM,TSO4,TSO6,LRO,WOL_MAGIC,VLAN_HWFILTER,VLAN_HWTSO>
       ether 00:0a:f7:
       inet 169.168.1.20 netmask 0xffffff00 broadcast 169.168.1.255 zone 1
       inet 169.168.1.9 netmask 0xffffff00 broadcast 169.168.1.255 zone 0
       nd6 options=29<PERFORMNUD,IFDISABLED,AUTO_LINKLOCAL>
       media: Ethernet autoselect (10Gbase-SR <full-duplex>)
       status: active
.
.
vlan0: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> metric 0 mtu 1500
       options=303<RXCSUM,TXCSUM,TSO4,TSO6>
       ether 00:0a:f7:
       inet 169.168.10.20 netmask 0xffffff00 broadcast 169.168.10.255 zone 1
       inet 169.168.10.9 netmask 0xffffff00 broadcast 169.168.10.255 zone 0
       inet6 fe80::20a:f7: %vlan0 prefixlen 64 scopeid 0x8 zone 1
       nd6 options=21<PERFORMNUD,AUTO_LINKLOCAL>
       media: Ethernet autoselect (10Gbase-SR <full-duplex>)
       status: active
       vlan: 10 vlanpcp: 0 parent interface: bxe0
       groups: vlan
vlan1: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> metric 0 mtu 1500
       options=303<RXCSUM,TXCSUM,TSO4,TSO6>
       ether 00:0a:f7:
       inet 169.168.20.20 netmask 0xffffff00 broadcast 169.168.20.255 zone 1
       inet 169.168.20.9 netmask 0xffffff00 broadcast 169.168.20.255 zone 0
       inet6 fe80::20a:f7: %vlan1 prefixlen 64 scopeid 0x9 zone 1
       nd6 options=21<PERFORMNUD,AUTO_LINKLOCAL>
       media: Ethernet autoselect (10Gbase-SR <full-duplex>)
       status: active
       vlan: 20 vlanpcp: 0 parent interface: bxe0
       groups: vlan
>> 已验证的 vlan 和 SSIP 在重新启动后创建和分配:
MN-X410-CLUS-1# uname -a
Isilon OneFS MN-X410-CLUS-1 v8.2.0.0 Isilon OneFS v8.2.0.0 B_8_2_0_0_011(RELEASE): 0x80200500000000B:Thu Jun 20 10:29:21 PDT 2019
    root@sea-build11-01:/b/mnt/obj/b/mnt/src/amd64.amd64/sys/IQ.amd64.release   FreeBSD clang version 3.9.1
(tags/RELEASE_391/final 289601) (based on LLVM 3.9.1) amd64


MN-X410-CLUS-1# ifconfig
bxe0: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> metric 0 mtu 1500
options=527bb<RXCSUM,TXCSUM,VLAN_MTU,VLAN_HWTAGGING,JUMBO_MTU,VLAN_HWCSUM,TSO4,TSO6,LRO,WOL_MAGIC,VLAN_HWFILTER,VLAN_HWTSO>
       ether 00:0a:f7:
       inet 169.168.1.20 netmask 0xffffff00 broadcast 169.168.1.255 zone 1
       inet 169.168.1.9 netmask 0xffffff00 broadcast 169.168.1.255 zone 0
       nd6 options=29<PERFORMNUD,IFDISABLED,AUTO_LINKLOCAL>
       media: Ethernet autoselect (10Gbase-SR <full-duplex>)
       status: active
.
.
vlan0: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> metric 0 mtu 1500
       options=303<RXCSUM,TXCSUM,TSO4,TSO6>
       ether 00:0a:f7:
       inet 169.168.10.20 netmask 0xffffff00 broadcast 169.168.10.255 zone 1
       inet 169.168.10.9 netmask 0xffffff00 broadcast 169.168.10.255 zone 0
       inet6 fe80::20a:f7: %vlan0 prefixlen 64 scopeid 0x8 zone 1
       nd6 options=21<PERFORMNUD,AUTO_LINKLOCAL>
       media: Ethernet autoselect (10Gbase-SR <full-duplex>)
       status: active
       vlan: 10 vlanpcp: 0 parent interface: bxe0
       groups: vlan
vlan1: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> metric 0 mtu 1500
       options=303<RXCSUM,TXCSUM,TSO4,TSO6>
       ether 00:0a:f7:
       inet 169.168.20.20 netmask 0xffffff00 broadcast 169.168.20.255 zone 1
       inet 169.168.20.9 netmask 0xffffff00 broadcast 169.168.20.255 zone 0
       inet6 fe80::20a:f7: %vlan1 prefixlen 64 scopeid 0x9 zone 1
       nd6 options=21<PERFORMNUD,AUTO_LINKLOCAL>
       media: Ethernet autoselect (10Gbase-SR <full-duplex>)
       status: active
       vlan: 20 vlanpcp: 0 parent interface: bxe0
       groups: vlan
>> 升级处于提交前状态:
MN-X410-CLUS-1# isi upgrade view

Upgrade Status:

Current Upgrade Activity: OneFS upgrade
   Cluster Upgrade State: Ready to commit
   Upgrade Process State: Running
      Upgrade Start Time: 2019-07-13T12:38:25
      Current OS Version: 8.0.0.6_build(117)style(5)
      Upgrade OS Version: 8.2.0.0_build(11)style(5)
        Percent Complete: 100%

Nodes Progress:

     Total Cluster Nodes: 1
       Nodes On Older OS: 0
          Nodes Upgraded: 1
Nodes Transitioning/Down: 0

LNN  Progress  Version  Status 
--------------------------------
1    100%      8.2.0.0  upgraded
>> 提交升级:
MN-X410-CLUS-1# isi upgrade commit
You are about to COMMIT an upgrade, it CANNOT be rolled back after this, are you sure? (yes/[no]): yes
SSIP is now missing on all vlan interfaces, however non-vlan interface is NOT affected:
MN-X410-CLUS-1# ifconfig
bxe0: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> metric 0 mtu 1500
options=527bb<RXCSUM,TXCSUM,VLAN_MTU,VLAN_HWTAGGING,JUMBO_MTU,VLAN_HWCSUM,TSO4,TSO6,LRO,WOL_MAGIC,VLAN_HWFILTER,VLAN_HWTSO>
       ether 00:0a:f7:
       inet 169.168.1.20 netmask 0xffffff00 broadcast 169.168.1.255 zone 1
       inet 169.168.1.9 netmask 0xffffff00 broadcast 169.168.1.255 zone 0
       nd6 options=29<PERFORMNUD,IFDISABLED,AUTO_LINKLOCAL>
       media: Ethernet autoselect (10Gbase-SR <full-duplex>)
       status: active
.
.
vlan0: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> metric 0 mtu 1500
       options=303<RXCSUM,TXCSUM,TSO4,TSO6>
       ether 00:0a:f7:
       inet 169.168.10.20 netmask 0xffffff00 broadcast 169.168.10.255 zone 1
       inet6 fe80::20a:f7: %vlan0 prefixlen 64 scopeid 0x8 zone 1
       nd6 options=21<PERFORMNUD,AUTO_LINKLOCAL>
       media: Ethernet autoselect (10Gbase-SR <full-duplex>)
       status: active
       vlan: 10 vlanpcp: 0 parent interface: bxe0
       groups: vlan
vlan1: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> metric 0 mtu 1500
       options=303<RXCSUM,TXCSUM,TSO4,TSO6>
       ether 00:0a:f7:
       inet 169.168.20.20 netmask 0xffffff00 broadcast 169.168.20.255 zone 1
       inet6 fe80::20a:f7: %vlan1 prefixlen 64 scopeid 0x9 zone 1
       nd6 options=21<PERFORMNUD,AUTO_LINKLOCAL>
       media: Ethernet autoselect (10Gbase-SR <full-duplex>)
       status: active
       vlan: 20 vlanpcp: 0 parent interface: bxe0
       groups: vlan

原因

升级到 8.2 后,flexnet 的配置文件 (flx_config.xml) 被划分为更小、更窄的信息片段。  创建了一个名为 nodeinfo 的新文件夹,其中提供了每个节点的接口和状态信息。  但是,在升级期间,不会获取 vlan 信息,因此将从每个节点节点节点文件中排除任何 vlan 配置。  提交升级后,Smartconnect 会尝试从节点信息文件中读取,并且无法捕获任何 vlan 详细信息以分配 SSIP。
调试isi_smartconnect_d时,我们可以在日志中看到以下错误:
2019-07-15-T12:46:12:DEBUG:0x80c612010:NodeInterfaceGetVlanNic_inlock():nodeinfo.c:1281: Error STATUS_NOT_FOUND (0xc0000225)
2019-07-15-T12:46:12:DEBUG:0x80c612010:NodeInterfaceIsStatus():nodeinfo.c:1385: Error STATUS_NOT_FOUND (0xc0000225)
2019-07-15-T12:46:12:DEBUG:0x80c612010:VIPLoadInterface():vip_coord.c:480: Error STATUS_NETWORK_UNREACHABLE (0xc000023c)

解决方案

仅在升级到 8.2.0 时观察到此问题。该问题已在 8.2.1.0 及更高版本中得到解决。任何升级到高于 8.2.0 的更高版本都不受影响。  如果您怀疑您受此问题的影响,请联系支持部门并参考此知识库文章,然后再继续执行以下步骤。
  1. 要解决此问题,必须强制更改flx_config.xml 和 nodeinfo 配置。  这可以通过启用和禁用 Smartconnect 调试日志记录来触发。
要启用调试日志记录,请执行以下操作:
# isi_sc_log_level -l debug
要将日志记录更改为信息:
# isi_sc_log_level -l info
验证 IP 地址是否已返回并且进程正在运行:
# isi_for_array ifconfig | grep <SSIP>
# isi_for_array -s ps auwx | egrep "(smartconnect|dnsiq)" | grep -v grep
提醒:重新启动守护程序未解决此问题。  守护程序包括isi_dnsiq_d、isi_smartconnect_d和isi_flexnet_d。

产品

Isilon Gen6, Isilon HD400, Isilon NL410, Isilon X210, Isilon X410
文章属性
文章编号: 000168627
文章类型: Solution
上次修改时间: 14 12月 2023
版本:  4
从其他戴尔用户那里查找问题的答案
支持服务
检查您的设备是否在支持服务涵盖的范围内。