Isilon: After committing an upgrade to OneFS 8.2, the SmartConnect Service IP address (SSIP) is missing on all configured VLAN interfaces

Summary: This article describes an issue in which the SSIP is lost on any interface configured with a VLAN when upgrading to 8.2.

Symptoms

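After an upgrade to OneFS 8.2 is committed, the SmartConnect Service IP address (SSIP) is no longer available for zone name lookups and load balancing on any interface configured with VLAN tagging.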
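Note: This issue does not apply to any subnet configured with an SSIP but without VLAN tagging. It also does not apply to any other upgrade type or version; it applies only when upgrading to 8.2.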
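To illustrate the issue, the following internal reproduction shows how the issue is triggered and which conditions are required:
>> The SSIPs (169.168.1.9, 169.168.10.9, 169.168.20.9) are present on all interfaces: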
MN-X410-CLUS-1# ifconfig
bxe0: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> metric 0 mtu 1500
options=507bb<RXCSUM,TXCSUM,VLAN_MTU,VLAN_HWTAGGING,JUMBO_MTU,VLAN_HWCSUM,TSO4,TSO6,LRO,VLAN_HWFILTER,VLAN_HWTSO>
       ether 00:0a:f7:
       inet 169.168.1.20 netmask 0xffffff00 broadcast 169.168.1.255 zone 1
       inet 169.168.1.9 netmask 0xffffff00 broadcast 169.168.1.255 zone 1
       nd6 options=29<PERFORMNUD,IFDISABLED,AUTO_LINKLOCAL>
       media: Ethernet autoselect (10Gbase-SR <full-duplex>)
       status: active
.
.
vlan0: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> metric 0 mtu 1500
       options=303<RXCSUM,TXCSUM,TSO4,TSO6>
       ether 00:0a:f7:
       inet6 fe80::20a:f7: %vlan0 prefixlen 64 scopeid 0x8 zone 1
       inet 169.168.10.20 netmask 0xffffff00 broadcast 169.168.10.255 zone 1
       inet 169.168.10.9 netmask 0xffffff00 broadcast 169.168.10.255 zone 1
       nd6 options=21<PERFORMNUD,AUTO_LINKLOCAL>
       media: Ethernet autoselect (10Gbase-SR <full-duplex>)
       status: active
       vlan: 10 parent interface: bxe0
vlan1: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> metric 0 mtu 1500
       options=303<RXCSUM,TXCSUM,TSO4,TSO6>
       ether 00:0a:f7:
       inet6 fe80::20a:f7: %vlan1 prefixlen 64 scopeid 0x9 zone 1
       inet 169.168.20.9 netmask 0xffffff00 broadcast 169.168.20.255 zone 1
       inet 169.168.20.20 netmask 0xffffff00 broadcast 169.168.20.255 zone 1
       nd6 options=21<PERFORMNUD,AUTO_LINKLOCAL>
       media: Ethernet autoselect (10Gbase-SR <full-duplex>)
       status: active
       vlan: 20 parent interface: bxe0        
>> Upgrade to 8.2.0.0 started:
MN-X410-CLUS-1# uname -a
Isilon OneFS MN-X410-CLUS-1 v8.2.0.0 Isilon OneFS v8.2.0.0 B_8_2_0_0_011(RELEASE): 0x80200500000000B:Thu Jun 20 10:29:21 PDT 2019
    root@sea-build11-01:/b/mnt/obj/b/mnt/src/amd64.amd64/sys/IQ.amd64.release   FreeBSD clang version 3.9.1
(tags/RELEASE_391/final 289601) (based on LLVM 3.9.1) amd64


MN-X410-CLUS-1# ifconfig
bxe0: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> metric 0 mtu 1500
options=527bb<RXCSUM,TXCSUM,VLAN_MTU,VLAN_HWTAGGING,JUMBO_MTU,VLAN_HWCSUM,TSO4,TSO6,LRO,WOL_MAGIC,VLAN_HWFILTER,VLAN_HWTSO>
       ether 00:0a:f7:
       inet 169.168.1.20 netmask 0xffffff00 broadcast 169.168.1.255 zone 1
       inet 169.168.1.9 netmask 0xffffff00 broadcast 169.168.1.255 zone 0
       nd6 options=29<PERFORMNUD,IFDISABLED,AUTO_LINKLOCAL>
       media: Ethernet autoselect (10Gbase-SR <full-duplex>)
       status: active
.
.
vlan0: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> metric 0 mtu 1500
       options=303<RXCSUM,TXCSUM,TSO4,TSO6>
       ether 00:0a:f7:
       inet 169.168.10.20 netmask 0xffffff00 broadcast 169.168.10.255 zone 1
       inet 169.168.10.9 netmask 0xffffff00 broadcast 169.168.10.255 zone 0
       inet6 fe80::20a:f7: %vlan0 prefixlen 64 scopeid 0x8 zone 1
       nd6 options=21<PERFORMNUD,AUTO_LINKLOCAL>
       media: Ethernet autoselect (10Gbase-SR <full-duplex>)
       status: active
       vlan: 10 vlanpcp: 0 parent interface: bxe0
       groups: vlan
vlan1: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> metric 0 mtu 1500
       options=303<RXCSUM,TXCSUM,TSO4,TSO6>
       ether 00:0a:f7:
       inet 169.168.20.20 netmask 0xffffff00 broadcast 169.168.20.255 zone 1
       inet 169.168.20.9 netmask 0xffffff00 broadcast 169.168.20.255 zone 0
       inet6 fe80::20a:f7: %vlan1 prefixlen 64 scopeid 0x9 zone 1
       nd6 options=21<PERFORMNUD,AUTO_LINKLOCAL>
       media: Ethernet autoselect (10Gbase-SR <full-duplex>)
       status: active
       vlan: 20 vlanpcp: 0 parent interface: bxe0
       groups: vlan
>> Verified that the VLANs and SSIPs are created and assigned after the reboot:
MN-X410-CLUS-1# uname -a
Isilon OneFS MN-X410-CLUS-1 v8.2.0.0 Isilon OneFS v8.2.0.0 B_8_2_0_0_011(RELEASE): 0x80200500000000B:Thu Jun 20 10:29:21 PDT 2019
    root@sea-build11-01:/b/mnt/obj/b/mnt/src/amd64.amd64/sys/IQ.amd64.release   FreeBSD clang version 3.9.1
(tags/RELEASE_391/final 289601) (based on LLVM 3.9.1) amd64


MN-X410-CLUS-1# ifconfig
bxe0: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> metric 0 mtu 1500
options=527bb<RXCSUM,TXCSUM,VLAN_MTU,VLAN_HWTAGGING,JUMBO_MTU,VLAN_HWCSUM,TSO4,TSO6,LRO,WOL_MAGIC,VLAN_HWFILTER,VLAN_HWTSO>
       ether 00:0a:f7:
       inet 169.168.1.20 netmask 0xffffff00 broadcast 169.168.1.255 zone 1
       inet 169.168.1.9 netmask 0xffffff00 broadcast 169.168.1.255 zone 0
       nd6 options=29<PERFORMNUD,IFDISABLED,AUTO_LINKLOCAL>
       media: Ethernet autoselect (10Gbase-SR <full-duplex>)
       status: active
.
.
vlan0: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> metric 0 mtu 1500
       options=303<RXCSUM,TXCSUM,TSO4,TSO6>
       ether 00:0a:f7:
       inet 169.168.10.20 netmask 0xffffff00 broadcast 169.168.10.255 zone 1
       inet 169.168.10.9 netmask 0xffffff00 broadcast 169.168.10.255 zone 0
       inet6 fe80::20a:f7: %vlan0 prefixlen 64 scopeid 0x8 zone 1
       nd6 options=21<PERFORMNUD,AUTO_LINKLOCAL>
       media: Ethernet autoselect (10Gbase-SR <full-duplex>)
       status: active
       vlan: 10 vlanpcp: 0 parent interface: bxe0
       groups: vlan
vlan1: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> metric 0 mtu 1500
       options=303<RXCSUM,TXCSUM,TSO4,TSO6>
       ether 00:0a:f7:
       inet 169.168.20.20 netmask 0xffffff00 broadcast 169.168.20.255 zone 1
       inet 169.168.20.9 netmask 0xffffff00 broadcast 169.168.20.255 zone 0
       inet6 fe80::20a:f7: %vlan1 prefixlen 64 scopeid 0x9 zone 1
       nd6 options=21<PERFORMNUD,AUTO_LINKLOCAL>
       media: Ethernet autoselect (10Gbase-SR <full-duplex>)
       status: active
       vlan: 20 vlanpcp: 0 parent interface: bxe0
       groups: vlan
>> The upgrade is in the pre-commit state:
MN-X410-CLUS-1# isi upgrade view

Upgrade Status:

Current Upgrade Activity: OneFS upgrade
   Cluster Upgrade State: Ready to commit
   Upgrade Process State: Running
      Upgrade Start Time: 2019-07-13T12:38:25
      Current OS Version: 8.0.0.6_build(117)style(5)
      Upgrade OS Version: 8.2.0.0_build(11)style(5)
        Percent Complete: 100%

Nodes Progress:

     Total Cluster Nodes: 1
       Nodes On Older OS: 0
          Nodes Upgraded: 1
Nodes Transitioning/Down: 0

LNN  Progress  Version  Status 
--------------------------------
1    100%      8.2.0.0  upgraded
>> Commit the upgrade:
MN-X410-CLUS-1# isi upgrade commit
You are about to COMMIT an upgrade, it CANNOT be rolled back after this, are you sure? (yes/[no]): yes
>> The SSIP is now missing on all VLAN interfaces; the non-VLAN interface is NOT affected:
MN-X410-CLUS-1# ifconfig
bxe0: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> metric 0 mtu 1500
options=527bb<RXCSUM,TXCSUM,VLAN_MTU,VLAN_HWTAGGING,JUMBO_MTU,VLAN_HWCSUM,TSO4,TSO6,LRO,WOL_MAGIC,VLAN_HWFILTER,VLAN_HWTSO>
       ether 00:0a:f7:
       inet 169.168.1.20 netmask 0xffffff00 broadcast 169.168.1.255 zone 1
       inet 169.168.1.9 netmask 0xffffff00 broadcast 169.168.1.255 zone 0
       nd6 options=29<PERFORMNUD,IFDISABLED,AUTO_LINKLOCAL>
       media: Ethernet autoselect (10Gbase-SR <full-duplex>)
       status: active
.
.
vlan0: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> metric 0 mtu 1500
       options=303<RXCSUM,TXCSUM,TSO4,TSO6>
       ether 00:0a:f7:
       inet 169.168.10.20 netmask 0xffffff00 broadcast 169.168.10.255 zone 1
       inet6 fe80::20a:f7: %vlan0 prefixlen 64 scopeid 0x8 zone 1
       nd6 options=21<PERFORMNUD,AUTO_LINKLOCAL>
       media: Ethernet autoselect (10Gbase-SR <full-duplex>)
       status: active
       vlan: 10 vlanpcp: 0 parent interface: bxe0
       groups: vlan
vlan1: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> metric 0 mtu 1500
       options=303<RXCSUM,TXCSUM,TSO4,TSO6>
       ether 00:0a:f7:
       inet 169.168.20.20 netmask 0xffffff00 broadcast 169.168.20.255 zone 1
       inet6 fe80::20a:f7: %vlan1 prefixlen 64 scopeid 0x9 zone 1
       nd6 options=21<PERFORMNUD,AUTO_LINKLOCAL>
       media: Ethernet autoselect (10Gbase-SR <full-duplex>)
       status: active
       vlan: 20 vlanpcp: 0 parent interface: bxe0
       groups: vlan
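
The before-and-after comparison above can be automated. The following is a minimal sketch, not a supported tool: it reads ifconfig output on standard input and prints every expected SSIP that no longer appears as an inet address. The function name missing_ssips is hypothetical, and the SSIPs to check are supplied by the caller.

```shell
#!/bin/sh
# Hypothetical helper (not part of OneFS): print each expected SSIP that is
# absent from the ifconfig output read on stdin.
# Usage: ifconfig | missing_ssips 169.168.1.9 169.168.10.9 169.168.20.9
missing_ssips() {
    # Collect the IPv4 addresses once. Every field is scanned for the "inet"
    # token so the match still works if each line carries a prefix.
    # "inet6" lines are ignored because the comparison is exact.
    present=$(awk '{ for (i = 1; i < NF; i++) if ($i == "inet") { print $(i + 1); break } }')
    for ssip in "$@"; do
        # -x requires a whole-line match; -F treats the address literally,
        # so 169.168.1.9 does not match 169.168.1.90.
        if ! printf '%s\n' "$present" | grep -qxF "$ssip"; then
            echo "$ssip"
        fi
    done
}
```

Fed the post-commit listing shown above together with the three SSIPs from this reproduction, the helper prints 169.168.10.9 and 169.168.20.9, which are the two addresses missing from vlan0 and vlan1.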

Cause

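After an upgrade to 8.2, the flexnet configuration file (flx_config.xml) is split into smaller, more narrowly scoped pieces of information. A new folder named nodeinfo is created, which holds interface and status information for each node. During the upgrade, however, VLAN information is not captured, so any VLAN configuration is left out of each node's nodeinfo file. After the upgrade is committed, SmartConnect attempts to read from the nodeinfo files and cannot find any VLAN details with which to assign the SSIP.
When isi_smartconnect_d is run with debug logging, the following errors appear in the log: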
2019-07-15-T12:46:12:DEBUG:0x80c612010:NodeInterfaceGetVlanNic_inlock():nodeinfo.c:1281: Error STATUS_NOT_FOUND (0xc0000225)
2019-07-15-T12:46:12:DEBUG:0x80c612010:NodeInterfaceIsStatus():nodeinfo.c:1385: Error STATUS_NOT_FOUND (0xc0000225)
2019-07-15-T12:46:12:DEBUG:0x80c612010:VIPLoadInterface():vip_coord.c:480: Error STATUS_NETWORK_UNREACHABLE (0xc000023c)
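
To check a saved log for these signatures, something like the following sketch could be used. The function name count_ssip_errors is hypothetical, and because this article does not state where the SmartConnect log is written, the log file path is an assumption passed in by the caller.

```shell
#!/bin/sh
# Hypothetical helper: count log lines that carry the error signatures
# quoted above. $1 is the path to a SmartConnect debug log; its location is
# not given in this article, so the caller must supply it.
count_ssip_errors() {
    # -c prints the number of matching lines; -E enables alternation.
    grep -cE '(NodeInterfaceGetVlanNic_inlock|NodeInterfaceIsStatus|VIPLoadInterface)\(\).*Error STATUS_(NOT_FOUND|NETWORK_UNREACHABLE)' "$1"
}
```

A non-zero count only indicates that the same messages are present; it does not by itself confirm this issue.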

Resolution

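This issue is observed only when upgrading to 8.2.0. It is resolved in 8.2.1.0 and later, so upgrades to any version later than 8.2.0 are not affected. If you suspect that you are affected by this issue, contact Support and reference this knowledge base article before proceeding with the steps below.
  1. To resolve this issue, a change to the flx_config.xml and nodeinfo configuration must be forced. This can be triggered by enabling and then disabling SmartConnect debug logging.
To enable debug logging: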
# isi_sc_log_level -l debug
To change logging back to info:
# isi_sc_log_level -l info
Verify that the IP addresses have returned and that the processes are running:
# isi_for_array ifconfig | grep <SSIP>
# isi_for_array -s ps auwx | egrep "(smartconnect|dnsiq)" | grep -v grep
Note: Restarting the daemons does not resolve this issue. The daemons include isi_dnsiq_d, isi_smartconnect_d, and isi_flexnet_d.
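
The workaround steps can be strung together as in the sketch below. It uses only the commands listed in this article; the wrapper function force_ssip_refresh is hypothetical, and the sketch runs the steps back to back because the article does not say whether a pause is needed between them.

```shell
#!/bin/sh
# Hypothetical wrapper around the workaround steps above.
# $1 is the SSIP expected to reappear, for example 169.168.10.9.
force_ssip_refresh() {
    ssip=$1
    # Toggle SmartConnect logging to debug and back to info to force the
    # configuration change described in step 1.
    isi_sc_log_level -l debug || return 1
    isi_sc_log_level -l info || return 1
    # Check every node for the SSIP. -w avoids matching a longer address
    # such as 169.168.10.90; -F treats the dots literally.
    if isi_for_array ifconfig | grep -wF -- "$ssip"; then
        echo "SSIP $ssip is assigned again"
    else
        echo "SSIP $ssip is still missing" >&2
        return 1
    fi
}
```

A call such as force_ssip_refresh 169.168.10.9 would be run once per affected SSIP, as root on a cluster node.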

Products

Isilon Gen6, Isilon HD400, Isilon NL410, Isilon X210, Isilon X410
Article Properties
Article Number: 000168627
Article Type: Solution
Last Modified: 14 Dec 2023
Version:  4