PowerFlex Manager - SO Node Update Fails in the update_clc_node_agent Function

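Summary: PFxM cannot upgrade storage-only (SO) nodes during the "update_clc_node_agent" function, so the upgrade job stops while attempting to place the SDS into protected maintenance mode (PMM).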


Symptoms

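Scenario

  • Environment: High-availability (HA) CloudLink Center appliances
  • Issue: The SO resource group lists only one of the two expected CLC VMs.
  • Symptom: The SO node upgrade fails because the node is not in PMM.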

 

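This condition is the trigger point for the upgrade failure. The following shows what a run of update_clc_node_agent looks like:

Healthy stack example:

Log location: Job-afe400aa-d7fe-4897-9a04-fe08b924c4ae-0-1/deployment.logs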

DEBUG [2024-12-16T11:20:36.199529] 13742: service_deployment.rb:5348:in `process_firmware_update': Processing firmware update after selecting resources
DEBUG [2024-12-16T11:20:36.200310] 13742: service_deployment.rb:5353:in `block in process_firmware_update': Processing firmware update on rackserver-xxxxxxx
INFO  [2024-12-16T11:20:36.201536] 13742: service_deployment.rb:5363:in `block in process_firmware_update': Updating CLC Agent update on vmcl01-esxi08.dell.lab
DEBUG [2024-12-16T11:20:36.201933] 13742: service_deployment.rb:5365:in `block in process_firmware_update': Updating CLC Agent version on node svm-vmcl01-esxi08
DEBUG [2024-12-16T11:20:36.202379] 13742: type/base.rb:412:in `delegate': service_deployment.rb:5366:in `block in process_firmware_update' calling delegated method update_clc_node_agent on #
DEBUG [2024-12-16T11:20:36.204979] 13742: type/base.rb:412:in `delegate': cloudlinkcenter.rb:205:in `clc_agent_info' calling delegated method os_connect_ip on #
DEBUG [2024-12-16T11:20:38.054169] 13742: type/base.rb:412:in `delegate': cloudlinkcenter.rb:742:in `update_clc_node_agent' calling delegated method os_connect_ip on #
DEBUG [2024-12-16T11:20:38.760221] 13742: provider/cloudlink/cloudlinkcenter.rb:747:in `update_clc_node_agent': clc-10.10.30.20: CLC Server and agent are running on same version 7.1 (build 140)
INFO  [2024-12-16T11:20:38.760840] 13742: service_deployment.rb:5367:in `block in process_firmware_update': Competed CLC agent update on vmcl01-esxi08.dell.lab

In contrast, an unhealthy stack shows the following error:
NoMethodError: undefined method `[]' for nil:NilClass
Log location: Job-afe400aa-d7fe-4897-9a04-fe08b924c4ae-0-1/deployment.logs

DEBUG [2024-12-19T13:35:48.462150] 19552: service_deployment.rb:5348:in `process_firmware_update': Processing firmware update after selecting resources
DEBUG [2024-12-19T13:35:48.462349] 19552: service_deployment.rb:5353:in `block in process_firmware_update': Processing firmware update on rackserver-xxxxxxx
INFO  [2024-12-19T13:35:48.463044] 19552: service_deployment.rb:5363:in `block in process_firmware_update': Updating CLC Agent update on PFSON04
DEBUG [2024-12-19T13:35:48.463276] 19552: service_deployment.rb:5365:in `block in process_firmware_update': Updating CLC Agent version on node PFSON04
DEBUG [2024-12-19T13:35:48.463622] 19552: type/base.rb:412:in `delegate': service_deployment.rb:5366:in `block in process_firmware_update' calling delegated method update_clc_node_agent on #
DEBUG [2024-12-19T13:35:48.466045] 19552: type/base.rb:412:in `delegate': cloudlinkcenter.rb:205:in `clc_agent_info' calling delegated method os_connect_ip on #
DEBUG [2024-12-19T13:35:51.089302] 19552: type/base.rb:412:in `delegate': cloudlinkcenter.rb:742:in `update_clc_node_agent' calling delegated method os_connect_ip on #
ERROR [2024-12-19T13:35:51.093230] 19552: service_deployment.rb:5535:in `process_firmware_update': Encountered an error during firmware update: NoMethodError: undefined method `[]' for nil:NilClass
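
The difference between the two excerpts above can be checked mechanically. The following is a minimal Python sketch, not a Dell-provided tool: it assumes the `LEVEL [timestamp] pid: message` line layout seen in the excerpts, and the function name `find_clc_agent_failures` is hypothetical. It reports the timestamp of any NoMethodError that is logged after update_clc_node_agent is called and before the agent update is reported as finished.

```python
import re
from typing import Dict, Iterable, List

# Line layout assumed from the excerpts: LEVEL [timestamp] pid: message
LOG_LINE = re.compile(
    r"^(?P<level>[A-Z]+)\s+\[(?P<ts>[^\]]+)\]\s+(?P<pid>\d+):\s+(?P<msg>.*)$"
)
# The healthy log spells this "Competed" (sic); accept "Completed" as well.
DONE = re.compile(r"Compl?eted CLC agent update")


def find_clc_agent_failures(lines: Iterable[str]) -> List[str]:
    """Return timestamps of NoMethodError lines raised during update_clc_node_agent."""
    hits: List[str] = []
    in_agent_update: Dict[str, bool] = {}  # pid -> inside update_clc_node_agent
    for line in lines:
        m = LOG_LINE.match(line.strip())
        if m is None:
            continue  # skip lines that do not follow the assumed layout
        pid, msg = m["pid"], m["msg"]
        if "calling delegated method update_clc_node_agent" in msg:
            in_agent_update[pid] = True
        elif DONE.search(msg):
            in_agent_update[pid] = False
        elif (
            m["level"] == "ERROR"
            and in_agent_update.get(pid, False)
            and "NoMethodError" in msg
            and "nil:NilClass" in msg
        ):
            hits.append(m["ts"])
            in_agent_update[pid] = False
    return hits
```

To use it, pass the lines of a deployment.logs file, for example `find_clc_agent_failures(open(path, errors="replace"))`, where `path` points at the job's deployment.logs. An empty result means the signature was not found; it does not by itself prove the stack is healthy.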

In addition, the upgrade job log captures the exact moment the job fails:

Log location: Job-afe400aa-d7fe-4897-9a04-fe08b924c4ae-0-1/deployment.logs

DEBUG [2024-12-19T13:37:23.210005] 19552: service_deployment.rb:6485:in `finalize_firmware_update': Update complete: false, in protected maintenance mode false
ERROR [2024-12-19T13:37:23.210184] 19552: service_deployment.rb:6491:in `finalize_firmware_update': Failed to update the server!
INFO  [2024-12-19T13:37:23.210321] 19552: service_deployment.rb:6496:in `finalize_firmware_update': Firmware update status: Error
ERROR [2024-12-19T13:37:23.216294] 19552: service_deployment.rb:622:in `process': Firmware update failed for Job-afe400aa-d7fe-4897-9a04-fe08b924c4ae-0-2
ERROR [2024-12-19T13:37:23.216535] 19552: service_deployment.rb:623:in `process': ["/opt/asm-deployer/lib/asm/service_deployment.rb:6500:in `finalize_firmware_update'", "/opt/asm-deployer/lib/asm/service_deployment.rb:5549:in `process_firmware_update'", "/opt/asm-deployer/lib/asm/service_deployment.rb:479:in `process'", "/opt/asm-deployer/lib/asm.rb:228:in `block in process_deployment'"]
INFO  [2024-12-19T13:37:23.216961] 19552: service_deployment.rb:625:in `process': Status: Error
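
The finalize_firmware_update line in the excerpt above carries two flags: whether the update completed and whether the node was in protected maintenance mode. As a rough illustration, the Python sketch below extracts both from such a line. The helper name `parse_finalize_status` is hypothetical, and the assumption that a successful run prints `true` in the same position is not confirmed by the logs shown here, which only contain `false`.

```python
import re
from typing import Optional, Tuple

# Pattern taken from the DEBUG line in the excerpt; the "true" alternative
# is an assumption, since only "false" appears in the logs shown.
FINALIZE = re.compile(
    r"finalize_firmware_update': Update complete: (true|false), "
    r"in protected maintenance mode (true|false)"
)


def parse_finalize_status(line: str) -> Optional[Tuple[bool, bool]]:
    """Return (update_complete, in_pmm) for a finalize line, or None if no match."""
    m = FINALIZE.search(line)
    if m is None:
        return None
    return m.group(1) == "true", m.group(2) == "true"
```

For the failing job in this article both values are false, which matches the symptom that the node never entered PMM.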

 

Impact

SO nodes cannot be upgraded.

Cause

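The logs show that PowerFlex Manager cannot proceed with the update_clc_node_agent task because it cannot identify the correct primary among the two CLC appliances. This appears in the following error line in deployment.logs: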

ERROR [2024-12-19T13:35:51.093230] 19552: service_deployment.rb:5535:in `process_firmware_update': Encountered an error during firmware update: NoMethodError: undefined method `[]' for nil:NilClass

Resolution

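  1. Run the Update Service Details action

    1. Start the "Update Service Details" action on the affected service.

  2. Confirm the inventory summary in the wizard

    1. During this process, the wizard should display an inventory summary indicating that one CLC appliance is being removed and another is being added.
    2. This confirms that the current CLC is not the primary and that the appliance being added is the correct primary CloudLink.

  3. Complete the Update Service Details procedure

    1. Follow the wizard's prompts to complete the "Update Service Details" action.

  4. Continue the upgrade

    1. Retry the upgrade. It should now proceed without issues.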

Affected Versions

PowerFlex Manager 3.x

Affected Products

PowerFlex appliance connectivity

Products

PowerFlex rack, VxFlex Ready Nodes, PowerFlex custom node, PowerFlex appliance R650, PowerFlex appliance R6525, PowerFlex appliance R660, PowerFlex appliance R6625, PowerFlex appliance R750, PowerFlex appliance R760, PowerFlex appliance R7625, PowerFlex appliance R640, PowerFlex appliance R740XD, PowerFlex appliance R7525, PowerFlex appliance R840 ...
Article Properties
Article Number: 000275691
Article Type: Solution
Last Modified: 22 Jan 2025
Version:  1