PowerFlex Manager — SO 节点更新在功能 update_clc_node_agent 中失败
Summary: PFxM 无法在“update_clc_node_agent”功能期间升级仅存储 (SO) 节点,导致在尝试将 SDS 放入 PMM 时升级作暂停。
This article applies to
This article does not apply to
This article is not tied to any specific product.
Not all product versions are identified in this article.
Symptoms
情况说明
- 环境:高可用性 (HA) CloudLink Center 设备
- 问题:SO 资源组仅列出两个预期 CLC VM 中的一个。
- 症状:升级 SO 节点失败,理由是节点不在 PMM 中。
此情形表示升级失败的触发点。下面是一个示例,说明了在运行 update_clc_node_agent 功能:
运行状况良好的堆栈示例:
日志位置:Job-afe400aa-d7fe-4897-9a04-fe08b924c4ae-0-1/deployment.logs
DEBUG [2024-12-16T11:20:36.199529] 13742: service_deployment.rb:5348:in `process_firmware_update': Processing firmware update after selecting resources DEBUG [2024-12-16T11:20:36.200310] 13742: service_deployment.rb:5353:in `block in process_firmware_update': Processing firmware update on rackserver-xxxxxxx INFO [2024-12-16T11:20:36.201536] 13742: service_deployment.rb:5363:in `block in process_firmware_update': Updating CLC Agent update on vmcl01-esxi08.dell.lab DEBUG [2024-12-16T11:20:36.201933] 13742: service_deployment.rb:5365:in `block in process_firmware_update': Updating CLC Agent version on node svm-vmcl01-esxi08 DEBUG [2024-12-16T11:20:36.202379] 13742: type/base.rb:412:in `delegate': service_deployment.rb:5366:in `block in process_firmware_update' calling delegated method update_clc_node_agent on # DEBUG [2024-12-16T11:20:36.204979] 13742: type/base.rb:412:in `delegate': cloudlinkcenter.rb:205:in `clc_agent_info' calling delegated method os_connect_ip on # DEBUG [2024-12-16T11:20:38.054169] 13742: type/base.rb:412:in `delegate': cloudlinkcenter.rb:742:in `update_clc_node_agent' calling delegated method os_connect_ip on # DEBUG [2024-12-16T11:20:38.760221] 13742: provider/cloudlink/cloudlinkcenter.rb:747:in `update_clc_node_agent': clc-10.10.30.20: CLC Server and agent are running on same version 7.1 (build 140) INFO [2024-12-16T11:20:38.760840] 13742: service_deployment.rb:5367:in `block in process_firmware_update': Competed CLC agent update on vmcl01-esxi08.dell.lab
相反,运行状况不佳的堆栈会显示以下错误:NoMethodError: undefined method '[]' for nil:NilClass
日志位置:Job-afe400aa-d7fe-4897-9a04-fe08b924c4ae-0-1/deployment.logs
DEBUG [2024-12-19T13:35:48.462150] 19552: service_deployment.rb:5348:in `process_firmware_update': Processing firmware update after selecting resources DEBUG [2024-12-19T13:35:48.462349] 19552: service_deployment.rb:5353:in `block in process_firmware_update': Processing firmware update on rackserver-xxxxxxx INFO [2024-12-19T13:35:48.463044] 19552: service_deployment.rb:5363:in `block in process_firmware_update': Updating CLC Agent update on PFSON04 DEBUG [2024-12-19T13:35:48.463276] 19552: service_deployment.rb:5365:in `block in process_firmware_update': Updating CLC Agent version on node PFSON04 DEBUG [2024-12-19T13:35:48.463622] 19552: type/base.rb:412:in `delegate': service_deployment.rb:5366:in `block in process_firmware_update' calling delegated method update_clc_node_agent on # DEBUG [2024-12-19T13:35:48.466045] 19552: type/base.rb:412:in `delegate': cloudlinkcenter.rb:205:in `clc_agent_info' calling delegated method os_connect_ip on # DEBUG [2024-12-19T13:35:51.089302] 19552: type/base.rb:412:in `delegate': cloudlinkcenter.rb:742:in `update_clc_node_agent' calling delegated method os_connect_ip on # ERROR [2024-12-19T13:35:51.093230] 19552: service_deployment.rb:5535:in `process_firmware_update': Encountered an error during firmware update: NoMethodError: undefined method `[]' for nil:NilClass
此外,升级作业日志还会捕获任务失败的确切时刻:
日志位置:Job-afe400aa-d7fe-4897-9a04-fe08b924c4ae-0-1/deployment.logs
DEBUG [2024-12-19T13:37:23.210005] 19552: service_deployment.rb:6485:in `finalize_firmware_update': Update complete: false, in protected maintenance mode false ERROR [2024-12-19T13:37:23.210184] 19552: service_deployment.rb:6491:in `finalize_firmware_update': Failed to update the server! INFO [2024-12-19T13:37:23.210321] 19552: service_deployment.rb:6496:in `finalize_firmware_update': Firmware update status: Error ERROR [2024-12-19T13:37:23.216294] 19552: service_deployment.rb:622:in `process': Firmware update failed for Job-afe400aa-d7fe-4897-9a04-fe08b924c4ae-0-2 ERROR [2024-12-19T13:37:23.216535] 19552: service_deployment.rb:623:in `process': ["/opt/asm-deployer/lib/asm/service_deployment.rb:6500:in `finalize_firmware_update'", "/opt/asm-deployer/lib/asm/service_deployment.rb:5549:in `process_firmware_update'", "/opt/asm-deployer/lib/asm/service_deployment.rb:479:in `process'", "/opt/asm-deployer/lib/asm.rb:228:in `block in process_deployment'"] INFO [2024-12-19T13:37:23.216961] 19552: service_deployment.rb:625:in `process': Status: Error
影响
无法升级 SO 节点。
Cause
日志指示 PowerFlex Manager无法继续执行update_clc_node_agent任务,因为它无法在两个 CLC 设备中识别正确的“主”。这显示在 deployment.logs 的错误行中:
ERROR [2024-12-19T13:35:51.093230] 19552: service_deployment.rb:5535:in `process_firmware_update': Encountered an error during firmware update: NoMethodError: undefined method `[]' for nil:NilClass
Resolution
-
尝试更新服务详细信息作
-
在受影响的服务上启动“Update Service Details”作。
-
-
在向导中验证资源清册摘要
- 在此过程中,向导应显示资源清册摘要,指示正在移除一个 CLC 设备,并正在添加另一个 CLC 设备。
- 这将确认当前 CLC 不是主 CLC,并且要添加的设备是正确的主
-
完成更新服务详细信息流程
- 按照向导的指导完成Update Service Details作。
-
继续升级
- 重试升级。它现在应该可以正常进行。
受影响的版本
PowerFlex Manager 3.x
Affected Products
PowerFlex appliance connectivityProducts
PowerFlex rack, VxFlex Ready Nodes, PowerFlex custom node, PowerFlex appliance R650, PowerFlex appliance R6525, PowerFlex appliance R660, PowerFlex appliance R6625, Powerflex appliance R750, PowerFlex appliance R760, PowerFlex appliance R7625
, PowerFlex appliance R640, PowerFlex appliance R740XD, PowerFlex appliance R7525, PowerFlex appliance R840
...
Article Properties
Article Number: 000275691
Article Type: Solution
Last Modified: 22 Jan 2025
Version: 1
Find answers to your questions from other Dell users
Support Services
Check if your device is covered by Support Services.