IDPA:PowerProtect DP Rapid Upgrade Checker 回報韌體升級預先檢查失敗
Summary: PowerProtect Data Protection (DP) 系列裝置和 IDPA:PowerProtect DP Rapid Upgrade Checker (RUCK) 回報韌體升級預先檢查失敗。 The cached response with Node Event Service is disabled.節點事件服務處於降級狀態,iDRAC Service Module 目前不可用/使用中。檢查 iDRAC Service Module/iDRAC 狀態。 這可能是因為 PCIe 固態硬碟插槽 4 中的 NVMe PM1735 卡韌體過期所造成。 ...
Symptoms
PowerProtect Data Protection Rapid Upgrade Checker (RUCK) 失敗,並傳回下列錯誤:
+-----------+--------------------+--------+-----------------------------------------------+-----------------------------------------------+ | Component | Check | Status | Message | Remedy | +-----------+--------------------+--------+-----------------------------------------------+-----------------------------------------------+ | ESXi | firmware_readiness | FAILED | Firmware pre-upgrade checks failed. [ xxx.xxx.| Use KB https://www.dell.com/support/kbdoc/191 | | | | | xxx.xxx ]: The cached response with Node Event| 627 to fix the issue. | | | | | Service is disabled. Node Event Service is in | | | | | | a degraded state,iDRAC Service Module is not | | | | | | available/active at this time. Check iDRAC S | | | | | | ervice Module/iDRAC status. | | +-----------+--------------------+--------+-----------------------------------------------+-----------------------------------------------+
[ERROR] Firmware pre-upgrade checks failed. [ <ESXi IP Address> ]: The cached response with Node Event Service is disabled. Node Event Service is in a degraded state, iDRAC Service Module is not available/active at this time. Check iDRAC Service Module/iDRAC status.
插槽 4 中的 PCIe SSD 硬體型號為 Dell Enterprise NVMe AGN MU AIC 1.6 TB 卡 (NVMe PM1735),韌體版本低於 2.3.0。
您可以在 iDRAC 上找到 PCIe SSD 卡型號和韌體版本。系統 > 實體磁碟 > PCIe 裝置 - 插槽 4 磁碟 1
中的 PCIe SSD 在此範例中,PCIe SSD 卡為 PM1725b。因此,因應措施不適用於此情況:

圖 1:如何從 iDRAC 使用者介面找到 PCIe SSD 卡。
在 Dell iDRAC 技術支援報告 (TSR) 記錄中,PCIe 固態硬碟卡命名為「Dell Ent NVMe AGN MU AIC 1.6TB」:

圖 2:在 TSR 記錄中,NVMe PM1735 卡被命名為 Dell Ent NVMe AGN MU AIC 1.6 TB。
iDRAC Service Module (iSM) 在 iDRAC 使用者介面中顯示為執行中:

圖 3:iSM 狀態正在執行中。
Dell PT 代理程式和 iSM 正在執行,且在重新開機後問題仍然存在:
[root@ESXi:/opt/dell/DellPTAgent/tools] /etc/init.d/DellPTAgent status DellPTAgent is running [root@ESXi:/opt/dell/DellPTAgent/tools] /etc/init.d/DellPTAgent stop Stopping DellPTAgent... watchdog-DellPTAgent: Terminating watchdog process with PID 7447869 DellPTAgent stopped [root@ESXi:/opt/dell/DellPTAgent/tools] /etc/init.d/DellPTAgent start Starting DellPTAgent... DellPTAgent started [root@ESXi:/opt/dell/DellPTAgent/tools] /etc/init.d/DellPTAgent status DellPTAgent is running [root@ESXi:/opt/dell/DellPTAgent/tools] [root@ESXi:/opt/dell/DellPTAgent/tools] /etc/init.d/dcism-netmon-watchdog status iSM is active (running) [root@ESXi:/opt/dell/DellPTAgent/tools] /etc/init.d/dcism-netmon-watchdog stop Module dcism_module successfully unloaded [root@ESXi:/opt/dell/DellPTAgent/tools] /etc/init.d/dcism-netmon-watchdog start [root@ESXi:/opt/dell/DellPTAgent/tools] /etc/init.d/dcism-netmon-watchdog status iSM is active (running) [root@ESXi:/opt/dell/DellPTAgent/tools]
Cause
在 ACM 中,顯示 iDRAC 快取已降級:
ACM:/tmp # dpacli -agentinfo
{"Agent Info": [{"ESXiHost-xxx.xxxx.xxx.xxx": {
"ptagentversion": "2.4.1-3",
"uptime": "54 seconds ( 54 seconds )",
"ism_version": "3.6.0",
"system_uuid": "618axxxx-xxxx-xxxx-xxxx-xxxxxxxxdd70",
"process_id": "7440642",
"host_epoch_time": "1646617738.156490 (secs.usecs)",
"model": "DP4400",
"name": "<ESXi_hostname>",
"mfr": "Dell Inc.",
"domain": "<ESXi FQDN>",
"servicetag": "XXXXXXX",
"os": "VMWare ESXi",
"os_version": "6.7.0 build-17700523",
"rest_endpoints": "https://127.0.0.1:8086,https://192.168.100.101:8086",
"api_blocking_enabled": "false",
"TPM Present": "false",
"MarvellLibraryVersion": "5.0.13.1109",
"libstorelib.so": "07.07",
"libstorelibir-3.so": "15.03-0",
"libstorelibit.so": "07.05",
"idrac_pass_thru_ip": "169.254.0.1",
"idrac_ethernet_ip": "xx.xxx.xxx.x",
"host_pass_thru_ip": "169.254.0.2",
"default_server_cert": "true",
"status": {
"idraccache": "Degraded",
"idracConnection": "OK",
"iSM": "N/A",
"agent": "Degraded"
}
}}]}
ACM:/tmp #
從 ESXi 提取時,會看到相同的資訊:
[root@ESXi:~] /opt/dell/DellPTAgent/tools/pta_call get agent/info
Request sent to DellPTAgent @ https://192.168.100.101:8086
{
"TPM Present": "false",
"MarvellLibraryVersion": "5.0.13.1109",
"uptime": "344 seconds ( 5 minutes 44 seconds )",
"system_uuid": "618axxxx-xxxx-xxxx-xxxx-xxxxxxxxdd70",
"host_pass_thru_ip": "169.254.0.2",
"servicetag": "XXXXXXX",
"domain": "<ESXi FQDN>",
"default_server_cert": "true",
"libstorelibir-3.so": "15.03-0",
"model": "DP4400",
"idrac_ethernet_ip": "xxx.xxx.xxx.xxx",
"os": "VMWare ESXi",
"rest_endpoints": "https://127.0.0.1:8086,https://192.168.100.101:8086",
"mfr": "Dell Inc.",
"api_blocking_enabled": "false",
"libstorelib.so": "07.07",
"ptagentversion": "2.4.1-3",
"ism_version": "3.6.0",
"host_epoch_time": "1646618523.10088 (secs.usecs)",
"os_version": "6.7.0 build-17700523",
"libstorelibit.so": "07.05",
"idrac_pass_thru_ip": "169.254.0.1",
"name": "<ESXi_hostname>",
"process_id": "7441193",
"status": {
"idraccache": "Degraded",
"idracConnection": "OK",
"agent": "Degraded",
"iSM": "N/A"
}
}
Response: status: 200 [OK], size: 1067 bytes, latency: 0.145 seconds.
[root@ESXi:~]
在這種情況下,PT 代理程式無法在 hwInventory 因為它無法查詢插槽 4 中 NVMe PM1735 卡的資料。
在 ESXi /scratch/log/pta_debug.log 中,會顯示錯誤訊息:
2022/03/07 04:51:03[UTC] [7459005:193227584] WARN - WSManClient::isValidResponse: Http request to host: 169.254.0.1, failed with status code: -5
2022/03/07 04:51:03[UTC] [7459005:193227584] WARN - getComponentFromInventory: Cannot find component 'PhysicalDisk' in hwInventory:
{
"Message": "Read timed out!",
"ReturnValue": -5
}
2022/03/07 04:52:57[UTC] [7459005:189000512] WARN - NvmeDiscoveredDevice::setIntAttrFromDescriptor: Cannot convert string <ie> to int.
2022/03/07 04:52:57[UTC] [7459005:189000512] WARN - NvmeDiscoveredDevice::setIntAttrFromDescriptor: Failed to read <bay> from <PCIe Slot 4>
因此,建議您將 NVMe 1735 韌體升級至 2.3.0 版,此版本已納入「PowerProtect Data Protection Appliance-IDPA Firmware Gen14 Dec 2021」中。所需的韌體檔案為: Express-Flash-PCIe-SSD_Firmware_RP8RC_WN64_2.3.0_A03.EXE
Resolution
若要將 NVMe 1735 韌體升級至 2.3.0 版:
-
從 Dell 支援 Web 下載「PowerProtect Data Protection 系列應用裝置-IDPA 韌體 Gen14 Dec 2021」,然後解壓縮所需的韌體檔案
Express-Flash-PCIe-SSD_Firmware_RP8RC_WN64_2.3.0_A03.EXE。 -
從 ACM Web 使用者介面關閉 Integrated Data Protection Appliance。
-
確認 ESXi 已關閉電源後,請登入 iDRAC 使用者介面,然後將韌體檔案手動上傳至 iDRAC:

圖 4:如何在 iDRAC 使用者介面中上傳韌體 -
選取
Express-Flash-PCIe-SSD_Firmware_RP8RC_WN64_2.3.0_A03.EXE檔,然後按兩下 安裝並重新啟動。接著 iDRAC 會安裝韌體並重新開機。
圖 5:如何套用韌體 -
重新開機後,可在 iDRAC >系統 > 清查 >韌體清查中確認韌體版本:

圖 6:如何從韌體清查中檢閱韌體 - 監控 DP4400 啟動,以及虛擬機器成功啟動之後
- 執行
dpacli在 ACM 中再次執行命令,並確認idraccache且代理程式狀態應變更為 確定:
ACM:~ # dpacli -agentinfo
{"Agent Info": [{"ESXiHost-xxx.xxxx.xxx.xxx": {
"ptagentversion": "2.4.1-3",
"uptime": "595 seconds ( 9 minutes 55 seconds )",
"ism_version": "3.6.0",
"system_uuid": "618axxxx-xxxx-xxxx-xxxx-xxxxxxxxdd70",
"process_id": "2100391",
"host_epoch_time": "1646800495.522825 (secs.usecs)",
"model": "DP4400",
"name": "<ESXi_hostname>",
"mfr": "Dell Inc.",
"domain": "<ESXi FQDN>",
"servicetag": "XXXXXXX",
"os": "VMWare ESXi",
"os_version": "6.7.0 build-17700523",
"rest_endpoints": "https://127.0.0.1:8086,https://192.168.100.101:8086",
"api_blocking_enabled": "false",
"TPM Present": "false",
"MarvellLibraryVersion": "5.0.13.1109",
"libstorelib.so": "07.07",
"libstorelibir-3.so": "15.03-0",
"libstorelibit.so": "07.05",
"idrac_pass_thru_ip": "169.254.0.1",
"idrac_ethernet_ip": "xxx.xxx.xxx.xxx",
"host_pass_thru_ip": "169.254.0.2",
"default_server_cert": "true",
"status": {
"idraccache": "OK",
"idracConnection": "OK",
"iSM": "N/A",
"agent": "OK"
}
}}]}
ACM:~ #
-
再次執行執行狀況檢查 RUCK 工具,確認問題是否已解決。
Additional Information
「PowerProtect DP 系列應用裝置-IDPA 韌體 Gen14 Dec 2021」韌體套件可從 Dell 支援網頁下載:
圖 7:「PowerProtect DP 系列應用裝置-IDPA 韌體 Gen14 2021 年 12 月」韌體可從 Dell 支援網頁取得。
若要從下列位置擷取韌體: IDPA_Firmware_Gen14_Update_Dec_2021_Package.zip,解壓縮所有套裝檔案:「dpafw-2.312.rpm」 > 「dpafw-2.312-0.noarch.cpio」 > 「IDPA-2.312-2.312.tar.gz」 > 「IDPA-2.312-2.312.tar」,>則可在 DUPS > 下找到韌體檔案 NVME:
圖 8:從「PowerProtect DP 系列應用裝置-IDPA 韌體 Gen14 2021 年 12 月」韌體套件解壓縮 NVMe PM1735 韌體的範例。