ECS: xDoctor: RAP015: Fan Failure

Summary: A fan on a node is reporting a failed state.


Symptoms

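A fan on the node has stopped reporting that it is operating normally.
The fan reports a failed state, which may cause the node to overheat if the issue is not resolved.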
----------------------
CRITICAL - Fan Failure
----------------------
Node      = 169.254.1.3
Extra     = {'node': '169.254.1.3', 'item': 'Fan1', 'status': 'CRIT', 'fan': 'System Board', 'info': '0 RPM, below critical threshold'}
RAP       = RAP015
Solution  = KB 470284
Timestamp = 2023-08-22_102927
PSNT      = CKMxxxxxxxxxxx @ 4.8-92.0
----------------------
CRITICAL - Fan Failure
----------------------
Node      = 169.254.1.3
Extra     = {'node': '169.254.1.3', 'item': 'Fan Redundancy', 'status': 'FAIL', 'fan': 'System Board', 'info': 'not redundant, redundancy lost'}
RAP       = RAP015
Solution  = KB 470284
Timestamp = 2023-08-22_102927
PSNT      = CKMxxxxxxxxxxx @ 4.8-92.0
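
When collecting these alerts for a service request, the `Extra` field can be read programmatically, since in the sample above it is formatted like a Python dictionary literal. The following is a minimal sketch under that assumption; the helper name `parse_extra` is hypothetical and is not part of xDoctor.

```python
import ast

# One "Extra" line copied from the xDoctor alert shown above.
alert_line = (
    "Extra     = {'node': '169.254.1.3', 'item': 'Fan1', 'status': 'CRIT', "
    "'fan': 'System Board', 'info': '0 RPM, below critical threshold'}"
)

def parse_extra(line):
    """Return the dictionary carried by an 'Extra = {...}' alert line.

    Assumes the payload is a valid Python literal, as in the sample alert.
    """
    _, _, payload = line.partition("=")      # split on the first '=' only
    return ast.literal_eval(payload.strip()) # safe evaluation of literals

extra = parse_extra(alert_line)
print(f"{extra['node']}: {extra['item']} is {extra['status']} ({extra['info']})")
# prints: 169.254.1.3: Fan1 is CRIT (0 RPM, below critical threshold)
```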

Cause

A moving component in the fan has failed because of an electrical fault or age.

Resolution

Connect to the reported node and confirm the issue:
Use cs_hal to confirm the fan status on the reported node.

Issue the command: # cs_hal sensors fan
 
Example of healthy fans:
admin@node3:~> cs_hal sensors fan
Entity            Type                Label             Status  Info
-----             -----               -----             -----   -----
System Board      Fan                 Fan1              OK      12600 RPM
System Board      Fan                 Fan2              OK      12600 RPM
System Board      Fan                 Fan3              OK      16920 RPM
System Board      Fan                 Fan4              OK      16920 RPM
System Board      Fan                 Fan5              OK      17040 RPM
System Board      Fan                 Fan6              OK      17040 RPM
System Board      Fan                 Fan Redundancy    OK      fully redundant;

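As the failed-fan example below shows, the remaining fans speed up to produce more airflow and keep the node cool until the failed fan is replaced.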

Example of a failed fan:

admin@node3:~> cs_hal sensors fan
Entity            Type                Label             Status  Info
-----             -----               -----             -----   -----
System Board      Fan                 Fan1              CRIT    0 RPM; below critical threshold
System Board      Fan                 Fan2              OK      13440 RPM
System Board      Fan                 Fan3              OK      20640 RPM
System Board      Fan                 Fan4              OK      20880 RPM
System Board      Fan                 Fan5              OK      20880 RPM
System Board      Fan                 Fan6              OK      20640 RPM
System Board      Fan                 Fan Redundancy    FAIL    not redundant; redundancy lost;
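
If the check has to be repeated across several nodes, the table can be filtered for rows whose Status is not OK. The following is a rough sketch that assumes the column layout shown in the sample output above (columns separated by two or more spaces); the function name `unhealthy_fans` is hypothetical, and the sample text is an excerpt of the output above.

```python
import re

# Excerpt of the "cs_hal sensors fan" output shown above.
SAMPLE = """\
Entity            Type                Label             Status  Info
-----             -----               -----             -----   -----
System Board      Fan                 Fan1              CRIT    0 RPM; below critical threshold
System Board      Fan                 Fan2              OK      13440 RPM
System Board      Fan                 Fan Redundancy    FAIL    not redundant; redundancy lost;
"""

def unhealthy_fans(text):
    """Return (label, status, info) for every row whose status is not OK."""
    rows = []
    for line in text.splitlines():
        # Columns are separated by runs of two or more spaces.
        fields = re.split(r"\s{2,}", line.strip(), maxsplit=4)
        # Skip the header, the dashed rule, prompts, and blank lines.
        if len(fields) != 5 or fields[0] == "Entity" or fields[0].startswith("-"):
            continue
        _entity, _type, label, status, info = fields
        if status != "OK":
            rows.append((label, status, info))
    return rows

for label, status, info in unhealthy_fans(SAMPLE):
    print(f"{label}: {status} ({info})")
```

With the excerpt above, the sketch reports Fan1 (CRIT) and Fan Redundancy (FAIL).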
Issue the command: # sudo ipmitool -H <BMC_IP> -U root -P passwd -I lanplus sdr type "Fan"
admin@node3:~> sudo ipmitool -H 192.168.219.103 -U root -P passwd -I lanplus sdr type "Fan"
Fan1             | 38h | lcr |  7.1 | 0 RPM     
Fan2             | 39h | ok  |  7.1 | 13440 RPM
Fan3             | 3Ah | ok  |  7.1 | 20640 RPM
Fan4             | 3Bh | ok  |  7.1 | 20880 RPM
Fan5             | 3Ch | ok  |  7.1 | 20880 RPM
Fan6             | 3Dh | ok  |  7.1 | 20640 RPM
Fan Redundancy   | 78h | ok  |  7.1 | Redundancy Lost
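
Note that in the sample above the Fan Redundancy row shows a status of ok while its reading is "Redundancy Lost", so a check on the status column alone would miss it. The following sketch flags a row when the status is not ok or when the reading mentions "lost". That second rule is an assumption drawn only from this sample, and the function name `flag_sdr_rows` is hypothetical.

```python
# Excerpt of the "ipmitool ... sdr type Fan" output shown above.
SAMPLE = """\
Fan1             | 38h | lcr |  7.1 | 0 RPM
Fan2             | 39h | ok  |  7.1 | 13440 RPM
Fan Redundancy   | 78h | ok  |  7.1 | Redundancy Lost
"""

def flag_sdr_rows(text):
    """Return (name, status, reading) for rows that need attention."""
    flagged = []
    for line in text.splitlines():
        parts = [p.strip() for p in line.split("|")]
        if len(parts) != 5:
            continue  # not a sensor row (prompt, blank line, and so on)
        name, _sensor_id, status, _entity_id, reading = parts
        # Flag non-ok statuses (lcr = lower critical) and lost redundancy.
        if status != "ok" or "lost" in reading.lower():
            flagged.append((name, status, reading))
    return flagged

for name, status, reading in flag_sdr_rows(SAMPLE):
    print(f"{name}: status={status}, reading={reading}")
```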
Issue the command: # sudo -i racadm getsensorinfo
(The output below is truncated.)
admin@node3:~> sudo -i racadm getsensorinfo

Sensor Type : FAN
<Sensor Name>                   <Status>    <Reading>   <lc>        <uc>        <PWM %>     <Type>
System Board Fan1               Failed      0RPM        480RPM      NA          NA          High Performance 
System Board Fan2               Ok          13440RPM    480RPM      NA          65%         High Performance
System Board Fan3               Ok          20640RPM    480RPM      NA          100%        High Performance
System Board Fan4               Ok          20640RPM    480RPM      NA          100%        High Performance
System Board Fan5               Ok          20880RPM    480RPM      NA          100%        High Performance
System Board Fan6               Ok          20640RPM    480RPM      NA          100%        High Performance


<Sensor Name>                   <Status>                 <Type>
System Board PS Redundancy      Full Redundant           PSU
System Board Fan Redundancy     Redundancy Lost          Fan 
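
The fan table above also lists a lower critical threshold (`<lc>`) for each fan, so each reading can be compared against it. The following is a minimal sketch assuming the seven-column layout shown in the sample; the helper names `rpm` and `failed_fans` are hypothetical, and the sample text is an excerpt of the output above.

```python
import re

# Excerpt of the FAN section of "racadm getsensorinfo" shown above.
SAMPLE = """\
<Sensor Name>                   <Status>    <Reading>   <lc>        <uc>        <PWM %>     <Type>
System Board Fan1               Failed      0RPM        480RPM      NA          NA          High Performance
System Board Fan2               Ok          13440RPM    480RPM      NA          65%         High Performance
System Board Fan3               Ok          20640RPM    480RPM      NA          100%        High Performance
"""

def rpm(value):
    """Convert a value such as '480RPM' to an int; return None for 'NA'."""
    match = re.fullmatch(r"(\d+)RPM", value)
    return int(match.group(1)) if match else None

def failed_fans(text):
    """Return (name, status, reading, lower_critical) for suspect fans."""
    suspects = []
    for line in text.splitlines():
        fields = re.split(r"\s{2,}", line.strip())
        # Keep only seven-column fan rows; skip the <...> header row.
        if len(fields) != 7 or fields[0].startswith("<"):
            continue
        name, status = fields[0], fields[1]
        reading, lower = rpm(fields[2]), rpm(fields[3])
        below = reading is not None and lower is not None and reading < lower
        if status != "Ok" or below:
            suspects.append((name, status, reading, lower))
    return suspects

for name, status, reading, lower in failed_fans(SAMPLE):
    print(f"{name}: {status}, {reading} RPM (lower critical {lower} RPM)")
# prints: System Board Fan1: Failed, 0 RPM (lower critical 480 RPM)
```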

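If any fan reports a status other than OK, such as CRIT, the fan has failed and must be replaced. If you do not already have a service request (SR) for this issue, contact ECS support and reference this KB to open one.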

Affected Products

ECS Appliance Hardware Gen1 U-Series

Products

ECS Appliance
Article Properties
Article Number: 000041145
Article Type: Solution
Last Modified: 30 Sep 2025
Version:  5