ECS: xDoctor: RAP015: Fan Failure

Summary: A fan on the node is reporting a failed state.

This article is not tied to any specific product. Not all product versions are identified in this article.

Symptoms

A fan on the node has stopped reporting that it is operational.
The fan reports a failed state; if this is not resolved, the node may overheat.
----------------------
CRITICAL - Fan Failure
----------------------
Node      = 169.254.1.3
Extra     = {'node': '169.254.1.3', 'item': 'Fan1', 'status': 'CRIT', 'fan': 'System Board', 'info': '0 RPM, below critical threshold'}
RAP       = RAP015
Solution  = KB 470284
Timestamp = 2023-08-22_102927
PSNT      = CKMxxxxxxxxxxx @ 4.8-92.0
----------------------
CRITICAL - Fan Failure
----------------------
Node      = 169.254.1.3
Extra     = {'node': '169.254.1.3', 'item': 'Fan Redundancy', 'status': 'FAIL', 'fan': 'System Board', 'info': 'not redundant, redundancy lost'}
RAP       = RAP015
Solution  = KB 470284
Timestamp = 2023-08-22_102927
PSNT      = CKMxxxxxxxxxxx @ 4.8-92.0
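
When many alerts need triage, the fields of an alert can be read programmatically. In the alerts above, the Extra field is formatted like a Python dictionary literal. The following is a minimal sketch, assuming the alert text has been captured exactly as shown; the helper name parse_alert is hypothetical and is not part of xDoctor.

```python
import ast

# Alert text copied from the sample above.
SAMPLE_ALERT = """\
----------------------
CRITICAL - Fan Failure
----------------------
Node      = 169.254.1.3
Extra     = {'node': '169.254.1.3', 'item': 'Fan1', 'status': 'CRIT', 'fan': 'System Board', 'info': '0 RPM, below critical threshold'}
RAP       = RAP015
Solution  = KB 470284
Timestamp = 2023-08-22_102927
PSNT      = CKMxxxxxxxxxxx @ 4.8-92.0
"""


def parse_alert(text):
    """Return the 'Key = value' lines of one alert as a dict (hypothetical helper)."""
    fields = {}
    for line in text.splitlines():
        key, sep, value = line.partition("=")
        if not sep:
            continue  # separator and title lines contain no '='
        fields[key.strip()] = value.strip()
    if "Extra" in fields:
        # Assumption: Extra is always a plain dict literal, as in the sample.
        fields["Extra"] = ast.literal_eval(fields["Extra"])
    return fields


alert = parse_alert(SAMPLE_ALERT)
print(alert["Node"], alert["Extra"]["item"], alert["Extra"]["status"])
```

ast.literal_eval is used rather than eval because it accepts only literals, which is safer for text taken from a log.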

Cause

A moving part has failed because of an electrical fault or age-related wear.

Resolution

Connect to the affected node and verify the issue:
Use cs_hal to verify the fan status on the reported node.
 

Run the command: # cs_hal sensors fan
 
Example of healthy fans:
admin@node3:~> cs_hal sensors fan
Entity            Type                Label             Status  Info
-----             -----               -----             -----   -----
System Board      Fan                 Fan1              OK      12600 RPM
System Board      Fan                 Fan2              OK      12600 RPM
System Board      Fan                 Fan3              OK      16920 RPM
System Board      Fan                 Fan4              OK      16920 RPM
System Board      Fan                 Fan5              OK      17040 RPM
System Board      Fan                 Fan6              OK      17040 RPM
System Board      Fan                 Fan Redundancy    OK      fully redundant;

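As the failed-fan example below shows, the remaining fans increase their speed to generate more airflow and keep the node cool until the failed fan is replaced.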

Example of a failed fan:

admin@node3:~> cs_hal sensors fan
Entity            Type                Label             Status  Info
-----             -----               -----             -----   -----
System Board      Fan                 Fan1              CRIT    0 RPM; below critical threshold
System Board      Fan                 Fan2              OK      13440 RPM
System Board      Fan                 Fan3              OK      20640 RPM
System Board      Fan                 Fan4              OK      20880 RPM
System Board      Fan                 Fan5              OK      20880 RPM
System Board      Fan                 Fan6              OK      20640 RPM
System Board      Fan                 Fan Redundancy    FAIL    not redundant; redundancy lost;
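
To check several nodes quickly, the cs_hal table can be filtered for rows whose Status is not OK. The following is a minimal sketch, assuming the columns are separated by two or more spaces as in the sample above; the function name unhealthy_fans is hypothetical.

```python
import re

# Output copied from the failed-fan sample above (abridged).
SAMPLE = """\
Entity            Type                Label             Status  Info
-----             -----               -----             -----   -----
System Board      Fan                 Fan1              CRIT    0 RPM; below critical threshold
System Board      Fan                 Fan2              OK      13440 RPM
System Board      Fan                 Fan3              OK      20640 RPM
System Board      Fan                 Fan Redundancy    FAIL    not redundant; redundancy lost;
"""


def unhealthy_fans(output):
    """Return (label, status, info) for every row whose status is not OK."""
    rows = []
    for line in output.splitlines():
        # Assumption: columns are separated by runs of at least two spaces.
        cols = re.split(r"\s{2,}", line.strip(), maxsplit=4)
        if len(cols) != 5 or cols[0] in ("Entity", "-----"):
            continue  # skip the prompt, header, and rule lines
        _entity, _kind, label, status, info = cols
        if status != "OK":
            rows.append((label, status, info))
    return rows


for label, status, info in unhealthy_fans(SAMPLE):
    print(label, status, info)
```

Splitting on two or more spaces keeps labels such as "Fan Redundancy" and "System Board" intact, because they contain only single spaces.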
Run the command: # sudo ipmitool -H <BMC_IP> -U root -P passwd -I lanplus sdr type "Fan"
admin@node3:~> sudo ipmitool -H 192.168.219.103 -U root -P passwd -I lanplus sdr type "Fan"
Fan1             | 38h | lcr |  7.1 | 0 RPM     
Fan2             | 39h | ok  |  7.1 | 13440 RPM
Fan3             | 3Ah | ok  |  7.1 | 20640 RPM
Fan4             | 3Bh | ok  |  7.1 | 20880 RPM
Fan5             | 3Ch | ok  |  7.1 | 20880 RPM
Fan6             | 3Dh | ok  |  7.1 | 20640 RPM
Fan Redundancy   | 78h | ok  |  7.1 | Redundancy Lost
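
In the sample above, the Fan Redundancy row shows a status of ok while its reading is Redundancy Lost, so checking the status column alone would miss it. The following is a minimal sketch that checks both columns, assuming the five pipe-separated fields shown in the sample; the function name flag_sdr_rows and the "lost" text match are illustrative assumptions, not ipmitool behavior.

```python
# Output copied from the ipmitool sample above (abridged).
SAMPLE = """\
Fan1             | 38h | lcr |  7.1 | 0 RPM
Fan2             | 39h | ok  |  7.1 | 13440 RPM
Fan3             | 3Ah | ok  |  7.1 | 20640 RPM
Fan Redundancy   | 78h | ok  |  7.1 | Redundancy Lost
"""


def flag_sdr_rows(output):
    """Return (name, status, reading) for rows that look unhealthy."""
    flagged = []
    for line in output.splitlines():
        cols = [c.strip() for c in line.split("|")]
        if len(cols) != 5:
            continue  # not a sensor row
        name, _sensor_id, status, _entity, reading = cols
        # Heuristic (assumption): a non-ok status, or a reading that
        # mentions "lost", marks the row for attention.
        if status != "ok" or "lost" in reading.lower():
            flagged.append((name, status, reading))
    return flagged


for row in flag_sdr_rows(SAMPLE):
    print(*row)
```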
Run the command: # sudo -i racadm getsensorinfo
(The output below is truncated.)
admin@node3:~> sudo -i racadm getsensorinfo

Sensor Type : FAN
<Sensor Name>                   <Status>    <Reading>   <lc>        <uc>        <PWM %>     <Type>
System Board Fan1               Failed      0RPM        480RPM      NA          NA          High Performance 
System Board Fan2               Ok          13440RPM    480RPM      NA          65%         High Performance
System Board Fan3               Ok          20640RPM    480RPM      NA          100%        High Performance
System Board Fan4               Ok          20640RPM    480RPM      NA          100%        High Performance
System Board Fan5               Ok          20880RPM    480RPM      NA          100%        High Performance
System Board Fan6               Ok          20640RPM    480RPM      NA          100%        High Performance


<Sensor Name>                   <Status>                 <Type>
System Board PS Redundancy      Full Redundant           PSU
System Board Fan Redundancy     Redundancy Lost          Fan 
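
The racadm fan table also lists each fan's lower critical threshold (<lc>) and PWM percentage, so a reading can be compared with its threshold directly. The following is a minimal sketch, assuming the seven-column layout shown above with columns separated by two or more spaces; the function name fan_rows is hypothetical.

```python
import re

# Fan table copied from the racadm sample above (abridged).
SAMPLE = """\
<Sensor Name>                   <Status>    <Reading>   <lc>        <uc>        <PWM %>     <Type>
System Board Fan1               Failed      0RPM        480RPM      NA          NA          High Performance
System Board Fan2               Ok          13440RPM    480RPM      NA          65%         High Performance
System Board Fan3               Ok          20640RPM    480RPM      NA          100%        High Performance
"""


def fan_rows(output):
    """Parse seven-column fan rows into dicts with integer RPM values."""
    rows = []
    for line in output.splitlines():
        cols = re.split(r"\s{2,}", line.strip())
        if len(cols) != 7 or cols[0].startswith("<"):
            continue  # skip headers and the three-column redundancy table
        name, status, reading, lc, _uc, pwm, _kind = cols
        rows.append({
            "name": name,
            "status": status,
            "rpm": int(reading.replace("RPM", "")),
            "lc": int(lc.replace("RPM", "")),
            "pwm": pwm,
        })
    return rows


rows = fan_rows(SAMPLE)
below_threshold = [r["name"] for r in rows if r["rpm"] < r["lc"]]
at_full_pwm = [r["name"] for r in rows if r["pwm"] == "100%"]
print("Below lower critical threshold:", below_threshold)
print("Running at 100% PWM:", at_full_pwm)
```

In the sample, fans running at 100% PWM are the ones compensating for the failed fan.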

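If any fan reports a status other than "OK" (for example, CRIT), the fan has failed and should be replaced. If you do not have a Service Request (SR) for this issue, create one with ECS Support (see this knowledge base article).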

Affected Products

ECS Appliance Hardware Gen1 U-Series

Products

ECS Appliance
Article Properties
Article Number: 000041145
Article Type: Solution
Last Modified: 30 Sep 2025
Version:  5