ECS: xDoctor: RAP015: Fan Failure

Summary: A fan reports a failure state on the node.

This article applies to This article does not apply to This article is not tied to any specific product. Not all product versions are identified in this article.

Symptoms

A fan on the node has stopped reporting that it is in working condition.
A fan has reported that it is in a failure state and could cause overheating on the node if not addressed.  
----------------------
CRITICAL - Fan Failure
----------------------
Node      = 169.254.1.3
Extra     = {'node': '169.254.1.3', 'item': 'Fan1', 'status': 'CRIT', 'fan': 'System Board', 'info': '0 RPM, below critical threshold'}
RAP       = RAP015
Solution  = KB 470284
Timestamp = 2023-08-22_102927
PSNT      = CKMxxxxxxxxxxx @ 4.8-92.0
----------------------
CRITICAL - Fan Failure
----------------------
Node      = 169.254.1.3
Extra     = {'node': '169.254.1.3', 'item': 'Fan Redundancy', 'status': 'FAIL', 'fan': 'System Board', 'info': 'not redundant, redundancy lost'}
RAP       = RAP015
Solution  = KB 470284
Timestamp = 2023-08-22_102927
PSNT      = CKMxxxxxxxxxxx @ 4.8-92.0

Cause

Moving component failure due to electrical fault or age.

Resolution

Connect to the node in question and verify the issue:
Verify the fan status on the reported node using cs_hal.
 

Issue command: # cs_hal sensors fan
 
Example of a healthy fan:
admin@node3:~> cs_hal sensors fan
Entity            Type                Label             Status  Info
-----             -----               -----             -----   -----
System Board      Fan                 Fan1              OK      12600 RPM
System Board      Fan                 Fan2              OK      12600 RPM
System Board      Fan                 Fan3              OK      16920 RPM
System Board      Fan                 Fan4              OK      16920 RPM
System Board      Fan                 Fan5              OK      17040 RPM
System Board      Fan                 Fan6              OK      17040 RPM
System Board      Fan                 Fan Redundancy    OK      fully redundant;

Shown below, the speed of the other fans increases creating more air flow to keep the node cool until the fan has been replaced.

Example of a failed fan:

admin@node3:~> cs_hal sensors fan
Entity            Type                Label             Status  Info
-----             -----               -----             -----   -----
System Board      Fan                 Fan1              CRIT    0 RPM; below critical threshold
System Board      Fan                 Fan2              OK      13440 RPM
System Board      Fan                 Fan3              OK      20640 RPM
System Board      Fan                 Fan4              OK      20880 RPM
System Board      Fan                 Fan5              OK      20880 RPM
System Board      Fan                 Fan6              OK      20640 RPM
System Board      Fan                 Fan Redundancy    FAIL    not redundant; redundancy lost;
Issue command: # sudo ipmitool -H <BMC_IP> -U root -P passwd -I lanplus sdr type "Fan"
admin@node3:~> sudo ipmitool -H 192.168.219.103 -U root -P passwd -I lanplus sdr type "Fan"
Fan1             | 38h | lcr |  7.1 | 0 RPM     
Fan2             | 39h | ok  |  7.1 | 13440 RPM
Fan3             | 3Ah | ok  |  7.1 | 20640 RPM
Fan4             | 3Bh | ok  |  7.1 | 20880 RPM
Fan5             | 3Ch | ok  |  7.1 | 20880 RPM
Fan6             | 3Dh | ok  |  7.1 | 20640 RPM
Fan Redundancy   | 78h | ok  |  7.1 | Redundancy Lost
Issue command: # sudo -i racadm getsensorinfo
(cutting the complete output.)
admin@node3:~> sudo -i racadm getsensorinfo

Sensor Type : FAN
<Sensor Name>                   <Status>    <Reading>   <lc>        <uc>        <PWM %>     <Type>
System Board Fan1               Failed      0RPM        480RPM      NA          NA          High Performance 
System Board Fan2               Ok          13440RPM    480RPM      NA          65%         High Performance
System Board Fan3               Ok          20640RPM    480RPM      NA          100%        High Performance
System Board Fan4               Ok          20640RPM    480RPM      NA          100%        High Performance
System Board Fan5               Ok          20880RPM    480RPM      NA          100%        High Performance
System Board Fan6               Ok          20640RPM    480RPM      NA          100%        High Performance


<Sensor Name>                   <Status>                 <Type>
System Board PS Redundancy      Full Redundant           PSU
System Board Fan Redundancy     Redundancy Lost          Fan 

If any fan has a status that is not OK reporting CRIT, then the fan has failed and should be replaced. If you do not have a service request (SR) for this issue, open one with ECS support referring to this KB.

Affected Products

ECS Appliance Hardware Gen1 U-Series

Products

ECS Appliance
Article Properties
Article Number: 000041145
Article Type: Solution
Last Modified: 30 Sep 2025
Version:  5
Find answers to your questions from other Dell users
Support Services
Check if your device is covered by Support Services.