VxRail: Node triggers high inlet temperature alert

Summary: VxRail node reports high Inlet temperature alerts. This is usually due to an environment factor such as air-conditioner problem.

This article applies to This article does not apply to This article is not tied to any specific product. Not all product versions are identified in this article.

Symptoms

The VxRail node triggers these alerts in the life cycle controller:

2024-06-03 02:18:00    2586    TMPS0103    Inlet temperature is above critical level for extended duration.
2024-05-07 08:41:37    355    TMP0121    The system inlet temperature is greater than the upper critical threshold.


The event log generates the matching event entries:

2024-05-07 04:49:36    7    The system inlet temperature is within range.
2024-05-07 04:47:19    6    The system inlet temperature is greater than the upper warning threshold.
2024-05-06 19:41:37    5    The system inlet temperature is greater than the upper critical threshold.
2024-05-06 19:12:49    4    The system inlet temperature is greater than the upper warning threshold.


If the server is under the critical event, it would be automatically running in a degraded mode. If the situation lasts a long time, it shuts down.

In this screenshot, the iDRAC log would read the temperature on the CPU or system board along with their warning and critical threshold. 38 as warning and 42 as critical.

Screenshot of temperature thresholds 

Cause

This is because of the environmental situation that the ventilation is not good. This causes the VxRail node to generate a high temperature. When the fan module is unable to adjust the speed to cool down the internal component temperature, the thermal event causes the server to run in a degraded mode and shuts down the server to avoid hardware damage. This operation depends on the setting of temperature alert setting in the iDRAC.


Inlet high temperature: If the temperature alert is not set, then when it reaches to 42 degrees or above for an extended time it first runs in degraded mode and tries to use the fan module to cool down the server. After an extended time, it shuts down the server.
 

Resolution

  1. VxRail nodes have an internal mechanism to deal with the poor environmental situation with its fan module and with the definition thresholds of warning and critical. As mentioned above after running into critical:

    A. Under iDRAC->configuration->system settings->alert configuration->alerts->alert configuration -> expand the temperature. If the first line critical is with Power off, after reaching the critical temperature it would immediately shut down by CPU thermal trip.

    The following iDRAC command would turn out to have the same effect:
racadm>>racadm eventfilters get -c idrac.alert.system.TMP.critical




Screenshot of the temperature actions in iDRAC 
B. If this parameter is No Action, the iDRAC tries to adjust the fan module to cool down the system first. After it has been running extended cycles, a CPU thermal trip would power off the server to avoid hardware component damage by continuous temperature.

2. To avoid this high Inlet temperature, customers must ensure that inlet temperatures are within in range for optimum performance.

Affected Products

VxRail, VxRail Appliance Series, VxRail G Series Nodes, VxRail D560, VxRail D560F, VxRail E560, VxRail E560 VCF, VxRail E560F, VxRail E560F VCF, VxRail E560N, VxRail E560N VCF, VxRail E660, VxRail E660F, VxRail E660N, VxRail E665, VxRail E665F , VxRail E665N, VxRail G560, VxRail G560 VCF, VxRail G560F, VxRail G560F VCF, VxRail P Series Nodes, VxRail P470, VxRail P570, VxRail P570 VCF, VxRail P570F, VxRail P570F VCF, VxRail P580N, VxRail P580N VCF, VxRail P670F, VxRail P670N, VxRail P675F, VxRail P675N, VxRail S Series Nodes, VxRail S570, VxRail S570 VCF, VxRail S670, VxRail V Series Nodes, VxRail V570, VxRail V570 VCF, VxRail V570F, VxRail V570F VCF, VXRAIL V670F, VxRail VD-4510C, VxRail VD-4520C, VxRail VD Series Nodes, VxRail VE-660, VxRail VE-6615, VxRail VP-760, VxRail VP-7625, VxRail VS-760 ...
Article Properties
Article Number: 000227616
Article Type: Solution
Last Modified: 09 May 2025
Version:  3
Find answers to your questions from other Dell users
Support Services
Check if your device is covered by Support Services.