ECS: RAP015: 온도 오류; 증상 코드: 2010

Summary: 노드의 온도 센서가 위험 수준에 도달했다고 보고합니다.

This article applies to This article does not apply to This article is not tied to any specific product. Not all product versions are identified in this article.

Symptoms

온도 센서가 위험 임계값을 초과하는 온도를 감지했습니다.
구성 요소가 올바르게 작동하지 않아 온도 센서가 위험 수준에 도달했음을 보고할 수 있습니다.
노드의 온도 센서가 위험 수준에 도달했음을 보고합니다.

Cause

온도 센서가 위험 수준 이상으로 올라가는 문제가 발생했습니다.

Resolution

Gen2의 경우 맨 아래로 스크롤합니다.

Gen3 하드웨어: 

1. 보고된 노드에서 cs_hal를 사용하여 온도 센서의 상태를 확인합니다.

명령: 
#cs_hal sensors temp
 
예: Gen3의 경우 다음과 같이 3개의 온도 센서만 있습니다.
 
admin@n1-mgmt:~>  cs_hal sensors temp
Entity            Type                Label             Status  Info
-----             -----               -----             -----   -----
Processor         Temperature         Temp              OK      53 Degrees Celsius
Processor         Temperature         Temp              OK      54 Degrees Celsius
System Board      Temperature         Inlet Temp        CRIT    40 Degrees Celsius; above critical threshold
System Board      Temperature         Exhaust Temp      OK      50 Degrees Celsius

NOTE: on Axum and EX-series, use "sudo -i racadm getsensorinfo" to obtain sensor information.
admin@n1-mgmt:~>
2. 랙의 모든 노드를 확인하고 다른 노드에서 온도 센서가 "OK"

명령이 아니라고 보고하는지 확인합니다
viprexec -i  cs_hal sensors temp

예: 이 예에서는 랙 상단의 절반에 있는 여러 노드가 입구 온도가 너무 높다고 보고합니다. 
admin@n1-mgmt:~> viprexec -i  cs_hal sensors temp

Output from host : 192.168.219.1
Entity            Type                Label             Status  Info
-----             -----               -----             -----   -----
Processor         Temperature         Temp              OK      53 Degrees Celsius
Processor         Temperature         Temp              OK      53 Degrees Celsius
System Board      Temperature         Inlet Temp        CRIT    40 Degrees Celsius; above critical threshold
System Board      Temperature         Exhaust Temp      OK      50 Degrees Celsius

NOTE: on Axum and EX-series, use "sudo -i racadm getsensorinfo" to obtain sensor information.

Output from host : 192.168.219.2
Entity            Type                Label             Status  Info
-----             -----               -----             -----   -----
Processor         Temperature         Temp              OK      47 Degrees Celsius
Processor         Temperature         Temp              OK      49 Degrees Celsius
System Board      Temperature         Inlet Temp        CRIT    39 Degrees Celsius; above critical threshold
System Board      Temperature         Exhaust Temp      OK      50 Degrees Celsius

NOTE: on Axum and EX-series, use "sudo -i racadm getsensorinfo" to obtain sensor information.

Output from host : 192.168.219.3
Entity            Type                Label             Status  Info
-----             -----               -----             -----   -----
Processor         Temperature         Temp              OK      46 Degrees Celsius
Processor         Temperature         Temp              OK      46 Degrees Celsius
System Board      Temperature         Inlet Temp        OK      35 Degrees Celsius
System Board      Temperature         Exhaust Temp      OK      47 Degrees Celsius

NOTE: on Axum and EX-series, use "sudo -i racadm getsensorinfo" to obtain sensor information.

Output from host : 192.168.219.4
Entity            Type                Label             Status  Info
-----             -----               -----             -----   -----
Processor         Temperature         Temp              OK      48 Degrees Celsius
Processor         Temperature         Temp              OK      50 Degrees Celsius
System Board      Temperature         Inlet Temp        OK      35 Degrees Celsius
System Board      Temperature         Exhaust Temp      OK      47 Degrees Celsius

NOTE: on Axum and EX-series, use "sudo -i racadm getsensorinfo" to obtain sensor information.

Output from host : 192.168.219.5
Entity            Type                Label             Status  Info
-----             -----               -----             -----   -----
Processor         Temperature         Temp              OK      48 Degrees Celsius
Processor         Temperature         Temp              OK      50 Degrees Celsius
System Board      Temperature         Inlet Temp        WARN    38 Degrees Celsius; above non-critical threshold
System Board      Temperature         Exhaust Temp      OK      49 Degrees Celsius

NOTE: on Axum and EX-series, use "sudo -i racadm getsensorinfo" to obtain sensor information.

Output from host : 192.168.219.6
Entity            Type                Label             Status  Info
-----             -----               -----             -----   -----
Processor         Temperature         Temp              OK      50 Degrees Celsius
Processor         Temperature         Temp              OK      52 Degrees Celsius
System Board      Temperature         Inlet Temp        CRIT    39 Degrees Celsius; above critical threshold
System Board      Temperature         Exhaust Temp      OK      51 Degrees Celsius

NOTE: on Axum and EX-series, use "sudo -i racadm getsensorinfo" to obtain sensor information.

Output from host : 192.168.219.7
Entity            Type                Label             Status  Info
-----             -----               -----             -----   -----
Processor         Temperature         Temp              OK      45 Degrees Celsius
Processor         Temperature         Temp              OK      48 Degrees Celsius
System Board      Temperature         Inlet Temp        OK      36 Degrees Celsius
System Board      Temperature         Exhaust Temp      OK      47 Degrees Celsius

NOTE: on Axum and EX-series, use "sudo -i racadm getsensorinfo" to obtain sensor information.

Output from host : 192.168.219.8
Entity            Type                Label             Status  Info
-----             -----               -----             -----   -----
Processor         Temperature         Temp              OK      51 Degrees Celsius
Processor         Temperature         Temp              OK      49 Degrees Celsius
System Board      Temperature         Inlet Temp        OK      31 Degrees Celsius
System Board      Temperature         Exhaust Temp      OK      43 Degrees Celsius

NOTE: on Axum and EX-series, use "sudo -i racadm getsensorinfo" to obtain sensor information.

Output from host : 192.168.219.9
Entity            Type                Label             Status  Info
-----             -----               -----             -----   -----
Processor         Temperature         Temp              OK      52 Degrees Celsius
Processor         Temperature         Temp              OK      51 Degrees Celsius
System Board      Temperature         Inlet Temp        OK      30 Degrees Celsius
System Board      Temperature         Exhaust Temp      OK      42 Degrees Celsius

NOTE: on Axum and EX-series, use "sudo -i racadm getsensorinfo" to obtain sensor information.

Output from host : 192.168.219.10
Entity            Type                Label             Status  Info
-----             -----               -----             -----   -----
Processor         Temperature         Temp              OK      54 Degrees Celsius
Processor         Temperature         Temp              OK      51 Degrees Celsius
System Board      Temperature         Inlet Temp        OK      28 Degrees Celsius
System Board      Temperature         Exhaust Temp      OK      41 Degrees Celsius

NOTE: on Axum and EX-series, use "sudo -i racadm getsensorinfo" to obtain sensor information.
 192.168.219.7
Entity            Type                Label             Status  Info
-----             -----               -----             -----   -----
Processor         Temperature         Temp              OK      45 Degrees Celsius
Processor         Temperature         Temp              OK      48 Degrees Celsius
System Board      Temperature         Inlet Temp        OK      36 Degrees Celsius
System Board      Temperature         Exhaust Temp      OK      47 Degrees Celsius

Output from host : 192.168.219.11
Entity            Type                Label             Status  Info
-----             -----               -----             -----   -----
Processor         Temperature         Temp              OK      56 Degrees Celsius
Processor         Temperature         Temp              OK      55 Degrees Celsius
System Board      Temperature         Inlet Temp        OK      27 Degrees Celsius
System Board      Temperature         Exhaust Temp      OK      40 Degrees Celsius

NOTE: on Axum and EX-series, use "sudo -i racadm getsensorinfo" to obtain sensor information.

Output from host : 192.168.219.12
Entity            Type                Label             Status  Info
-----             -----               -----             -----   -----
Processor         Temperature         Temp              OK      59 Degrees Celsius
Processor         Temperature         Temp              OK      59 Degrees Celsius
System Board      Temperature         Inlet Temp        OK      26 Degrees Celsius
System Board      Temperature         Exhaust Temp      OK      38 Degrees Celsius

NOTE: on Axum and EX-series, use "sudo -i racadm getsensorinfo" to obtain sensor information.

Output from host : 192.168.219.13
Entity            Type                Label             Status  Info
-----             -----               -----             -----   -----
Processor         Temperature         Temp              OK      51 Degrees Celsius
Processor         Temperature         Temp              OK      49 Degrees Celsius
System Board      Temperature         Inlet Temp        OK      26 Degrees Celsius
System Board      Temperature         Exhaust Temp      OK      36 Degrees Celsius

NOTE: on Axum and EX-series, use "sudo -i racadm getsensorinfo" to obtain sensor information.

Output from host : 192.168.219.14
Entity            Type                Label             Status  Info
-----             -----               -----             -----   -----
Processor         Temperature         Temp              OK      57 Degrees Celsius
Processor         Temperature         Temp              OK      60 Degrees Celsius
System Board      Temperature         Inlet Temp        OK      26 Degrees Celsius
System Board      Temperature         Exhaust Temp      OK      38 Degrees Celsius

NOTE: on Axum and EX-series, use "sudo -i racadm getsensorinfo" to obtain sensor information.

Output from host : 192.168.219.15
Entity            Type                Label             Status  Info
-----             -----               -----             -----   -----
Processor         Temperature         Temp              OK      59 Degrees Celsius
Processor         Temperature         Temp              OK      59 Degrees Celsius
System Board      Temperature         Inlet Temp        OK      26 Degrees Celsius
System Board      Temperature         Exhaust Temp      OK      39 Degrees Celsius

NOTE: on Axum and EX-series, use "sudo -i racadm getsensorinfo" to obtain sensor information.

Output from host : 192.168.219.16
Entity            Type                Label             Status  Info
-----             -----               -----             -----   -----
Processor         Temperature         Temp              OK      56 Degrees Celsius
Processor         Temperature         Temp              OK      56 Degrees Celsius
System Board      Temperature         Inlet Temp        OK      26 Degrees Celsius
System Board      Temperature         Exhaust Temp      OK      38 Degrees Celsius

NOTE: on Axum and EX-series, use "sudo -i racadm getsensorinfo" to obtain sensor information.
admin@n1-mgmt:~>

3. 가능한 시나리오:
  1. 센서 이상만 보고하는 노드 1개: 온도가 "양호"하지 않다고 보고되는 노드 하나에서만 문제가 발생하는 경우 랙 문제라기보다는 내부 문제로 인해 부품 문제이거나 노드의 공기 흐름이 원활하지 못할 가능성이 높습니다.
  2. 여러 노드 가 영향을 받을 수 있으며, 이는 랙 자체 내의 환경 문제나 데이터 센터 내부의 문제가 될 수 있습니다.


4. 팬이 정상적으로 실행되고 있는지 확인하십시오. 그렇지 않은 경우 팬을 교체해야 할 수 있습니다.

명령:

#cs_hal sensors fan
예: 
admin@ecs:~>cs_hal sensors fan

Output from host : 192.168.219.1
Entity            Type                Label             Status  Info
-----             -----               -----             -----   -----
System Board      Fan                 Fan1              OK      12600 RPM
System Board      Fan                 Fan2              OK      12600 RPM
System Board      Fan                 Fan3              OK      16920 RPM
System Board      Fan                 Fan4              OK      16800 RPM
System Board      Fan                 Fan5              OK      17040 RPM
System Board      Fan                 Fan6              OK      16920 RPM
System Board      Fan                 Fan Redundancy    OK      fully redundant;

NOTE: on Axum and EX-series, use "sudo -i racadm getsensorinfo" to obtain sensor information.
3. 모든 팬이 정상이라고 보고되는 경우 팬 시스템에 문제가 없음을 의미합니다. Power Edge 팀에 문의하여 부품 교체가 필요한지 확인합니다. 문제를 보고하는 팬이 있는 경우 ECS를 따릅니다. 다이얼 홈: 팬 고장; 증상 코드: 2008

년4. 중요:  https://central.dell.com/case-lookup/ 사용하여 PSNT(Product Serial Number Tag)를 조회하여 내역을 확인합니다.   지난 3-6개월 동안 발생한 발생 횟수를 확인합니다. 문제가 지속적이고 여러 노드에 영향을 미치는지 또는 전체 랙의 유입 온도가 정상보다 높아 영향을 받는지 확인합니다. 이는 해결해야 할 지속적인 환경 문제를 나타냅니다. 온도 문제를 해결하기 위한 명확한 조치 계획과 결론이 없는 한 케이스를 중복으로 종결하지 마십시오. 

5. PE 팀에서 문제를 찾지 못하거나 기록에 동일한 알림(3개월 이상)에서 많은 발생이 포함된 경우 L2 over Swarm 에 문의하고 작업 주문을 준비하여 CE 에게 영향을 받는 랙 및 노드의 환경 조건을 검토합니다. 
 
2세대: 
 
1. cs_hal를 사용하여 온도 센서의 상태를 확인하십시오.
예:
# cs_hal sensors temp
Entity            Type                Label             Status  Info
-----             -----               -----             -----   -----
System Board      Temperature         SSB Therm Trip    OK
System Board      Temperature         BB Inlet Temp     OK      32 Degrees Celsius
CPU (DCMI Compat) Temperature         HSBP Temp         OK      -222 Degrees Celsius
System Board      Temperature         SSB Temp          OK      60 Degrees Celsius
System Board      Temperature         BB BMC Temp       OK      51 Degrees Celsius
System Board      Temperature         P1 VR Temp        OK      38 Degrees Celsius
System Board      Temperature         IB Temp           OK      46 Degrees Celsius
System Board      Temperature         Exit Air Temp     OK      54 Degrees Celsius
Front Panel       Temperature         IOM Temp          OK      43 Degrees Celsius
Drive Backplane   Temperature         HSBP PSOC         OK      37 Degrees Celsius
Front Panel       Temperature         LAN NIC Temp      OK      67 Degrees Celsius
Power Supply      Temperature         PS1 Temperature   OK      34 Degrees Celsius
Power Supply      Temperature         PS2 Temperature   OK      34 Degrees Celsius
Processor         Temperature         P1 Therm Margin   OK      216 Degrees Celsius
Processor         Temperature         P2 Therm Margin   OK      206 Degrees Celsius
Processor         Temperature         P1 Therm Ctrl %   OK      0 Unspecified
Processor         Temperature         P2 Therm Ctrl %   OK      0 Unspecified
Processor         Temperature         P1 DTS Therm Mgn  OK      216 Degrees Celsius
Processor         Temperature         P2 DTS Therm Mgn  OK      206 Degrees Celsius
Processor         Temperature         P1 VRD Hot        OK
Processor         Temperature         P2 VRD Hot        OK
System Board      Temperature         DIMM Thrm Mrgn 1  OK      201 Degrees Celsius
System Board      Temperature         DIMM Thrm Mrgn 2  OK      200 Degrees Celsius
System Board      Temperature         DIMM Thrm Mrgn 3  OK      198 Degrees Celsius
System Board      Temperature         DIMM Thrm Mrgn 4  OK      197 Degrees Celsius
System Board      Temperature         Agg Thrm Mgn 1    OK      233 Degrees Celsius
2. Gen 3(PowerEdge에 보고하지 않음)에서도 동일한 단계를 따릅니다. Gen 2에 대한 자세한 내용은 향후 업데이트될 예정입니다. 

Affected Products

ECS Appliance

Products

ECS Appliance
Article Properties
Article Number: 000046763
Article Type: Solution
Last Modified: 30 Apr 2024
Version:  6
Find answers to your questions from other Dell users
Support Services
Check if your device is covered by Support Services.