ECS:由於 CPU 上收到 NMI,導致意外重新開機
Summary: 由於 CPU 上收到 NMI,導致意外重新開機。
This article applies to
This article does not apply to
This article is not tied to any specific product.
Not all product versions are identified in this article.
Symptoms
ECS 節點多次發生未預期的重新開機,核心檔案已在重新開機時產生。透過在 /var/crash/ 中檢查 dmesg 記錄中的堆疊追蹤,重新開機是由於在 CPU 上偵測到 NMI。
NMI 標準適用於不可遮罩中斷(具有最高優先順序的中斷),它發生是為了發出對不可恢復硬體錯誤的關注信號。
2020-03-01-21:06/dmesg.txt:[5200025.129135] Uhhuh. NMI received for unknown reason 3d on CPU 0. 2020-03-01-21:06/dmesg.txt-[5200025.129135] Do you have a strange power-saving mode enabled? Checked the hardware for any issue and checked if BIOS is out-dated sudo bash memory.sh sudo ipmitool sel list sudo xdoctor /usr/share/emc-intel-firmware/flashupdt/flashupdt /i | grep "BIOS Version"
Cause
這可能是作業系統或硬體問題。
Resolution
重新建立映像可能就足夠了。但是,若重新映像後問題持續存在,最好實體更換節點。
Affected Products
Elastic Cloud StorageProducts
Elastic Cloud StorageArticle Properties
Article Number: 000081969
Article Type: Solution
Last Modified: 12 Sep 2025
Version: 5
Find answers to your questions from other Dell users
Support Services
Check if your device is covered by Support Services.