How to decode raw data on ECC memory errors for the PowerEdge C1100, C2100, C6100, C6105, C6145, C6220 and C8220.

How to decode raw data on ECC memory errors for the PowerEdge C1100, C2100, C6100, C6105, C6145, C6220 and C8220.



Article Summary: This article describes how to decode raw data on ECC memory errors in the System Event Log on the PowerEdge C1100, C2100, C6100, C6105, and C6145.


Issue:

For PowerEdge C-series servers use the attached excel tool to decode the SEL RAW data to figure out which DIMM slot is/was having problem.

Note: This tool is only for decoding memory ECC errors.


Solution:

How to collect the RAW SEL data:

  1. Use IPMI tool to get SEL RAW event data with the following parameter
    #ipmitool sel list -v (Find out the memory related RAW event data from output.)
  2. Download the excel file and open it.
  3. Select the platform you’re working on from the drop-down list and fill in the RAW event data which you got in step#1 in Cell#B2 and press Enter.
  4. It will tell you which DIMM slot has a problem.
Note: Use the impitool sel list -v command to get Event Raw Data.

Example one of SEL record:

SEL Record ID : 0021
Record Type : 02
Timestamp : 02/08/2012 12:40:58
Generator ID : 0021
EvM Revision : 04
Sensor Type : Memory
Sensor Number : 60
Event Type : Sensor-specific Discrete
Event Direction : Assertion Event
Event Data : a1ff14 (This is the Event Raw data you need to fill the 6 bytes into cell#B2.)
Description : Uncorrectable ECC




文章 ID: SLN156243

上次修改日期: 03/07/2016 09:59 AM


為本文評分

準確
實用
易懂
這篇文章對您有用嗎?
傳送意見反應
評語中不得包含下列特殊字元:<>()\
很抱歉,我們的意見回饋系統目前關閉中。請稍後再試。

感謝您的寶貴意見。