Skip to main content
  • Place orders quickly and easily
  • View orders and track your shipping status
  • Enjoy members-only rewards and discounts
  • Create and access a list of your products
  • Manage your Dell EMC sites, products, and product-level contacts using Company Administration.

EDAC Errors in 'messages' Log in RedHat Enterprise Linux (RHEL) and PowerEdge

Summary: This article provides information on issues with EDAC errors in 'messages' log in RedHat Enterprise Linux (RHEL).

This article may have been automatically translated. If you have any feedback regarding its quality, please let us know using the form at the bottom of this page.

Article Content


Symptoms

Issue : 

You may see something like this in your '/var/log/messages' log :
 

Sep 22 17:58:47 hostname kernel: EDAC MC0: CE row 0, channel 0, label "CPU_SrcID#0_Channel#1_DIMM#0": 1 Unknown error(s): memory scrubbing on FATAL area : cpu=0 Err=0008:00c2 (ch=2), addr = 0x1b63f5000 => socket=0, Channel=1(mask=2), rank=0
Sep 22 17:58:47 hostname kernel:
Sep 22 17:58:47 hostname kernel: EDAC MC0: CE row 1, channel 0, label "CPU_SrcID#0_Channel#2_DIMM#0": 1 Unknown error(s): memory scrubbing on FATAL area : cpu=0 Err=0008:00c2 (ch=2), addr = 0x1b63f6000 => socket=0, Channel=2(mask=4), rank=0
Sep 22 17:58:47 hostname kernel:
Sep 22 17:58:47 hostname kernel: EDAC MC0: CE row 1, channel 0, label "CPU_SrcID#0_Channel#2_DIMM#0": 1 Unknown error(s): memory scrubbing on FATAL area : cpu=0 Err=0008:00c2 (ch=2), addr = 0x1b63f6000 => socket=0, Channel=2(mask=4), rank=0
Sep 22 17:58:47 hostname kernel:
Sep 22 17:58:47 hostname kernel: EDAC MC0: CE row 2, channel 0, label "CPU_SrcID#0_Channel#3_DIMM#0": 1 Unknown error(s): memory scrubbing on FATAL area : cpu=0 Err=0008:00c2 (ch=2), addr = 0x1b63f7000 => socket=0, Channel=3(mask=8), rank=0
Sep 22 17:58:47 hostname kernel:
Sep 22 17:58:47 hostname kernel: EDAC MC0: CE row 0, channel 0, label "CPU_SrcID#0_Channel#1_DIMM#0": 1 Unknown error(s): memory scrubbing on FATAL area : cpu=0 Err=0008:00c2 (ch=2), addr = 0x1b63f8000 => socket=0, Channel=1(mask=2), rank=0
Sep 22 17:58:47 hostname kernel:
Sep 22 18:00:08 hostname kernel: sbridge: HANDLING MCE MEMORY ERROR
Sep 22 18:00:08 hostname kernel: CPU 0: Machine Check Exception: 0 Bank 10: 8c00004d000800c2
Sep 22 18:00:08 hostname kernel: TSC 0 ADDR 2763f4000 MISC 900000800081a8c PROCESSOR 0:206d7 TIME 1379887208 SOCKET 0 APIC 0
Sep 22 18:00:08 hostname kernel: sbridge: HANDLING MCE MEMORY ERROR
Sep 22 18:00:08 hostname kernel: CPU 0: Machine Check Exception: 0 Bank 10: 8c00004d000800c2
Sep 22 18:00:08 hostname kernel: TSC 0 ADDR 2763f4000 MISC 900002000201a8c PROCESSOR 0:206d7 TIME 1379887208 SOCKET 0 APIC 0
Sep 22 18:00:08 hostname kernel: sbridge: HANDLING MCE MEMORY ERROR
Sep 22 18:00:08 hostname kernel: CPU 0: Machine Check Exception: 0 Bank 10: 8c00004d000800c2
Sep 22 18:00:08 hostname kernel: TSC 0 ADDR 2763f4000 MISC 900000400041a8c PROCESSOR 0:206d7 TIME 1379887208 SOCKET 0 APIC 0
Sep 22 18:00:08 hostname kernel: sbridge: HANDLING MCE MEMORY ERROR
Sep 22 18:00:08 hostname kernel: CPU 0: Machine Check Exception: 0 Bank 10: 8c00004d000800c2
Sep 22 18:00:08 hostname kernel: TSC 0 ADDR 2763f4000 MISC 900000400041a8c PROCESSOR 0:206d7 TIME 1379887208 SOCKET 0 APIC 0
Sep 22 18:00:08 hostname kernel: sbridge: HANDLING MCE MEMORY ERROR
Sep 22 18:00:08 hostname kernel: CPU 0: Machine Check Exception: 0 Bank 10: 8c00004d000800c2
Sep 22 18:00:08 hostname kernel: TSC 0 ADDR 2763f5000 MISC 900001000101a8c PROCESSOR 0:206d7 TIME 1379887208 SOCKET 0 APIC 0

 

Cause

Discussion :
 
These errors occur when the Error Detection and Correction (EDAC) module reads the registers from the chipset. You may not notice any memory or CPU errors in the ESM/BMC/IPMI/iDRAC log because the registers are read-once and when enabled, EDAC will get them first.

 

Resolution

Resolution :

 

  • Blacklist the edac driver :
    • List edac modules :
      • # lsmod | grep -i edac
    • Take the output and blacklist them :
    • Edit '/etc/modprobe.d/blacklist.conf' with your favorite editor
    • Add the modules at the bottom of the file
    • Example :
      • blacklist i7core_edac
      • blacklist edac_core
  • Reboot
  • Run hardware diagnostics

 

Article Properties


Affected Product

Servers, PowerEdge, Software RAID with Red Hat Enterprise Linux4, Red Hat Enterprise Linux Version 5, Red Hat Enterprise Linux Version 6, Red Hat Enterprise Linux Version 7, Red Hat Enterprise Virtualization 3

Last Published Date

03 Feb 2022

Version

5

Article Type

Solution