10 Elder

 • 

6.2K Posts

July 6th, 2018 11:00

Hello

When troubleshooting a multi-bit memory error you are trying to determine if it is a memory module, slot, or memory controller issue. The memory controller is on the CPU. On 11th generation and older servers the memory logs must be manually cleared. If you don't clear the memory logs then you may have false errors reported.

I suggest swapping the memory in slots B1-B3 with the memory in slots A1-A3. After you perform the memory swap clear the memory log. Monitor the hardware log to see if any new errors are reported. If the error follows the modules then you will need to split up the three modules and continue moving memory around to try to locate the faulty module. If the error stays with the slot then it is likely a slot issue. If the error moves around to different slots but stays on the B lane then it is likely an issue with the memory controller on CPU2. You can swap CPUs to see if the error moves with the CPU to the A lane.

You should clear the memory logs each time you move memory or CPUs for troubleshooting. If you don't clear the memory logs then old errors may continue to be reported. Here is an article with instructions for clearing the memory logs.

<ADMIN NOTE: Broken link has been removed from this post by Dell>

Thanks

No Events found!

Top