Unsolved
5 Posts
September 4th, 2023 17:08
Dell PowerEdge r720 Correctable memory rate exceeded for DIMM A1
Hi there, I randomly received this error. The iDRAC GUI shows it under Logs as "Correctable Memory error rate exceeded for DIMM A1". After some research, I moved the RAM stick to a different slot. I no longer see the error, but there is also no log entry stating that the issue has been deasserted. As the second image shows, the system does detect a hardware change during boot after I swapped the RAM sticks (A1 > B4 and B4 > A1).
After that, I went into the Lifecycle Controller, opened Hardware Diagnostics, and ran a memory test. It reported ePSA error code 2000-0125, which basically means "The IPMI System event log is full for various reasons, or logging has stopped because too many ECC errors have occurred." After clearing the logs, booting the system back up, and running the same test again, I now receive an ePSA warning stating "Mem ECC warning: Memory Sensor, transition to non critical from OK DIMM_B4 was asserted."
Does that mean the RAM stick I have installed is going bad, even after moving it to a different DIMM slot? Does it also mean that I will need to replace that RAM stick in the near future? I have also looked at both of these threads to see how to resolve this issue:
R720 Critical Mem ECC Warning
Troubleshooting memory errors on PowerEdge systems by swap testing

WANGSHUNFA11
2 Posts
September 5th, 2023 00:57
1. Update the BIOS first if your installed version is older than the latest one posted on the support site:
Support for PowerEdge R720 | Drivers & Downloads | Dell US
2. If the suspected memory modules (such as the ones in A1 and B4) still show the same symptom in different slots, replace them.
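The swap-test reasoning in step 2 can be written out as a small decision helper. This is only a rough sketch of the logic, not a Dell tool: the function name `diagnose_swap_test` and its return labels are made up for illustration, and the slot names are the ones from this thread.

```python
from typing import Optional


def diagnose_swap_test(original_slot: str,
                       new_slot: str,
                       error_slot_after_swap: Optional[str]) -> str:
    """Interpret a DIMM swap test (hypothetical helper, illustration only).

    original_slot: slot where the error was first reported (e.g. "A1").
    new_slot: slot the suspect DIMM was moved to (e.g. "B4").
    error_slot_after_swap: slot named in the new error, or None if no
        error has been logged since the swap.
    """
    if error_slot_after_swap is None:
        # No new error yet: keep monitoring, nothing can be concluded.
        return "inconclusive"
    if error_slot_after_swap == new_slot:
        # The error followed the module, so the module is the suspect.
        return "dimm"
    if error_slot_after_swap == original_slot:
        # The error stayed with the slot, so the slot side is the suspect.
        return "slot"
    # The error appeared somewhere unrelated to the swap.
    return "other"


# Case from this thread: error first on A1, DIMM moved to B4,
# and the new ECC warning names DIMM_B4.
print(diagnose_swap_test("A1", "B4", "B4"))
```

In the case described above, the warning moved from A1 to B4 together with the module, which is the "same symptom in a different slot" situation where replacing the module is the suggested fix.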
fastandloud386
5 Posts
September 5th, 2023 17:21
Okay, thank you for the information. We will keep you posted.