Unsolved
18 Posts
1
8866
March 11th, 2021 19:00
Aurora R10, 5950X, random power-off restarts
About once a week or so, my R10 abruptly powers off and turns on again. There's nothing special going on on the machine when this happens - it's always low load conditions, low fan, cool system (basically near idle CPU). This has been basically happening since it was brand new (about a month old now). It has never happened while gaming or under any high load activities.
The machine config: Ryzen 5950X, 64 GB (3466 MHz), RTX 3080, 2TB SSD (stock config from factory)
In the Windows Event Viewer, the last *error* entry logged is:
A fatal hardware error has occurred.
Reported by component: Processor Core
Error Source: Machine Check Exception
Error Type: Cache Hierarchy Error
Processor APIC ID: 2
The details view of this entry contains further information.
There are *lots* of other warnings like this:
A corrected hardware error has occurred.
Reported by component: Processor Core
Error Source: Unknown Error Source
Error Type: Bus/Interconnect Error
Processor APIC ID: 0
The details view of this entry contains further information.
Based on online searches, this seems to be related to the 5950X itself when it's running against the RAM at 3466 MHz (as on my system build sheet). Does anybody else have this configuration and face these issues?
https://www.reddit.com/r/AMDHelp/comments/jsl5ie/unexpected_hard_reboots_whealogger_cpu_warnings/


phpfreak
44 Posts
0
March 15th, 2021 19:00
I forgot to add in. I forgot to check the ram if it is overclock. I will try to remove it to see if that works. If that's the problem it is still depressing for a system that only run at 2666Mhz. Thank you for the tip.
phpfreak
44 Posts
0
March 15th, 2021 19:00
Hi, my ram is not overclock. However, I disabled the overclocking on the bios. I want to see how long it will run.
koshyjohn
18 Posts
1
March 15th, 2021 21:00
The variables affecting your system (config, modifications, symptoms, results, etc.) are different than mine (and Nzumbe's, which appears identical to mine but in a separate thread).
To make it easier for other people, including Dell support / possibly engineering, to follow the specifics of each situation, I'd suggest starting a separate thread in the forum and detail out what you have here (copy-pasting your notes here should be fast). You can also leave a link back to this thread too to suggest some commonality.
We can dive deeper into questions and suggestions at that point (and I have some of each for you).
Does that make sense?
phpfreak
44 Posts
0
March 15th, 2021 21:00
Too late. I installed the alienware command center and oc. I am trying to increase the fan speed. While tinkering around with radeon software to adjust the brightness to clear up the font viewing, it reboot again. This system is not usable right now for actual normal used since I am on the lowest load.
phpfreak
44 Posts
0
March 15th, 2021 21:00
Same problem with rebooting but I found out mine is the heat problem. When the heat reach 65 degree and even with a sudden short spike, the system going into reboot.
-Disabled overclocking on cpu and memory.
-Fan going 36% faster which lower down the temp at 31 degree.
-CPU cores are running at 2.1 ghz with few cores going up a bit and back down.
I able to run longer hours until just now while tinkering the Radeon software. I don't want to overdone the radiotor and fan replacement, or move my cpu and gpu to a new mobo/case yet since that would definitely void my warranty and I only start using it since last night.
phpfreak
44 Posts
0
March 16th, 2021 10:00
So far my computer is running now with auto like when it was shipped with chasis closed. House AC is off and room temperature is 73 degree F. GPU temp is low, speed and utilization is low since it is at low load as always.
CPU 3591.88Mhz with random cores spike to 4.3 to 4.8ghz
Ram is at 1333Mhz x 2 = 2666Mhz
Front fan 100% and top fan 75%
CPU: ~37C degree
No reboot so far since the CPU is kept at 37 degree Celcius in average. So, the problem is still there and this is not normal. The last reboot, was
Chasis closed
CPU 3591.88Mhz with random cores spike to 4.3 to 4.8ghz
Ram is at 1333Mhz x 2 = 2666Mhz
Front fan 37% and top fan 36%
CPU: high 40s to 50s degree. It reboot at 55 degree from the time I let the login page sit for few minutes while sending message to the support on my another computer. After I able to login, it reboot itself but I managed to see the temperature reading from Core Temp. Core Temp and Hwinfo64 reporting the same thing.
The conclusion is there is a heat problem with my system. The cpu probably couldn't tolerate heat which is less than allowable heat so cpu is probably having an issue too. I read from other post which somebody having similar issue which include both cpu and motherboard. He replaced his motherboard after they found out his mobo has issue. After the replacement of the cpu it was resolved. For my case it is hard to tell because I don't have another 5000 series cpu to diagnose or another Ryzen motherboard. All I have is threadripper, old amd phenom, intel laptops. Going to buy some parts to build another system with cheap $54 nvidia 730 msi small gpu while working on this.
Compy-H2O
4 Posts
2
March 16th, 2021 11:00
I have the exact same issue and configuration the 2021 R10 with 5950x except I am running a Rtx 3090. I am running with no overclock at all, and my 64GB of ram at 2933mhz and still having the same problem. Just random restarts, usually while using office or browsing the web, once while playing LOL, but never while doing more intense tasks like playing COD Cold War.... I was starting to fear that there was maybe something wrong with the water cooler and was maybe overheating, but the temps have always been fine, low 60s. I Installed another m.2 4tb pcie ssd through slot 4 and am also having an issue with the computer is not detecting the drive if I shut the computer off and reboot. In order for it to detect the drive I have to put the computer in sleep mode. When I wake it my drive is suddenly available again. These issues need to get fixed fast. This is supposed to be mostly a work computer video editing etc, and it just isnt reliable at all.
phpfreak
44 Posts
0
March 16th, 2021 13:00
Sorry to hear that. Yeah, my drive couldn't be detected on 2nd nvme on pcie so I pulled it off. Right now I just leave everything blank except a ssd. I am going to ship it back and have them tinker it or replace whichever component that is not good.
Current idle is at 36-37C (which I kept referring to low load) but I am making that correction. While doing nothing and just sitting there the temperature is really high. At least your system could still run at low 60s but if the problem is still there you better send it back to have them take a look. While on low load it will jump to 40s and 50s. On stress test 40s - near 70s. It will only work because I set my front fan at 100% and top fan at 75%. If I am using the default profile at 27% fans, the system will reboot itself even for few minutes used with temp jump to 50s, 60s where it is going for reboot at anytime. The first I was using it I thought it will only reboot at 85. So there is something there beside the heat. My 5950x simply cannot handle high heat.
Try adjust your fan speed and set a profile for that. However, be warned it is loud like vacuum cleaner. I don't have the sound measurement now but it is extremely loud. With chasis open, it is cooler and of course with faster fans it is even better. Well, this is the first time I don't have any reboot with 100% / 75% fan running and has been running like maybe 6 hours now?
I am waiting for the support to send me the shipping label and will pack it in the box later. It is good thing that I still have my Threadripper 1900x and 580x runing 24/7.
Nzumbe
13 Posts
1
March 17th, 2021 02:00
Heya!
Hope you're doing fine and had some success with support! Just wanted to forward you a message from one of the Dell-cares staff member:
That is actually a quote but is quite missing the point of our problem. I'm still in contact with support but had since this answer the worst experience so far. But still not giving up, because if 3400MHz XMP is not expected to run, they shouldn't sell it.
Please let me know if you had more success. Feel free to also contact me with a PM
phpfreak
44 Posts
0
March 17th, 2021 09:00
Unfortunately for me, no news of getting shipping label yet. System is not running out of the box without painful configuration.
-Default 27-31% Front and Top., when chasis is closed, 3.4 ghz - BAD.
-100% Front fan and 75% top fan, when chasis is closed , 3.4 ghz - Good.
-37% Front fan and 36% top fan, when chasis is closed, 2.1 ghz - Ok but reboot is still possible
-27-30% Front and Top fan, when chasis is opened. 2.1ghz to 3.5 ghz.---Ok but Reboot is still possible
It could be poorly thermal grease application on the cpu or bad cpu. Either way, I am still waiting for dell to send me a shipping label. I am already considering return the whole thing and save the money for threadripper pro.
koshyjohn
18 Posts
1
March 17th, 2021 20:00
Ok so update on my machine: Based on private messages from Dell-Cares, I sent out a very detailed record of the analysis of what's going on with my machine (summary of this topic basically), along with a dump of event-logs from when a abrupt-power-cycle event occurred. This was based on an ask from me to bypass the support escalation stack (properly justified with supporting technical details, after basic troubleshooting) and go straight to product engineering. Of course this level of analysis would not have been possible in the first place without Vanadiel's pointers - so a lot of appreciation there.
Within the day, I was told Dell/Alienware's product engineering wants to 'capture' my PC to run further tests on it to understand what exactly is going on. It's going to be replaced with a equivalent refurb (a system they've already opened and tested functionality on) so that they can do this testing freely on mine. Hopefully this benefits everyone buying or who've bought this configuration.
I've owned many high-end machines over the years (including from Apple), but this is on track to be one of the smoothest support interactions I've had when the machine's had issues right off the bat. I'll only breathe easy if the replacement exhibits none of these issues, but I'm hopeful since it was described as having been taken out of the box to test already.
Vanadiel
8 Professor
•
7.1K Posts
•
29.6K Points
1
March 18th, 2021 03:00
Do keep us posted, because this sounds very interesting. When the new machine arrives we can compare with the old one, see what is different.
phpfreak
44 Posts
0
March 22nd, 2021 09:00
Hi, I will. I am still waiting for them to send me a label. Right now I don't want to tinker with it too much since I don't want to void the warranty. The only reason I bought this system instead of building one myself is because of CPU and GPU which is non-existent right now on retail market. A month later, now I have heavyweight non-working system which is sad. I wish I waited for threadripper pro to come out. 128 lanes!!
phpfreak
44 Posts
0
March 23rd, 2021 05:00
This is Dell last reply after promising me a label.
"Thank you for getting back to us. We are sorry for the delay in getting the service done. We will go ahead and check this for you. Once we have an answer, we will let you know."
Before that, I decided to keep the system but return it to them for repair. However, now I feel like to get refund for the thing. If today, I am not getting the label I will ask for the refund. Will use the money to build a new system (epyc or threadripper pro) and let go of the gaming side.
I don't like to come down to this but I am not getting anything out from this purchase.
phpfreak
44 Posts
1
March 23rd, 2021 15:00
What a let down from Dell again. Never ever buy pre-built again even if parts are unavailable at retail. I will just wait them out. It seems like they couldn't provide any service at the moment for this system.