Unsolved

This post is more than 5 years old

Closed

5582

November 1st, 2019 14:00

T3610 - Seemingly Random Crashes & During SupportAssist GPU Stress Test

Hi everyone,

I've previously been a fan of troubleshooting on Reddit, but after dealing with my PC randomly crashing for almost 10 months at this point, I am hoping that coming here to the Dell forums is "calling in the big guns."

To start off I have checked and verified that all Drivers, BIOS, and Firmware are up to date.

My issue is that after putting together my PC from mostly used parts (only my SSD and GPU power cable are new) I am continuously struck by an issue of my PC screen going dark on random occasions with my only remedy being to hold the power button and hard-shutdown (I never get a blue screen or error message.)

After the shutdown, my logs report nothing unusual other than "hard shutdown at this time by holding the power button." This issue occurs during very specific stress tests listed below, while gaming, and even during very idle operations like web browsing.

Why I am here on the Dell forums: Oddly enough my PC will not crash even during the most strenuous stress tests such as the one I did today running Prime95 + MSI Kombustor Furmark/Unigen Heaven/UnigenValley for almost a combined total of 2 hours - with no errors. The odd thing is, however, I can replicate the crashes 100% of the time by running the Dell SupportAssist GPU Stress Test.

Specs:

Pre-built Dell T3610 .

  • Intel® Xeon® E5-1650 v2 Processor (Six Core HT, 3.5 GHz Turbo, 12 MB)

  • 4x4GB DDR3 1600 MHz and 1866 MHz RDIMM ECC/Non-ECC

  • EVGA NVIDIA GeForce GTX 1060 3GB

  • 685W Stock Power supply

Symptoms:

I frequently have the exact same issue happen where seems like the GPU crashes mid game or sometimes even idle

  • Monitors connected to Display Port and HDMI both lose signal

  • Mouse and Keyboard are still powered however they lose connection (e.g. my Corsair Keyboard defaults to the onboard color configurations and the Caps Lock no longer registers with the light)

  • Sound output still works (e.g. Spotify will continue playing music, will continue onto the next song)

  • Video output only resumes after holding the power button and restarting the PC

Stress Tests:

While I have had this issue rarely while idle or low stress tasks like browsing the web, stress testing the PC has some interesting results

Specific Stress Tests
  • Prime95 + Unigen Valley - Never Crashed on Extreme - Temps Maxed out near 72 degrees C

  • Prime95 + Unigen Heaven - Never Crashed on Extreme - Temps Maxed out near 72 degrees C

  • Prime95 + MSI Kombustor Furmark - Temps Maxed out near 72 degrees C
  • OCCT GPU and CPU Stress Test - Never Crashed

  • User BenchMark - Never Crashed

Gaming
  • League of Legends - Crashes very frequently (even sometimes on the launcher and getting ready to enter a game) - Temps have been very low during some of these issues

How to replicate:

One way I seem to be able to replicate the issue with almost 100% certainty is running the Dell "SupportAssist" GPU Stress Test. It seems that this specific stress test hits the GPU exactly where it weak and causes the crash.

Conclusion:

Based on these tests I have a hard time determining if the culprit could be my GPU, Power Supply, or neither. It is very confusing to me that I can run concurrent GPU and CPU stress tests with no issue yet crash idle and almost every time I try to play some video games.

I have also looked into the potential of the power supply being too small, however based on the below PSU calculator, I believe my stock 685W that came with the T3610 should be sufficient.

PSU Wattage Calculator

OuterVision PSU Calculator part list

Type Item

Motherboard Desktop
CPU 1 x Intel Xeon E5-1650 v2 3500 MHz Ivy Bridge-EP
CPU Speed 3500MHz
CPU Vcore 1.2V
Memory 4 x 4GB DDR3 Module
Video Card Set 1 1 x NVIDIA GeForce GTX 1060
Core Clock 1506MHz
Memory Clock 2002MHz
Storage 1 x SSD
Storage 1 x SATA 7.2K RPM
Other Device 0 x USB 3.0 Device
Keyboard 1 x Gaming Keyboard
Mouse 1 x Gaming Mouse
Fan 1 x 80mm
Computer Utilization 16 hours per day
Gaming/Video Editing/3D Rendering Time 4 hours per day
Monitor 2 x LCD 27 inches
Load Wattage 411W
Recommended Wattage 461W
Amperage +3.3V: 10.2A, +5V: 11.3A, +12V: 26.7A
Recommended UPS Rating 1000VA
Generated by OuterVision PSU Calculator 2019-10-11 02:29:50

6 Posts

November 2nd, 2019 05:00

You say that the GPU Temp maxed out at 72° and that should be OK, but it might be card specific. I had a Quadro 4000 on a Lenovo workstation that would cause instant shut down when it hit about 90°c, even for an instant. The code is built in to immediately power off the computer to prevent damage to components. Run HW Monitor (https://www.cpuid.com/softwares/hwmonitor.html) during your day and it'll note the peak temps. My problem: DUST. My machine was  in an open rack but near the floor and I'd neglected to clean it out. It was massively overheating, but a good clean and all was well. However some cards are much worse than others and to be safe I swapped the Quadro 4000 for a K4000 which uses less power and stays much cooler. 

If it's not heat (and the inbuilt "protective shutdown") it could still be that one spot of GPU memory that's duff or any number of things. PSU's are often the culprit for bad behaviour - they don't last forever and fluctuating power causes all sorts of symptoms, like disks dying (so you think it's just disk failures) and so on. Again, HW Monitor will show you voltages....good luck, there's nothing worse than trying to narrow down nasty problems like this. It's worth collecting a box full of spares so you can try replacing one bit at a time to narrow things down. Also, get a replacement PSU to keep in storage - they won't be available forever for some of these custom-design machine

November 2nd, 2019 14:00

Thanks GenghisQ, thats a really good point about having a backup power supply so I'm ordering one now and will see if it helps with the issue. I have unfortunately had the problem since building the PC when it was immaculately clean so I don't think any new dust is the problem, but I will also do a deep clean when I try out the new power supply.

6 Posts

November 2nd, 2019 23:00

I'm only talking about dust and muck building up on the fans and vents to restrict airflow, so if your fans are all working and there's no obvious clogging you should be OK (again, monitor the GPU temp to see how it's going). I'd be borrowing another graphics board to try out if possible - although blown graphics boards are not a good thing it's much better to be able to pinpoint the problem to a single replaceable card. I did have a Quadro go belly-up after only a year or two (it was replace under warranty) so a faulty card can happen, and is one of the least nasty outcomes. A bad PSU always leaves you wondering what else might have been damaged...

1 Message

March 3rd, 2020 18:00

Is it turned out yet what was the problem? I have got the same issue... Thanks.

2 Posts

April 9th, 2020 19:00

I have the exact same problem with almost the same components. It's honestly driving me mad.

These are my specs:
CPU: Intel Xeon E5-1650 v2 @3.50GHz
GPU: Nvidia GeForce GTX 1070 Founder's Edition 
RAM: 64Gbs DDR3
OS: Win 10 x64
Storage: (1x) SSD Sata III 2.5" (1x) HDD
PSU: 685W Stock

This is what I've tried so far:
I replaced the CMOS Battery.
Tried running the PC with another GPU (GTX 1080)
Installed Win 10 on both SSD and HDD and ran them separately
Tried 2 different Monitors (I thought it was a monitor problem)

Nothing helps and like you said @AcesOfSpade it keeps freezing randomly with the only remedy is having to perform a hard shutdown. Have you solved the issue perhaps?

April 9th, 2020 21:00

@Krokodilus @Zzeino  - The issue ended up being a bad video card, I probably should have gone with a more refutable seller from ebay. After I replaced my card with an RX580 8GB I had no issues.

2 Posts

April 9th, 2020 21:00

I replaced my GTX 1070 8gb with my friend's GTX 1080 8gb (that had no problems for him) and the same issue still persisted.

I can't think of anything else to do but replace the whole motherboard, which is gonna force me to replace the RAM (since they're DDR3) and the CPU. 

1 Message

May 8th, 2020 02:00

I had my motherboard changed by Dell Support but still problem persists. Also tried diferent graphics card. Also no heat anomalies. Other ideas?

May 8th, 2020 03:00

@AcesOfSpade  Sorry to bother, but I see you've put in an RX580, I'm planning to do the same, but the power requirements of that card is stressing me out. (Sapphire Nitro+ RX580 8GB ~ 235Watts).

May I ask how you set it up power connection wise? (my card needs both 8 pin and 6 pin)

How much power does your card draw?

 

(I made a post about it here if you want more info) 

1 Message

August 13th, 2020 01:00

same problem with 3 different pc

1 Message

December 5th, 2020 08:00

Hi everybody
I have a similar problem, on a similar configuration, only 32GB of RAM.

The black screen was happened on the EVGA 1060 6GB video card in games, and then in the browser.

Replaced the video card to 1650 - 4GB. At first (2 months) there was no problem, then it appeared again.

I ruled out the problem with the video card.

To reproduce this problem, I used Dell SupportAssistant (test graphic card, normal and full test)

The solution I was found on the Nvidia forum.

https://www.nvidia.com/en-us/geforce/forums/geforce-graphics-cards/5/272303/black-screens-new-1080gtx-fe/

You need to turn off VT-d in BIOS (this is the second checkbox in the virtualization section)

During 2 weeks there were no black screens. Try and describe your tests

Thanks

1 Rookie

 • 

7 Posts

May 7th, 2021 11:00

I've got the same problem, same video card and same specs except the CPU is an E5-2650v2. I replaced the 425W PSU with a 685W. The system is cool and clean with a max of 90F (I'm old). Swapped cards with known good ones. Turned off VT-d, then all virtualization. Nothing in the logs indicating a system lock. I'm running out of ideas. The only clue so far is that I dind't get any black screens on an RX 460, which requires no extra power. Would appreciate any input.

1 Rookie

 • 

7 Posts

October 14th, 2021 09:00

As a follow-up for future problem revolvers - replacing the 425W PSU with a 685W unit has done the trick. But I had to uninstall a 3rd party antivirus product. The AV app replaced a couple Dell HW drivers with its own. How utterly stupid on the part of the product developers using such shoddy tactics to inject itself into a more privileged level of the OS. For my son and my purposes - Microsoft Defender is well able to handle a multitude of security needs for browsing the Web and playing multi-user games. It doesn't have to fight the OEM and OS software to do so.

2 Intern

 • 

202 Posts

October 14th, 2021 18:00

The best RAM test I know is TM5

https://www.overclock.net/threads/memory-testing-with-testmem5-tm5-with-custom-configs.1751608/

(original: https://testmem.tz.ru/testmem5.htm

use google translate if you have troubles with Russian)

It runs in full power only with admin credentials, but still can test 2Gb chunks in parallel from userspace. Run it two times first: at first run it configures a ram direct-access driver, at second it can test with high performance. Configure it for 200 cycles, for 2 channels... even if you have 4 channels)

 

don't forget DELL's RMT feature in the bios, it should be anabled to read logs. ONLY ECC RAM leaves logs, unbuffered DIMMs may be checked only with TM5.

 

>4x4GB DDR3 1600 MHz and 1866 MHz RDIMM ECC/Non-ECC

You didn't mix ECC with non-ecc right.... did you?

1 Rookie

 • 

92 Posts

October 14th, 2021 19:00

There was a time ago, when i was running the SupportAssist Hardware test, my system was crashing. And it was happening because INTEL VROC DRIVERS version 6.xxx. When i upgrated my drivers to version 7.xxx the crashes disappeared.

0 events found

No Events found!

Top