17 R4, graphic crashing

Question

Hey all,

I have been using my Alienware 17 R4 for 3 years with no issues at all to date.
(GPU is GeForce 1080 8.00GB)
However, I cannot seem to play games without the laptop crashing after a short period of time (up to an hour), and what seems to be some sort of tearing (not sure if that's the correct terminology, but I get a black screen, followed by random coloured bars at random patches of the screen), the laptop then freezes and crashes.
In order for it to load up again, I have to reboot it several times. This has happened across multiple games I have tried, so is not isolated to a particular game.
If not gaming, I can use the laptop with no problems at all.

Because of this, I looked at the support article here for this model, which included a BIOS update (1.9.0, released on 17 February 2020), and followed all the steps on said article, including disabling G-Sync, which I thought would fix this. It did not.

I have also enabled the "Performance Mode" for the fan settings from the BIOS settings. This makes the fans loud but the problem is still there regardless.

I then used the Support Assist program to run a full test. The laptop passed all but one test. It failed on the Video Card test, after it froze and crashed, followed by a BSoD with the error code: DPR_WATCHDOG_VIOLATION.

I decided to check the Microsoft website help article for this error code and followed the steps there (starting in safe mode, uninstalling any unwanted third-party software etc) but this didn't help either.

I ran the Support Assist program again, this time just doing a Video Card Stress Test.
The moment the test crashed my laptop was as follows:

It was 30% through the Transformation and Lighting Stress Test
GPU Temperature was showing as 67C
Memory Usage was shown at 47%
GPU Usage was 99% (I take it isn't meant to be this high?)
FPS was at 482 (Again, I take it isn't meant to be this high also?

The GPU Clock Frequencies were as follows:
(NVIDIA GeForce GTX 1080 8.00 GB)

Base Clock Frequency @ 1582 MHz
Current Clock Frequency @ 1835 MHz
Base Memory Clock Frequency @ 5005 MHz
Current Memory Clock Frequency @ 4513 MHz

Not sure if this next part is relevant, but under "Video Card Information", it was showing: nvlddmkm.sys 26.21.14.4587, 4-3-2020. PCIe Link Width: x8

At first, I thought it could be from overheating. But as far as I'm aware, 67C isn't such a high temp for a graphics card.

It is also worth mentioning that the GPU fans passed the windows diagnostics test, as did the graphics card itself.

I'm unsure on how to upload files on here (if at all possible) so that I could post my dxdiag data. If it is possible, I'd appreciate a heads up on how to do that.
And if any further information is needed, I will be more than happy to supply that.

Sorry for the long post, wanted to make sure I included as much detail to make it easier to diagnose etc.
Thanks in advance!

_Jean-Michel_ · Answer

Hello,

I have the same problem since 1 week.

I have all drivers up to date and reinstall Windows but this problem is still there.

I can work and look at videos but when I play games, my computer freeze.

I launched the SupportAssist to scan the hardware and it freeze too when checking the graphic card (GeForce 1080). So it should be the problem. I think it's due to a recent update (less than 2 weeks) but i can't find witch one.

I tryed previous drivers to my graphic card but the problem is still there.

Does anyone knows what the problem is?

P.S: Sorry for the faults, I'm French and not used to write in English.

_Jean-Michel_ · Answer

General Information Service Tag : Express Service Code : Computer Model : Alienware 17 R4 BIOS Vendor : Alienware BIOS Version : 1.9.0 BIOS Date : 1/2/20 Processor Processor : Intel(R) Core(TM) i7-7820HK CPU @ 2.90GHz Clock Speed : 2.9Ghz L2 Cache Size : 1024 Memory Available Memory : 76.26% Page File Size : 37,757.2MB Available Page File : 100.00% Virtual Memory : 37,757.2MB Available Virtual Memory : 74.02% ChannelA-DIMM0 : 16,384.0MB ChannelB-DIMM0 : 16,384.0MB Network controller Adapter Killer E2500 Gigabit Ethernet Controller Bluetooth Device (Personal Area Network) Bluetooth Device (Personal Area Network) Storage Drive : C: Storage : 3 Drive Size : 98.8GB Total Available Space : 29.3GB Used Space : 69.5GB Drive : Storage : 3 Drive Size : 931.4GB Total Available Space : 416.2GB Used Space : 515.2GB Drive : F: Storage : 3 Drive Size : 930.9GB Total Available Space : 403.3GB Used Space : 527.6GB Hardware Disk Drives LITEON CV3-8D128-11 SATA 128GB Microsoft Storage Space Device HGST HTS721010A9E630 Display Adapters NVIDIA GeForce GTX 1080 Intel(R) HD Graphics 630 Keyboards, Mice & Pointing Devices Souris HID Touchpad Monitors Écran par défaut Moniteur Plug-and-Play générique Sound Devices Realtek Audio Son Intel(R) pour écrans NVIDIA Virtual Audio Device (Wave Extensible) (WDM)

_Jean-Michel_ · Answer

After few days of active research, i have found this solution witch works for me:

https://answers.microsoft.com/en-us/windows/forum/all/video-dxgkrnl-fatal-error/463585b5-3fe3-4044-82bb-35d946af8367

The default windows driver (date 07/12/2019) resolved my problem.

I hope it can help someone else.

nico-pix · Answer

Hi Stuart_N

Did you manage to solve this issue?

I have the exact same issue on the exact same computer (Alienware 17 R4, GPU is GeForce 1080 8.00GB, CPU Intel Core i7-7820HK, 16GB RAM) since few days, after a BIOS update to 1.9.0 (not sure of what was my BIOS version before the update).

As soon as the load is increasing, with CPU temperatures about 85°C and GPU Temperature about 70°C, the same crash as you describe happens : freezes, and buggy lines / squares blinking randomly, random colored bars at random patches on the screen...

The problem happens exactly the same way and same conditions on Windows and on Ubuntu.

With Furmark + a CPU Stress test, less than a minute is needed to get the crash.

This cannot be an excessive temperature issue (excessive temperature shutdown would have been about 100°C, but here the crash happens below 90°C), but it seems to be linked with some kind of instability when getting warm.

By experience, this generally is caused by insufficient core voltage, but adding 100mV in the BIOS for turbo boost makes no effect (CPU ID HWMonitor tells no voltage difference, and crash still happens)

We need the help of DELL on this subject.

Stuart_N · Answer

Hi all,

Apologies for not replying with any updates regarding this issue, for people searching for similar issues.

After contacting Dell support, I was advised to send my system to the repair depot.

After doing so, they advised me the issue was motherboard related. A certain section of the motherboard had short circuit and they were able to replace the affected section without replacing the whole board.

However, after receiving my system back, I was having issues with some software, stating "No Board Found" or something similar (can't remember exactly). So I sent it back again (covered under repair warranty) and they are currently in the process of replacing the whole motherboard (at least that's what I gather from their communications).

Since I am not familiar with computer repairs myself (I only know the very basics), I would advise anyone else having an issue similar or exactly the same as this to set up a service request and have the Dell technicians take a look at the system.

The time frame was very reasonable (3 weeks from sending my system to receiving it back) and the cost was also reasonable (around the £200 margin).

Sorry I cannot provide a fix, my computer knowledge only covers basics, but this is what I had to do to resolve my particular issue.

nico-pix · Answer

Hi, no problem and thank you very much for your fast answer!

As I'm well trained about laptop repairs and electronics, the encountered issue with my laptop (same as yours), which is instability when getting warm (just after a BIOS update in my case) is most likely to be something like an incorrect hardware voltage (a voltage which is too low causes more troubles when getting hot, while running perfectly fine when cold).

BGA soldering issues/cracks would be causing troubles even when cold, so this can be ruled out.

Short circuit could be ruled out in our case, as there is no over power consumption, and no abnormal hot points with thermal camera. And every part of the computer is working fine (apart from this crash when loaded and hot). The computer is successfully passing all tests from BIOS.

BIOS voltage adjustments may be really tight and not working very well on all laptops considering variability of CPU and GPU manufacturing for example : DELL optimizations about those voltages in new BIOS versions may have been pushed a little bit too far.

In any case in my case, the issue has been triggered by BIOS update. This is why I'm would be interested in having DELL looking at this issue, potentially with a BIOS update (even with a beta) before risking to pay a new motherboard in case the issue may be solved by a different method.

Thank again for you answer, do not hesitate to keep us informed

Looking forward for Dell feedbacks on this topic!

nico-pix · Answer

I successfully solved the problem. I'll need more time to ensure that crash aren't coming back in a few days, so that I can give you a stronger confirmation.

It was related to a bad clock setting about the Intel GPU, which was fixed to "1100 MHz" while the default value was supposed to be... "default". For Intel HD 630, "default" seems to mean Dynamic, 350 MHz to 1100 MHz depending on usage and... temperatures, of course.

So we put it back to default, restarted the computer, and we are now unable to crash the computer. Which is good news!

I used Intel(R) Extreme Tuning Utility to do so. It's persistent between boots, even if you boot another OS.

The fact that the crash were graphics glitches on the screen was a pretty obvious indicator of an Intel GPU misconfiguration rather than a CPU thing (as screens are connected to the Intel GPU). It seems that getting the GPU forced to 1100 MHz wasn't a good idea, probably because this GPU is not working fine at 1100MHz when getting hot (or it would probably need an higher voltage to keep running fine when getting hot).

Dell, if confirmed in the following days, could it be possible to release a new BIOS fixing this issue for everybody?

Woody67 · Answer

Hi

Just wondering what the outcome was in this thread. Understand from several forums that we're not the only ones having problems similar to these. Anything from Dell?? Other than swapping motherboard (which obviously doesn't work).

Would it be possible to get a more detailed run-through of how you used the "Intel(R) Extreme Tuning Utility"? And more importantly, has the problem returned??

nico-pix · Answer

Hi, sorry for the delay (holidays etc)

Some new information : the problem recently came back and quickly became stronger : after few times the computer was sometimes unable to start completely or at all, depending on we weren't sure exactly what.

The Dell auto test then signaled a memory problem, so we cleaned the memory modules contacts and put them back in place, and the problem disappeared again... !

Is this related to the initial issue? It's not sure but not impossible : Just before Intel Extreme Tuning Utility was used, we disassembled the computer to check again everything on the main board (without finding anything). So in case of weak contacts on memory modules, the fact of having it disassembled could have moved contacts a little bit... but it's strange, as memory modules were already removed / placed back before with no effect (first time we did a thermal paste replacement with no change).

So it looks like this problem can also be caused by weak contacts on RAM modules. Cleaning them seemed to solve the issue for now... if something new happen, we'll keep you informed!

A51-06 · Answer

Great job!

M4r1ngou1n · Answer

Same machine, same problem. The GPU around 70 degree froze and shut the system off.

i didn't fix the problem but what i did let me played my games.

what i did is retored the system BIOS to the factory one as described: https://www.dell.com/support/article/en-ca/sln114853/alienware-system-overheats-or-shuts-down?lang=en

and hadn't face the problem yet. clearly there was someting in the bios update tha messed things up.

Hope it helps out someone.

A51-06 · Answer

Ayy hold up, you saying 70c? My 17 R4 with a GTX 1080 OC edition wont reach 70c because I replaced the thermal paste so I would advise you do that soon because a GPUs maximum temp is 70c.

Service Tag :
Express Service Code :
Computer Model :	Alienware 17 R4
BIOS Vendor :	Alienware
BIOS Version :	1.9.0
BIOS Date :	1/2/20

Processor :	Intel(R) Core(TM) i7-7820HK CPU @ 2.90GHz
Clock Speed :	2.9Ghz
L2 Cache Size :	1024

Available Memory :	76.26%
Page File Size :	37,757.2MB
Available Page File :	100.00%
Virtual Memory :	37,757.2MB
Available Virtual Memory :	74.02%
ChannelA-DIMM0 :	16,384.0MB
ChannelB-DIMM0 :	16,384.0MB

Adapter	Killer E2500 Gigabit Ethernet Controller
Bluetooth Device (Personal Area Network)
Bluetooth Device (Personal Area Network)

Drive :	C:
Storage :	3
Drive Size :	98.8GB
Total Available Space :	29.3GB
Used Space :	69.5GB
Drive :
Storage :	3
Drive Size :	931.4GB
Total Available Space :	416.2GB
Used Space :	515.2GB
Drive :	F:
Storage :	3
Drive Size :	930.9GB
Total Available Space :	403.3GB
Used Space :	527.6GB

Disk Drives	LITEON CV3-8D128-11 SATA 128GB
Microsoft Storage Space Device
HGST HTS721010A9E630

Display Adapters	NVIDIA GeForce GTX 1080
Intel(R) HD Graphics 630

Monitors	Écran par défaut
Moniteur Plug-and-Play générique

Sound Devices	Realtek Audio
Son Intel(R) pour écrans
NVIDIA Virtual Audio Device (Wave Extensible) (WDM)

Alienware

17 R4, graphic crashing

Was this post helpful?