Start a Conversation

Unsolved

16 Posts

7687

July 26th, 2022 11:00

XPS 8940, randomly freezing

XPS 8940

XPS 8940

So here's a fun one. I too (as many on this forum have), have been having random system freezing issues with my Dell XPS 8940 with Nvidia RTX 3060 Ti video card. After a lengthy phone call with Dell support, the support engineer in their infinite wisdom has decided that the system doesn't have a hardware problem and if I'd like any further help resolving this situation, I can fork out another $100 for "software" support. System is less than 1 year old and still under warranty. I spent close to $2500 for what was supposed to be Dells "top of the line" Desktop. So, I'm left with 2 choices - either live with a problematic dell desktop or shell out another $100 for their software support and HOPE that their software support personal can help me fix this issue. Needless to say, I will be seriously considering whether or not to purchase ANYTHING else from Dell, ever.

4 Operator

 • 

1.7K Posts

July 26th, 2022 11:00

Well, I see Support has changed its tune. Used to be re-install the drivers, doesn't work, Windows... and then might be motherboard of video card need to be replaced.

It is not Windows or the Video card.

I wonder if they want to find someone that they can 'test' on to determine the cause?

Anyway, some more info from you, what BIOS are you running?

Did you try back-leveling the BIOS to V2.3.0? That fixed it for most of us.

I assume you've used the Intel HD750 and had no problems as well? Ask Dell how can they explain that if it is a s/w problem?

Another choice, get a different Video card, but NOT Nvidia.

16 Posts

July 26th, 2022 12:00

Hey Irv,

Received system with Bios 2.4.0 - Currently running 2.8.0 (recently updated BIOS)..  It is my understanding that there are some significant security vulnerabilites that have been patched in BIOS since 2.3.0, I'm somewhat hesitant to downgrade that far back.  I have not pulled the RTX card out of the system as I'd also lose 1 of my 2 monitors to do so.     I may have to bite the bullet and go find me a decent AMD Radeon someplace and throw away the $$$ that I spent on the Nvidia...   It's more an annoyance than anything else as a simple FORCE power off, reboot and I'm up and running for anywhere from 3 hours to 5 days before the system locks up again.   I used to be a Dell fan, but after this experience and being invited to spend yet more money if I want support for a system less than a year old that I've already spent a ton of money on, I'm much less inclined to either A) buy future Dell products and B) recommend Dell to others...    I've been in IT for more than 30 years (primarily in Systems Administration and Engineering and I do have many people ask me what I think of various hardware/software companies.   I can tell you that after this support call and the current "recommendation for further help", my willingness to recommend Dell has decreased significantly.

If Dell is looking for someone to "test" on, I'm waiting with an open mind and ears. Unfortunately I have yet to find a way to reproduce this error on command.. It seems to happen most often when I have like either Netflix or YouTube open (not necessarily playing anything - but open).  However even that is not 100% reliable to cause the system freeze.   If Video is playing, it'll start to stutter and then the keyboard/mouse become unresponsive, but not always.     

If I were the only person to have experienced this problem, I'd be a lot less inclined to complain this loudly about being asked to spend even more money for any help.  The hardware support guy did NOTHING more than I have already done at least a half dozen times since I've had the machine trying to figure out what's going on.  System logs are useless at finding root cause.  

Thanks for listening.

 

Steve

4 Operator

 • 

1.7K Posts

July 26th, 2022 13:00

Steve. I hear you.

I've been in this boat since V2.4.0 came out.

Yes, going back that far is not the wisest thing to do. But I suspect CVE risks to a home user is minimal.

I am on V2.8.0 myself. I've not had, what I call, a 'Lock-up', since 7/15.

Take a look at my Reliability Monitor:

aa.jpg

Concentrate on the red box.

I put on V2.8.0 on the 7th of July I think? On 7/19 I added some RAM. Since then, working fine.

I do have another question? How much RAM? Total size and number of DIMMs? Do you KNOW the RANK they are running at, Single or Double.

I ask as that could be a hint or possible verification of why?

See, there is another user here, with 2 'identical' configurations, when shipped basically. One doesn't lock-up, the other does. The User did make a change though. Took the 2 8GB DIMMS from one XPS and added them to the other one. Then Bought new RAM so he'd have 64GB''s and the other XPS (that does Lock-up) has 32GB's.

What is the difference you ask? The RANK of the DIMM's. Locking up one is SINGLE Rank, the one that isn't, DUAL Rank..

I suspect you have 2 8GB DIMMS for 16GB (standard base RAM on 8940's). You can use CPU-Z to get that info.

ispalten_0-1658865818736.png

 

Yes, we all have the same issue, happens randomly, no one can find any info why, and no common action or cause it seems.

I am sure you know the hardest to debug are random event, you never know what kicks it off.

Me, I'm sure Dell is working on it, as each release of the BIOS from V2.5.1 to V2.8.0 seem to have the problem occur less often. They just have not fully located it in the code (I don't think they have a system that can reproduce the problem?).

Yes, it has taken way too long... and I have lost data in some programs (settings mostly as the program had them open, reading, or writing them when it happened and the file was corrupted... not fun at all.

I would if nothing more put down V2.3.0 to VERIFY you can keep running a few days with no lock-up. Then use V2.8.0 as it seems the most stable to me... and maybe SOON Dell will have a new BIOS with the fix.

16 Posts

July 26th, 2022 13:00

And I thought I jumped through hoops to find the root cause....       Very thorough there Irv. 

 

At the moment I have 32 GB (2x16).   Am considering an upgrade to 64 (4x16) as even at idle I sit around 19-20GB used RAM (Actual IN USE - not cache in use)....   Between Docker, WSL, VScode, and a few other utilities that are generally running on my machine - I keep it busy.  I've only been on 4.8.0 now for about a day now - I'll have to see what's what over the next few days.   Fingers crossed "we're getting warmer"....   If it continues to be a problem, I'll have to dump the Nvidia (now there's something to make me sad) and go with an AMD...    Maybe I can find a friend that has an extra that I can borrow for a few weeks and see if I get the same results with an AMD card (not expected based on what I've read in the forums).    It would be nice if Dell would step up to the plate, admit they have a compatibility problem of some type with their 9840 system boards, recent BIOS builds, and Nvidia GPU's.  Would even be nicer if they would offer to trade for comparable AMD card for folk that are affected by this and are still under warranty, but I'm not going to hold my breath on that one.   Agreed CVE risk is minimal, however - I use my machine for both personal and professional purposes and as such am loathe to not be on the most current BIOS/Firmware/OS/app patches so 2.3.0 is a no-go for me as a solution.  I think at the moment I'm going to let 2.8.0 cook for a few days and see if it's any better.   With your mention of dual rank not locking up may accelerate my plans to move to 64GB RAM on the system (4x16) vs single rank RAM config.   I've kinda been kicking around a RAM upgrade anyway.  Anyway, your recommendations and insights are greatly appreciated.   Hoping someday soon I can update this post that I've not seen the lockup in months and all is well, ideally without having to dump the Nvidia card...   

Have a great day!

Steve  

4 Operator

 • 

1.7K Posts

July 26th, 2022 14:00

Steve, it seems that Dell ONLY ships SINGLE RANK DIMMS. It you still have the original, either 2 8GB's or 1 16GB's (which I have, I intended to go up to 32GB's later and I did it this month finally) you more than likely have SINGLE RANK. Do not confuse that with Single or Dual Channel. It is the Access Method basically of using RAM.

The thought is Single RANK is the cause. Please use CPU-Z to check. It is NOT a function of how much RAM you have.

They are not the same, other than they use the same 'name'.

Look at these links:

https://www.oempcworld.com/support/singlevsdualram.html 

https://www.hardwaretimes.com/single-channel-vs-dual-channel-ram-which-one-is-better-for-gaming/ 

RAM DIMM and Channel are not the same other than by name. I suspect you already know this though. Still good reading.

I had worked with Dell Support I guess in Jan. when this first hit. I 'suffered' the re-installs of Windows, Safe Mode, all that, and all the drivers. Discovered using the HD750 on-board video I had no problems. So Dell Support replaced my Motherboard AND Video card. Still happened. Tried another motherboard, still a problem. I asked for an AMD card... and they said sure, until they discovered they could only replace what I had. At one point Dell wanted to exchange my XPS, but I didn't like the terms. May be refurbished, will be what you have or better, might be new, might be the 8950. I could NOT delete anything. No-go, some programs I have are single use license, and I'd have to remove them before I put them and register them on a new PC. Worse, it took me a few weeks to get everything the way I want it. New system I might not be able to restore it with my back-up either... refused.

If you are not into heavy gaming, try the HD750 on-board video, it will not fail. Once Dell fixes it you can go back to the Nvidia card. Other choice you have.

 

 

16 Posts

July 26th, 2022 16:00

Irv,

Thanks much for the background info.   I was familiar with # of channels but not particularly familiar with single vs dual rank.   Those articles were awesome...   I have double checked my PC configuration using CPU-Z... Sure enough, although both 16GB DIMMS were dual channel, they ARE in fact Single rank.  That should be resolved on Thursday, I have 2 new 32GB dual RANK DIMMS coming my way.  I'll pull the single rank 16s and put them in a corner somewhere to be replaced by the new 32s.  Maybe between that and 2.8.0, this will stop being an issue for me.  My biggest issue with the HD750 is that there is only a single port.   I would not be able to survive (ok, maybe that's a little over dramatic) with only a single monitor, but my life would  big time and I'll be a very sad engineer trying to fit my entire view onto a single monitor.     Cross your fingers and say a little prayer for me that the dual rank and current BIOS put this silliness to bed.   Again, you are a Godsend with your knowledge on this subject...  MUCH more helpful than the Dell support engineer.   I greatly appreciate your time and efforts on my behalf.   Dell, if you're reading this - hire Irv.... He knows his stuff!

-Steve

4 Operator

 • 

1.7K Posts

July 26th, 2022 16:00

Steve,

When you get the NEW DIMM's in that are Dual Rank and they are the ONLY ones in the XPS, please PM me or reply here with the details. I can probably pass it onto someone in DELL, it might be the CLUE we are missing.

Only problem, we might think it is fixed only to have it pop up again days or weeks later.

If you DO work, go to DUAL RANK, I might just buy another 16GB Crucial DIMM and junk the Sumsung 16GB Single Rank Dell provides.

Good Luck.

Oh, Dell didn't hire me but they did notice I help people and made me a 'Technical Expert''. I can put that on my signature and use a unique Avatar, I've chosen not to. Thanks for the compliment though as well. Glad to help.

1 Rookie

 • 

85 Posts

July 28th, 2022 05:00

My XPS 8940 has a 16GB single rank stick (SK Hynix) in slot# 1, and I added a second 16GB dual rank stick (Crucial) in slot# 4.  CPU-Z indicates memory is dual channel.   Motherboard is 0K2CM7 and GPU is a Nvidia RTX 2060 Super.   Have not experienced freezing problems while in Windows (using Win 10 Home), but did experience PC locking up during startup shortly after the Dell logo.  This seems to be BIOS related, (currently at 2.8.0), but also disabled unused Sata drives per one recommendation here. 

4 Operator

 • 

1.7K Posts

July 28th, 2022 06:00

Interesting....

I am set up slightly different after buying and adding a Crucial 16GB DIMM (can operate as both Single and Dual Rank) to my existing Single Rank Samsung.16GB DIMM.

Original Samsung was in slot 2, 2nd below the CPU. I put the Crucial into Slot 4. I am on V2.8.0 and have had no lock-ups since 7/15, having added the DIMM on 7/19.

I've begun not to trust either CPU-Z (and other programs) or the XPS how it is reporting DIMM slots.

For instance, this is what I see in CPU-Z for both DIMM's:

c1.jpg

This shows I'm running Dual Channel and speed is 2933Mhz as it is DDR so it is 2X

c2.jpg

Here is where it starts to go wrong I think?

First, it is called Slot 1? But the DIMM is in Slot 2? Rank is Single which I know the DIMM is.

c3.jpg

The Crucial (actually it is Micron) it says it is in Slot 2, but I know it is in Slot 4. Also Single Rank?

Another program however does call them by the correct slot, AIDA64:

c4.jpg

DIMM2 and 4 which corresponds to the slots.

You are actually running 2 channels though. Slot 1 and 3 are the 1st bank (channel) and 2 and 4 are the 2nd. Normally one would put the White Clips first (at least that is how Dell does it I think?).

Check this link, https://www.intel.com/content/www/us/en/support/articles/000005657/boards-and-kits.html  as it explains more what DIMMs go where. From it:

This is how Dell ships a single DIMM system:

ispalten_0-1659015891180.png

 

This is what I've done:

ispalten_1-1659015960667.png

 

What I believe you have is 2 channels each with a single DIMM in it. However, the Crucial should have been in DIMM1 or Slot 2. Maybe that is why you having a problem after the Dell Logo.

Yes, you are Dual Channel as you are using both. It is possible the Hynix will show Single Rank, and the Dell Dual Rank as you've got them in different banks (channel).

I'd really be interested in seeing your CPU-Z output for both DIMMs?

 

 

1 Rookie

 • 

85 Posts

July 31st, 2022 10:00

Been awhile since I installed the extra memory, so opened up the box to see where its installed. The Hynix is in slot#2 (channel#2) and the Crucial is in slot#4 (channel#1), and task mgr shows the memory is running 2933MHz. However, CPU-Z shows these sticks in different slots for whatever reason. Here are my screenshots.

 

 

ishmael_1-1659288085425.png

 

ishmael_2-1659288137370.png

 

 

ishmael_0-1659288005890.png

 

4 Operator

 • 

1.7K Posts

July 31st, 2022 11:00

Hmm, what you have is you are using 2 Banks/Channels. Each DIMM is in its own channel, but it will run Dual Channel that way. It is also Single Rank, and that is not what I'd expect (assuming the Crucial DIMM is 2Rx16)? Each Bank/Channel should be its own Rank I'd think? Maybe not, if you have one Single Rank channel then that is how the Memory Controller works?

You might have 'solved' the CPU-Z slot naming method. Bank 1 is slots 2 and 4, and Bank 2 is 1 and 3. So in my case I'm in Slots 2 and 4, which is Bank 1 and CPU-Z reports my DIMMs in Slots 1 and 2, e.i., the first Banks slots. You see Slot 1 and 4, the 1st slot in Bank 1, and the last possible slot, the 2nd in Bank 2 which is 4.

Any, I am freeze with mixed Rank DIMMs in the first Bank, Slots 2 and 4.

This not the solution, using the same or different banks for DIMMs still fails.

Some have tried on Dual Rank and those have had failures too.

Seems everything is a dead-end so far. Yes, some with the Nvidia cards can work fine, but the reason is clearly unknown.

4 Operator

 • 

1.7K Posts

August 1st, 2022 07:00

By the way, since you are using both channels, and each is not fully populated, you might not be getting the full performance from the RAM. Go back and look at my CPU-Z captures from ‎07-28-2022 09:50 AM.

My Unicore Frequency is 4x or so what you have and Row Refresh Cycle Time is almost 2x as fast.

Move the Crucial DIMM to slot 4 (physical, 4th from the CPU) and I'd expect those numbers to match mine.

Not clear though if you'd notice and improvement or not? I guess it is if all Single Rank on one channel has the Memory Controller perform better than running 2 individual channels, one Single Rank, the other Dual?

1 Rookie

 • 

85 Posts

August 2nd, 2022 11:00

Not sure this is clear, here is where my memory is located. Note: slot#1 is closest slot to cpu.

Channel# 1: Slot# 1: empty, Slot# 3: Crucial
Channel# 2: Slot# 2: Hynix, Slot# 4: empty

I would think moving the Crucial to slot# 4 would eliminate dual channel. Yes? Also, it appears that Uncore frequency can vary up and down, at least looks that way in mine after just checking it compared to what I posted above.

4 Operator

 • 

1.7K Posts

August 2nd, 2022 13:00

OK, i'll assume the Frequencies can vary? Mine appear solid though? Might be the configuration you have. don't know?

First, from what I've read, most sites when the talk about 'channels' or 'Bands' they assume 2 DIMMs in each.

When they talk about Dual and  Single channel, they also assume fully populated.

I've seen not that fits your specific case, 2 channels, one Single Rank, the other Double Rank.

In most cases, I've seen sites that consider only 2 DIMMs in a channel and do some measurements. Dual Channel is 'better' but it also depends on the application at the time.

I guess you can do some performance measurements yourself, as-is not and with them both on the same channel.

With both on the same channel, the Memory Controller only has to deal with one channel, and that, to me, would mean less switching between channels. I assume that means less time lost not accessing RAM.

If you do find out that what you did is faster, post it here, I am interested in that.

I also suggest you move the Crucial to Slot 4 and use CPU-Z to see if those frequencies are higher? Lower readings could be due to the Memory Controller looking for the empty slots to see what they contain, and in my case they are all in one channel.

1 Rookie

 • 

85 Posts

August 3rd, 2022 05:00

My understanding, when utilizing dual channel, is that the system will access both channels simultaneously. https://beebom.com/single-channel-vs-dual-channel-memory/ In terms of performance, studies show only slight improvement. But if you have the right amount of the right memory sticks then there does not appear to be any good reason to not configure dual channel.

Can you show the cpu-z output for your configuration and which slots exactly are your memory sticks located in. Also, can you post the memory speed from task manager/performance /memory.
Btw, the command wmic devicelocator also does not show the actual slot either, similar to cpu-z. See, https://social.technet.microsoft.com/Forums/en-US/87d923a3-5f84-4ae9-a61b-32c65e76a6ae/actual-ram-slots-used-wmi?forum=ITCG
No Events found!

Top