New 5950x / 3090 Build - Frequent Crashing

jklondon

Bronze Level Poster
After much wait got my 5950x and 3090 machine but its just restarts unexpectedly after a hours or so - its very random.

Looking at windows event viewer once I got a critical 41 - Kernel-Power error.

Then another time the machine didn't restart at all but the monitor went dead, the lights were on in the machine. Had to hard reboot.

Very disappointing - any advice?
 

nawru

Member
You should provide a bit more information about your system. The whole spec is required and BIOS version/date from msinfo32.
 

Ryddill

Member
I recently got a build from PCS with a 3090 and experienced crashing issues too. I've posted on this in another thread, but in short I reinstalled the Nvidia driver within Geforce Experience by doing a clean custom install and unselecting HD audio.
There's a number of threads in the Nvidia forums suggesting the audio driver on RTX cards clashes with other installed audio drivers (in my case Realtek driver with my Asus motherboard).
What I expereinced was my screens going black, audio bouncing between monitors and headset (some times more than one), BSOD, computer restarts, or as basic as applications closing (one or any combination of these). My crashes were random, 15 minutes to 4 hours. Much more stable now. I'm not suggesting your issue is identicle but it may be worth a test. Otherwise I'd recommend you post your build. Hopefully, someone more in the know can assist you with this soon.
 
Last edited:

jklondon

Bronze Level Poster
More info below ** for any advice.

Was going to run a few stress tests. Is OCCT good for this (https://www.ocbase.com/) ?

Thanks

**

MBOARD BIOS - 2502 11/11/2020
GPU DRIVER - 461.09 from Zotac site

OS Name Microsoft Windows 10 Home
Version 10.0.19042 Build 19042
Other OS Description Not Available
OS Manufacturer Microsoft Corporation
System Name DESKTOP-EKEI97M
System Manufacturer PC Specialist LTD
System Model Amd Am4 Gen3
System Type x64-based PC
System SKU 1908512
Processor AMD Ryzen 9 5950X 16-Core Processor, 3401 Mhz, 16 Core(s), 32 Logical Processor(s)
BIOS Version/Date American Megatrends Inc. 2502, 11/11/2020
SMBIOS Version 3.3
Embedded Controller Version 255.255
BIOS Mode UEFI
BaseBoard Manufacturer ASUSTeK COMPUTER INC.
BaseBoard Product ROG CROSSHAIR VIII HERO
BaseBoard Version Rev X.0x
Platform Role Desktop
Secure Boot State Off
PCR7 Configuration Binding Not Possible
Windows Directory C:\windows
System Directory C:\windows\system32
Boot Device \Device\HarddiskVolume1
Locale United Kingdom
Hardware Abstraction Layer Version = "10.0.19041.488"
Username DESKTOP-EKEI97M\ravlo
Time Zone GMT Standard Time
Installed Physical Memory (RAM) 32.0 GB
Total Physical Memory 31.9 GB
Available Physical Memory 25.4 GB
Total Virtual Memory 36.9 GB
Available Virtual Memory 28.1 GB
Page File Space 5.00 GB
Page File C:\pagefile.sys
Kernel DMA Protection Off
Virtualisation-based security Not enabled
Device Encryption Support Reasons for failed automatic device encryption: TPM is not usable, PCR7 binding is not supported, Hardware Security Test Interface failed and the device is not Modern Standby, Un-allowed DMA-capable bus/device(s) detected, TPM is not usable
Hyper-V - VM Monitor Mode Extensions Yes
Hyper-V - Second Level Address Translation Extensions Yes
Hyper-V - Virtualisation Enabled in Firmware No
Hyper-V - Data Execution Protection Yes
 

Harag

Gold Level Poster
I've ordered a build but not received it yet, but was pointed to this stick to do the stress testing when I get my PC, this might help:

 

jklondon

Bronze Level Poster
I recently got a build from PCS with a 3090 and experienced crashing issues too. I've posted on this in another thread, but in short I reinstalled the Nvidia driver within Geforce Experience by doing a clean custom install and unselecting HD audio.
There's a number of threads in the Nvidia forums suggesting the audio driver on RTX cards clashes with other installed audio drivers (in my case Realtek driver with my Asus motherboard).
What I expereinced was my screens going black, audio bouncing between monitors and headset (some times more than one), BSOD, computer restarts, or as basic as applications closing (one or any combination of these). My crashes were random, 15 minutes to 4 hours. Much more stable now. I'm not suggesting your issue is identicle but it may be worth a test. Otherwise I'd recommend you post your build. Hopefully, someone more in the know can assist you with this soon.
What card did you get?
 

ubuysa

The BSOD Doctor
After much wait got my 5950x and 3090 machine but its just restarts unexpectedly after a hours or so - its very random.

Looking at windows event viewer once I got a critical 41 - Kernel-Power error.

Then another time the machine didn't restart at all but the monitor went dead, the lights were on in the machine. Had to hard reboot.

Very disappointing - any advice?
First off, the kernel power 41 error is a symptom not the cause. It just means Windows didn't shutdown properly.

To stop it restarting (and so you can see the error messages) enter sysdm.cpl in the Run command box. Click the Advanced tab and the bottom of the three Settings buttons.

In there uncheck the 'automatically restart' box. Also ensure that the dump type is set to 'Automatic dump'.

If it BSODs look for a kernel dump in C:\Windows\Memory.dmp. If it crashes any other way look for a mini dump in the C:\Windows\Minidumps folder.

Upload any dumps you find to the cloud and post a link here.

Also get something like HWMonitor (https://www.cpuid.com/softwares/hwmonitor.html) to check your temperatures both at idle and under as much load as you can generate.
 

jklondon

Bronze Level Poster
First off, the kernel power 41 error is a symptom not the cause. It just means Windows didn't shutdown properly.

To stop it restarting (and so you can see the error messages) enter sysdm.cpl in the Run command box. Click the Advanced tab and the bottom of the three Settings buttons.

In there uncheck the 'automatically restart' box. Also ensure that the dump type is set to 'Automatic dump'.

If it BSODs look for a kernel dump in C:\Windows\Memory.dmp. If it crashes any other way look for a mini dump in the C:\Windows\Minidumps folder.

Upload any dumps you find to the cloud and post a link here.

Also get something like HWMonitor (https://www.cpuid.com/softwares/hwmonitor.html) to check your temperatures both at idle and under as much load as you can generate.
done thanks.

GPU driver update might have done the trick I hope. Ran OCCT stress testing for 5 hours (will leave overnight)
 

jklondon

Bronze Level Poster
Spoke to soon its back. Analysing the c:\windows\MEMORY.DMP got this error

VIDEO_SCHEDULER_INTERNAL_ERROR (119)
The video scheduler has detected that fatal violation has occurred. This resulted
in a condition that video scheduler can no longer progress. Any other values after
parameter 1 must be individually examined according to the subtype.
Arguments:
Arg1: 0000000000000002, The driver failed upon the submission of a command.
Arg2: ffffffffc000000d
Arg3: fffff5802d9f7920
Arg4: ffffd48fb20bdc90

Full trace here

https://pb.server.sawhney.cloud/?7002f7ab7c188a1d#DYUfAe6C38EzgyXXunJWX9LZ2Gmf5H3mJNCCZB6uB569

Appreciate some guidance as to what to do
 

ubuysa

The BSOD Doctor
I've moved your post to the Tech Support forum where it belongs. :)

Can you please upload the dump file itself, that simple text analysis isn't anywhere near good enough.
 

jklondon

Bronze Level Poster
dmp file from minidumps

it was not BSOD - pc restart.
 

Attachments

  • issue-011821-11531-01 - Copy.zip
    820.6 KB · Views: 274

ubuysa

The BSOD Doctor
This is a minidump (because it's a crash not a BSOD) and they don't contain all the kernel data structures, however it's good enough for this error.

There is a clear driver error in the active thread...

Code:
fffff580`2d9f7148  fffff802`1ca3ad8b Unable to load image \SystemRoot\System32\DriverStore\FileRepository\nv_dispi.inf_amd64_3621da861144492b\nvlddmkm.sys, Win32 error 0n2 nvlddmkm+0xcad8b

The nvlddmkm.sys driver is the Nvidia graphics driver. The module that failed was dxgmms2.sys, a DirectX driver, but it seems that the Nvidia graphics driver is the fault here (in any case, dxgmms2.sys is a Microsoft driver and this far less likely to have a problem).

If you haven't done so already I would use DDU to uninstall the existing driver (it will reboot) and then download and install the latest Nvidia driver for your OS and card. Note: Don't install the Nvidia Audio driver that comes with the Nvidia graphics driver, we've had several issues of conflicts with the Realtek audio driver and that one.
 
Top