So after months of troubleshooting and RMAs, I am officially at a loss of what to do. Ever since I built this machine last August I have had random BSODs and hard resets a few times a week at random use cases, middle of gaming as well as idle or just browsing the web. The BSODs have random bugcheck codes, never identifying an actual driver, always ntoskrnl.exe, and the resets that are interspersed always fail to create a minidump. I think I can recreate the crashes instead of waiting for days at a time as the system always hard resets around 8 minutes into running with driver verifier turned on along with random bugcheck codes in event viewer, again no dumps as it fails to create the dump files. Here's my system config:
• CPU: Ryzen 5 3600
• Motherboard: MSI MPG x570 Gaming Edge WiFi
• GPU: Nvidia GTX 970
• RAM: Corsair Vengeance LPX 16 GB (2x8) 3600 MHz C18
• CPU Cooler: Cooler Master Hyper 212 Evo Black
• PSU: EVGA SuperNOVA 750 G+ 80+ Gold
• No peripherals except for keyboard and mouse, which I have tried multiple of
And here is a list of troubleshooting and RMAs I already have tried:
• General driver debugging
• RAM memtest for 12 hours, 0 errors
• Tried another set of Corsair Vengeance LPX memory at 2133 MHz
• Both XMP and non-XMP
• No overclocks
• Tried different GPU (GTX 660)
• Replaced ssd with new clean nvme ssd with brand new windows install
• Replaced all power cables
• RMA'd CPU
• RMA'd Motherboard
• RMA'd PSU
• Tried earlier BIOS version
• Tried manual RAM timings
• Tried different power plans and various Windows settings
None of these changes stopped the system from hard resetting around 8 minutes into running with driver verifier on. Which kinda baffles me since I pretty much have replaced the entire system at this point except for trying out a new brand of RAM sticks or switching out the CPU cooler. I have no idea where to go from here, so any help would be greatly appreciated. Thanks guys.
Can you actually post a picture of it assembled. also I know you probably did but check your motherboard manual and make sure you have the ram in the right slots. Have you checked your infinity fabric ratio? Also i dont like how the hyper 212 evo's heat pipes have no plate under them and come in contact directly with the cpu. Ryzen 3 has its cores offset from the center and you maybe only have two heat pipes in direct contact over the module. I dunno
First can you post all the BSOD error (0x???) you are getting?
Second for troubleshooting purposes only, try to run your Windows in a "CLEAN WINDOWS DESKTOP". IF you get no more BSODs this indicates a 3rd party Startup or Driver in conflict or being incompatible with Windows.
Here how to boot into a Clean Windows desktop: https://support.microsoft.com/en-us/help/929135/how-to-perform-a-clean-boot-in-windows
It is very easy to do and undo.
Yeah sure, posted a couple pictures of the build below. One thing to note is that I went out and got an Asus Tuf x570 board to fully make sure it wasn't the motherboards fault. The system still crashes around 8 minutes in to running with driver verifier. And no I have not checked the infinity fabric ratio, not sure what exactly it is supposed to be.
And yeah the BSODs I have gotten have been errors such as IRQL_NOT_LESS_OR_EQUAL, KMODE_EXCEPTION_NOT_HANDLED, TIMER_OR_DPC_INVALID, DPC_WATCHDOG_VIOLATION, and
SYSTEM_THREAD_EXCEPTION_NOT_HANDLED. Lately the SYSTEM_THREAD_EXCEPTION_NOT_HANDLED (0x7e) error seems to be the most frequent in driver verifier. None of them reference a specific driver. They always just say it's ntoskrnl.exe.
And I just put in a clean NVME SSD and loaded it with a brand new clean install of Windows. Installed all Windows updates, AMD chipset drivers, Nvidia drivers, and Intel Wifi drivers, but the system still crashes around 8 minutes into driver verifier.
Thank you guys for the suggestions. I appreciate all the help I can get.
From what I have read about that last BSOD error it can be caused by an out-date driver.
This Tech website give a few troubleshooting tips on how to resolve your BSOD error: System Thread Exception Not Handled in Windows 10 [FIXED]
Go to Windows Event Viewer under Errors and see if you can identify a driver that is causing the problems.
Also check Reliability Monitor. this how to access that feature:
Go into Control Panel/Security and Maintenance. Under Maintenance click View reliability history. This will bring up the Reliability Monitor.
Also run DXDIAG.exe and save it to TXT file and upload it. That will show what files are having problems in your computer including Windows files.
Running a Clean Windows desktop might help in finding which program startup or driver is causing the BSODs.
yeah sometimes its something totally weird. Like when I got my 5700xt the driver at the time had a conflict with hardware rendering in chrome and would bsod.
So something I found was that when I run driver verifier but do not target the nvlddmkm.sys driver, the system does not crash. But when it's even the only driver I verify, the system crashes around 8 minutes in. So I'm not certain, but it seems like it could have something to do with the Nvidia drivers. However, I just tried cleanly uninstalling the current driver with DDU and installing an older driver from like September 2019, but that still results in the same crash. I then tried my brother's GTX 1060 to see if it would work with a newer GPU, but it still has the same crash around 8 minutes in.
Also, Reliability Monitor didn't really help, it just listed all of the kernel power failures that I already saw in Event Viewer. And the Event Viewer doesn't really show anything unusual before the crashes.
So does anyone know if there are current incompatibilities between Nvidia Drivers and the current AMD chipset or anything? That's the only thing I can think of right now.
I have installed a Nvidia GTX 1070 GPU card with the latest Nvidia driver on a AMD Motherboard 990FX (AM3+) without that type of issues. Other Users with newer Ryzen Motherboards also have Nvidia GPUs installed without that problem.
I can't see how a AMD Chipset drivers will have a conflict with a graphics driver.
It is possible another driver on your computer is incompatible with Nvidia graphics driver.
What does DXDIAG.txt show? Can you upload that file to your reply? That deals with anything to do with Graphics and other files. It is sometimes quite useful. It may indicate something you missed in Event Viewer.
By the way, RAM incompatibilities won't show up in MEMTEST86. All that will prove is that the RAM physically is not defective but not incompatible. Is your RAM listed for your Motheboard's QVL list by any chance?: Support For MPG X570 GAMING EDGE WIFI | Motherboard - The world leader in motherboard design | MSI G...
Sometimes the RAM's Timings and Speed may be causing problems. Is your BIOS set at "Factory Default"? if not, then do a CLEAR CMOS (follow manual) to put the BIOS in Factory default.
EDIT: I keep mentioning booting into a Clean Windows Desktop. Once you disable all 3rd party Startups except Nvidia Driver if it shows up and you have no BSODs, then you can, by process of elimination, start "enabling" some of the 3rd party Startups until it starts to cause BSOD in your computer.
This won't harm your computer and is a very good easy way to find out if a program or driver is incompatible.
By any chance is the PCIe x 16 slot you have your GPU installed enabled for PCIe 4.0 or PCIe 3.0 in BIOS?
If it is set on PCIe 4.0 that might be the reason for your Nvidia GPU card driver having issues: Specification for MPG X570 GAMING EDGE WIFI | Motherboard - The world leader in motherboard design |...