I had terrible trouble with 2 of these CPU's installed 5900x and started receiving WHEA errors every day, tested every component with every benchmark i can think of OCCT for PSU as well as paper clip test. Memtest 86 as well as windows memory diagnostic. FurMark and 3d Mark for Graphics Card as well as Seatools for HDD.
Also tried increasing voltage on my ram and tightened up the timings. Tried PBO and XMP off still received WHEA errors. I RMA the first CPU and was shocked that this issue returned with the replacement. Also tried 3 different BIOS revisions. Since returning the 2nd CPU the Motherboard manufacturer Gigabyte has come out with a new BIOS Revision. Unsure if this did anything to improve the situation but stability on this CPU is awful and i cant keep wasting my time with this. I am thinking ill just wait for AMD to get their **bleep** together or if they dont I will just switch back to Intel.
I will definietely be more careful on my next purchase and keep checking in here to see if this issue is resolved.
Ok, looks like mine (5950X) is become stable (until now). What I have done is:
1) Go to BIOS and 'Load Optimized Defaults' (note that CPB is Enabled by default and I leave it enabled);
2) PSS Support -> Disabled;
3) Global C-state Control -> Disabled;
4) Power Supply Idle Control -> Typical Current Idle;
5) Power Down Enabled -> Disabled (for DRAM);
6) Gear Down Mode -> Disabled (for DRAM);
7) Set XMP to Enabled for the DRAM (because my DRAMs support XMP 2.0);
I have been testing the system for 5 days (24h/day of calculations, on all 16 cores) and until now I haven't had WHEA 18. Hope this will help other people.
My 2 cents,
To those who have tried everything else, here was my fix.... unplug all of the fans from the motherboard. There is an amperage rating to for the fan headers on motherboards. Apparently overloading this circuit can cause stability issues elsewhere in the PC.
I had this same random rebooting/stability issue for a few months and replaced every single component in the PC trying to fix it, including a new processor and motherboard and spent about 40 hours adjusting BIOS setting and firmware. The last component replaced was the power supply... I swapped out the 1000W power supply with a "known to be good" 750W power supply... the computer wouldn't boot with the 750W even though online calculators indicate a power supply as small as 350W would work. The computer was already stripped down to the bare minimum for diagnostics (1 video card, 1 stick of RAM) so my only option to reduce power draw was to unplug the extra case fans. With only the CPU cooler fan plugged in, the computer booted right up and had no stability issues.
With the computer stripped down and all 5 case fans plugged in, the computer is unstable.
With all components reinstalled in the computer (all 4 stick of RAM, 2 graphics cards) but with only 1 case fan in lieu of 5, the PC is stable and has not had a single random reboot in 6 weeks.
I just want to share my experience facing exactly the same issue you guys are facing.
My setup is
CPU : Ryzen 9 5900x
MOBO : MSI MPG X570 Gaming Plus
RAM : PNY XLR8 DDR4 8GBx4 3200MHz
GPU : Zotac 3070ti Amp Extreme Holo
PSU : Thermaltake Toughpower GF-1 1000W
AIO : Silverstone PF360
I've faced exactly the same with those replies before me. My PC works perfectly fine most of the time but when I was playing games, whenever I'm on the low loading scene like in the lobby for multiplayer games, my PC crashes. It didn't throw me the BSOD but only black screen or the last scene rendered by the GPU. I was annoyed a lot since I've spent a lot on the PC and also this is the first built PC by myself.
I went to Event Viewer and found out that it gives me the Kernel-41 everytime and I have no clue what had happened. I've tried methods mentioned here in the thread but I just don't want to sacrifice any performance for a stability which we should have got without any effort after buying AMD chips.
Turns out I've found out that for my system if I increase DRAM voltage to 1.42 it fixed the random reboot under low workload. According to the ram specs, it shows that those RAM uses 1.35v for operating. IIRC maximum voltage for DDR4 is 1.5v so I think raising 1.35v to 1.4xv shouldn't be a problem and this solves my issue.
btw everything else except the fan curve on BIOS I've set them all at default settings.
Hope this would be helpful to you guys.
PERIOD . = BACKSLASH
COMMA , = FORWARD SLASH
TAKEOWN ,F C:.WINDOWS.SYSTEM32.WHEALOGR.DLL
ICACLS C:.WINDOWS.SYSTEM32.WHEALOGR.DLL ,GRANT ADMINISTRATORS:F
REN C:.WINDOWS.SYSTEM32.WHEALOGR.DLL WHEALOGR.DL
THAT TAKES CARE OF THAT, NOW FOLLOW THE 'RUSSIAN GURUS' GUIDE AS WELL AS USE CPU CORE PARKING UTILITY, TURN CORE PARKING TO 0%, ALL OTHERS 100%. THIS COMBINED WITH:
With great help from some Russian gurus I finally found (I hope) a solution for my case.
Just to remind you, my problem specifics were crashes in transitions from loads to idle, in idle within 30 seconds after a transition, or when applying a load again after these 30 seconds. NEVER under a load. Turning the Core Performance Boost off eliminated the issue together with the CPU performance. If you're in the same boat, try this, it should help.
The following is for the ASUS bios, for other vendors the same parameters may be hidden in another place.
The system is stable so far with the following BIOS settings:
Go to AMD overclocking, set the Presicion Boost Overdrive to Manual. Some additional parameters will appear. In there:
(The main thing) Set the EDC current limit to 200A.
(Just in case) Set the power limit to 130W.
(Just in case) Set the temperature limit to 83C.
1 is an increase, 2 and 3 is a decrease. Leave at zeros all the rest there.
Also, just in case, set Idle Voltage to Typical, Global C-states control to Disable, check that ECO mode is Off. Then you can set Core Precision Boost back to On, everything should work.
Looks like the MB and its BIOS wasn't tested with a 5000 CPU at all (or, if it was, it was like "Ok, it boots, that means it works, great, the job's done), and the BIOS just doesn't know about the larger peak currents of Rysen 5000s, and the BIOS' "digital fuse" is just too small for a new CPU. When changing its clocks the CPU, probably, tries to draw more current, the "fuse" (EDC current limit) kicks in and the CPU malfunctions and produces a BSOD.
These currents (or how the "fuse" works) also definitely depend on the MB and/or the CPU heating (I didn't have any BSODs when cooling the open case with a cold hair fan), that explains why not everyone with the config like mine has the same problem, people with better cooling (or a colder GPU) might be ok at defaults.
I SHOULD WORK FOR AMD
For WHEA errors, please check the following KB article for suggestions on how to troubleshoot your issue: Troubleshooting Tips for Resolving System Stability Issues | AMD
Some other suggestions for WHEA errors.
1. Ensure you are on the latest BIOS for your motherboard.
2. Please check to see if using the motherboard BIOS default settings (including memory running at JDEC specs 2133Mhz) resolves the WHEA error.
3. Check that the system memory you are using has been validated by the motherboard manufacturer on the QVL (Qualified Vendor List) and that the system memory configuration you are using, eg 2/3 DIMMS has been validated to run at the memory frequency you are using.
4. Check that you are suppling sufficient voltage to the system memory for your memory. Sometimes a slight increase to memory voltage may be suggested by the motherboard manufacturer to obtain stability, eg from 1.35v to 1.36v.
5. Check that your Windows OS and Chipset drivers are up to date.
6. Check that no components on the system are overclocked or are overheating, this includes graphics cards.
7. Check that your PSU meets the requirements for your system.
8. Do not use PBO to overclock the processor (or any other component in the system).
Since the OP of this discussion has a correct answer and solution (increasing DRAM voltage) I am locking this discussion.
If you continue to experience WHEA errors on your system after checking the steps above and validating each one, please open a new discussion providing the information requested here: https://community.amd.com/t5/knowledge-base/information-required-when-posting-a-discussion/ta-p/4227...