This post is intended both for AMD employees to see (if that's possible) and as a warning to end users who may not be aware.
Since the ASUS/AMD SOC voltage issue that was causing CPUs to explode, I've been monitoring my voltages like a hawk with HWiNFO. I have set alerts for SOC and VDD voltages, as well as various temperatures.
The problem is that I sometimes see voltage and temperature spikes. The three values involved are VDD and SOC voltages and CPU Die (Average). Here's what happens and some observations I've made:
I have a few theories as to where the problem might lie. They fall into two basic categories: a readout error, where the data is incorrect and the values aren't actually spiking, or a control error, where they are spiking because the voltage protection circuitry/algorithms aren't functioning properly. These are just guesses based on my limited experience (I have a background in electrical and networking technology, but I'm a mechanical technician by trade, so my knowledge is limited).
I've been exploring online, and I am not the only one having this problem. There's a Reddit thread where one person claims that these high voltages would instantly destroy the CPU, so it must be an error. Others believe this is a 'double' reading; apparently they've been seeing values that are exactly double the baseline.
In one exchange, ASUS asked me what the problem was, given that SOC was limited to 1.3 V. They also told me to turn off PBO and use a liquid cooler. They obviously hadn't read the details I had already provided twice, or the myriad screenshots of my graphs. There are several people involved, all asking for the same details, and it seems internal communication is less than ideal. I have a custom liquid loop, and even under max load my hottest temp rarely exceeds 70 degrees.
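One way to test the 'double reading' theory against a log is to compare each sample with the typical value and flag anything sitting close to exactly 2x. The snippet below is only a rough sketch: the function name, the 5% tolerance, and the sample values are all made up for illustration and are not taken from anyone's actual log.

```python
from statistics import median

def find_doubled_samples(samples, tolerance=0.05):
    """Return (index, value, ratio) for samples close to 2x the median."""
    baseline = median(samples)
    if baseline == 0:
        return []  # no meaningful ratio against a zero baseline
    flagged = []
    for i, value in enumerate(samples):
        ratio = value / baseline
        # A ratio near 2.0 hints at a doubled readout rather than a random spike.
        if abs(ratio - 2.0) <= tolerance:
            flagged.append((i, value, round(ratio, 3)))
    return flagged

# Hypothetical SOC voltage samples in volts (illustrative only).
soc_samples = [1.25, 1.25, 1.24, 2.50, 1.25, 1.26, 1.25]
print(find_doubled_samples(soc_samples))
```

If most of the spikes in a real log land on a ratio of almost exactly 2, that would lean toward a readout error; spikes scattered across arbitrary ratios would not support the doubling idea.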
I'm hoping that AMD will see this and investigate, and that the solution is as straightforward as some updated firmware or drivers. I suspect this really needs the expertise of an electrical engineer, ideally one who's familiar with the product. Barring that, I hope this will at least raise awareness and prompt others to start monitoring their voltages. Here's a screenshot of one of the spikes.
Even with the update, some users are still reporting occasional spikes.
I would manually set SOC and VDDIO to 1.2 V or less.
I also advise using a negative vCore offset plus Curve Optimizer. The latter also improves performance.
Here's the thing. Your SOC current max is at 21.9 A, your CPU core current is at 22.8 A lol... your CPU EDC is at 47 (edit: AMPS, dude lol... AMPS), and your CPU package power is 41 watts max lol, which is fine (aka, you CAN'T be pulling **bleep** near 50 amps using 41 watts)... If you turned more "red things" on, your entire screen would be lit up. That, or you clicked on some silly stuff in your BIOS (hell, I doubt they even let you do that), or you're running the wrong one for our board (I have the same, and can't do that), or you have let it do that. It's that, or... it's bugged. No idea what you have going on, nor what you have done. I turned on PBO, use Curve Optimizer, and this rig runs like a **bleep** top.
Your vddmisc is even DOUBLE... 2.1
You, sir... are bugged, or you did something the BIOS doesn't let you do unless you said "I want to". Look at all the other "measurements" and be more worried about those lol. I highly doubt you're pulling those numbers but still seeing 28-ish per core. No way. Even a spike. Run the thing, dude. Relax, and run it. Play your games or whatever.
ASUS X670E-E with a 7800X3D here, running DDR5 at 6200 all day.
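For anyone who wants to sanity-check the amps-versus-watts argument above: electrical power is voltage times current (P = V × I), so a power figure and a current figure together imply a voltage. The sketch below uses the 41 W and 47 A numbers quoted in this post; the function name is just illustrative. Keep in mind that the maxima shown for different sensors need not have occurred at the same instant, so this is only a rough plausibility check.

```python
def implied_voltage(power_watts, current_amps):
    """Voltage implied by P = V * I for a given power and current."""
    return power_watts / current_amps

package_power_w = 41.0   # max CPU package power quoted in the post
edc_current_a = 47.0     # max EDC current quoted in the post

v = implied_voltage(package_power_w, edc_current_a)
print(f"{v:.3f} V")  # prints "0.872 V"
```

If the implied voltage is far from what the voltage sensor reported at the same moment, then either the two peaks did not coincide or at least one of the readings is suspect.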
Hi.
I am having the exact same issue. 7950X with an ASUS Strix X670E. No EXPO.
BIOS 1416.
There are other anomalies, like 0 °C readings in Core temp and L3 temp, and a minimum CPU VDDCR_VDD Voltage (SVI3 TFN) under 1 V.
Interesting. I'm not seeing any of the low readings. I would suggest two things: try to find a pattern. See if you can narrow down when it happens, and submit a support ticket to ASUS. The more people they get this issue from, the more likely they are to devote resources to it.
Let's say it's not actually a voltage spike but some sort of data error. It could come from the hardware doing the reporting, the chipset drivers, Windows, HWiNFO, or any other link in the chain from voltage to screen.
I have a ticket with ASUS and they say they're trying to duplicate the issue. I figure they're only going to expend so much effort before they wash their hands of it. I'm going to try and figure out how to measure the actual voltage with a multimeter on the board, to see if it's really spiking.
It seems that it happens when I go away from the PC for a few minutes (10-30) and then return. The sudden input seems to cause the voltage spikes. It's almost as if a device is being put to sleep, and when it wakes with activity, there's a problem keeping the voltage in check.
I've turned off all power-saving features in Windows to see if that makes a difference. So far, it hasn't happened again. I'll wait a few days, and if I don't see it, I'll start turning them back on one at a time.
Even if it is Windows power settings, it shouldn't be happening, and either AMD or ASUS needs to fix it.
One more thing: can you verify that your low temp anomalies are happening at the same time as the voltage spikes? That could be very important info.
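If both kinds of events are written down with timestamps, checking whether they coincide can be done mechanically. Here is a rough sketch, assuming you have already pulled the two lists of timestamps out of your log yourself; the function name, the 2-second window, and the example times are all hypothetical.

```python
from datetime import datetime, timedelta

def coincident_events(spike_times, anomaly_times, window=timedelta(seconds=2)):
    """Pair each voltage spike with any temperature anomaly within `window`."""
    pairs = []
    for spike in spike_times:
        for anomaly in anomaly_times:
            if abs(spike - anomaly) <= window:
                pairs.append((spike, anomaly))
    return pairs

# Hypothetical timestamps (illustrative only).
spikes = [datetime(2023, 10, 1, 14, 5, 10), datetime(2023, 10, 1, 15, 30, 0)]
zero_temps = [datetime(2023, 10, 1, 14, 5, 11), datetime(2023, 10, 1, 16, 0, 0)]

matches = coincident_events(spikes, zero_temps)
print(len(matches))  # prints 1
```

The window should be at least as long as the logging interval, since two sensors polled in the same pass can still end up on neighbouring rows.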
If the spikes occur at exactly the same time, I would say it has something to do with the PSU. Have you tried another PSU?
The sensor hardware/software, and everything in between, is just bugged, guys. That's it. Nothing more. Obviously getting a zero temp... is, well, not the gear but the reading, like all of your "info" coming in.
It will crash the PC, so I don’t think it’s the sensor.
Okay, I’m getting this too with HWiNFO, with the latest BIOS from the end of September. My best OC so far gives a spike to 110 °C on the die when no core was near that. It is the SSE benchmarks that trigger this, and it is a very quick spike. I have liquid cooling with 2x 360 mm radiators on my loop, but regardless of cooler I think this spike will happen. I removed my OC setup: a completely stock BIOS setup will spike past 95 °C, the thermal limit, and with the PBO enhancement 90 °C setting it will spike to 95 °C. So I’ve been able to recreate exactly where it occurs: whenever I run SSE in PassMark or SSE in OCCT. No other test or benchmark spikes like this, and it always happens when the test begins. I have had soft crashes that reboot, but with my max OC setup and optimized curve it will straight up shut down my PC. It feels like a very high voltage spike that is almost at the top of the voltage curve, which goes beyond the thermal limit. It’s very interesting, though, that no core actually shows the same max temp as the total die.