ghueller

Linux on 3700x: spontaneous reboots caued by MCE

Discussion created by ghueller on Mar 6, 2020
Latest reply on Mar 19, 2020 by ghueller

Hi,

 

I am running Linux (Fedora 31) on my build from last July, consisting of:
- Crucial DDR4 3000 Sticks
- Radeon RX 570 (MSI)
- Asrock Phantom Gaming 4 (latest BIOS)
- Ryzen 3700x

 

The system is fast and - at least under windows 10 running fine.
Temps are ok, PSU is of high quality, memory sustains yours of memtest86 witout errors.

 

Yet, when running Linux, I get a short freeze followed by a reboot about once a week.
At the next boot, the following machine check exception is logged:

 

[    0.707393] mce: [Hardware Error]: Machine check events logged
[    0.707395] mce: [Hardware Error]: CPU 10: Machine Check: 0 Bank 5: bea0000000000108
[    0.707464] mce: [Hardware Error]: TSC 0 ADDR 1ffffbb03343c MISC d012000100000000 SYND 4d000000 IPID 500b000000000
[    0.707540] mce: [Hardware Error]: PROCESSOR 2:870f10 TIME 1583508288 SOCKET 0 APIC 5 microcode 8701013
[    0.709397] mce: [Hardware Error]: Machine check events logged
[    0.709398] mce: [Hardware Error]: CPU 12: Machine Check: 0 Bank 5: bea0000000000108
[    0.709468] mce: [Hardware Error]: TSC 0 ADDR 1ffffbba3a05a MISC d012000100000000 SYND 4d000000 IPID 500b000000000
[    0.709543] mce: [Hardware Error]: PROCESSOR 2:870f10 TIME 1583508288 SOCKET 0 APIC 9 microcode 8701013

 


AMD support more or less aborts any communication as soon as they read over the term "linux".
Any idea how to diagnose this issue any further?

 

Thank you in advance, Gerhard

Outcomes