Hello,
I run a decent ASUS server (RS700A-E11-RS12U) with two AMD EPYC 7763 64-Core Processors and 512GB RAM.
I get errors like this being displayed every once in a while
[Hardware Error]: Corrected error, no action required.
[Hardware Error]: CPU:56 (19:1:1) MC27_STATUS[Over|CE|MiscV|-|-|-|SyndV|-|-|-]: 0xd82000000002080b
[Hardware Error]: PPIN: 0x02b688d2ab0a8065
[Hardware Error]: IPID: 0x0001002e00001e01, Syndrome: 0x000000005a000009
[Hardware Error]: Power, Interrupts, etc. Ext. Error Code: 2, Link Error.
[Hardware Error]: cache level: L3/GEN, mem/io: IO, mem-tx: GEN, part-proc: SRC (no timeout)
Latest BIOS and Firmware already installed.
Linux kernel (latest Proxmox kernel):
Linux node18 5.19.17-1-pve #1 SMP PREEMPT_DYNAMIC PVE 5.19.17-1 (Mon, 14 Nov 2022 20:25:12 x86_64 GNU/Linux
Any ideas what this could be?
Any clues?
Solved! Go to Solution.
Problem was solved by unscrewing and rescrewing the CPU properly into the CPU socket!
Problem was solved by unscrewing and rescrewing the CPU properly into the CPU socket!