Hello,
I run a decent ASUS server (RS700A-E11-RS12U) with two AMD EPYC 7763 64-Core Processors and 512GB RAM.
I get errors like this being displayed every once in a while
[Hardware Error]: Corrected error, no action required.
[Hardware Error]: CPU:56 (19:1:1) MC27_STATUS[Over|CE|MiscV|-|-|-|SyndV|-|-|-]: 0xd82000000002080b
[Hardware Error]: PPIN: 0x02b688d2ab0a8065
[Hardware Error]: IPID: 0x0001002e00001e01, Syndrome: 0x000000005a000009
[Hardware Error]: Power, Interrupts, etc. Ext. Error Code: 2, Link Error.
[Hardware Error]: cache level: L3/GEN, mem/io: IO, mem-tx: GEN, part-proc: SRC (no timeout)
Latest BIOS and Firmware already installed.
Linux kernel (latest Proxmox kernel):
Linux node18 5.19.17-1-pve #1 SMP PREEMPT_DYNAMIC PVE 5.19.17-1 (Mon, 14 Nov 2022 20:25:12 x86_64 GNU/Linux
Any ideas what this could be?
Any clues?