cancel
Showing results for 
Search instead for 
Did you mean: 

Server Gurus Discussions

afink
Journeyman III

CPU corrected errors MC27_STATUS

Hello,

I run a decent ASUS server (RS700A-E11-RS12U) with two AMD EPYC 7763 64-Core Processors and 512GB RAM.

I get errors like this being displayed every once in a while

[Hardware Error]: Corrected error, no action required.
[Hardware Error]: CPU:56 (19:1:1) MC27_STATUS[Over|CE|MiscV|-|-|-|SyndV|-|-|-]: 0xd82000000002080b
[Hardware Error]: PPIN: 0x02b688d2ab0a8065
[Hardware Error]: IPID: 0x0001002e00001e01, Syndrome: 0x000000005a000009
[Hardware Error]: Power, Interrupts, etc. Ext. Error Code: 2, Link Error.
[Hardware Error]: cache level: L3/GEN, mem/io: IO, mem-tx: GEN, part-proc: SRC (no timeout)

Latest BIOS and Firmware already installed.

Linux kernel (latest Proxmox kernel):

Linux node18 5.19.17-1-pve #1 SMP PREEMPT_DYNAMIC PVE 5.19.17-1 (Mon, 14 Nov 2022 20:25:12 x86_64 GNU/Linux

Any ideas what this could be?

Any clues?

0 Likes
1 Solution
afink
Journeyman III

Problem was solved by unscrewing and rescrewing the CPU properly into the CPU socket!

View solution in original post

0 Likes
1 Reply
afink
Journeyman III

Problem was solved by unscrewing and rescrewing the CPU properly into the CPU socket!

0 Likes