AnsweredAssumed Answered

Frequent L2 ITLB Parity Errors

Question asked by stratvox on Feb 18, 2020
Latest reply on Feb 21, 2020 by stratvox

Hi folks!

 

I recently got a 3950x and I've found something in my system logs that seem to be... problematic? Here's a sample of what I'm seeing:

 

--------8<--------
Feb 9 13:02:30 aegaeon kernel: [1020883.656868] mce: [Hardware Error]: Machine check events logged
Feb 9 13:02:30 aegaeon kernel: [1020883.656871] [Hardware Error]: Corrected error, no action required.
Feb 9 13:02:30 aegaeon kernel: [1020883.656876] [Hardware Error]: CPU:14 (17:71:0) MC1_STATUS[Over|CE|MiscV|AddrV|-|-|SyndV|-|-|-]: 0xdc20000000070011
Feb 9 13:02:30 aegaeon kernel: [1020883.656879] [Hardware Error]: Error Addr: 0x000055f722bb5000
Feb 9 13:02:30 aegaeon kernel: [1020883.656881] [Hardware Error]: IPID: 0x000100b000000000, Syndrome: 0x000000003a000008
Feb 9 13:02:30 aegaeon kernel: [1020883.656883] [Hardware Error]: Instruction Fetch Unit Ext. Error Code: 7, L2 ITLB Parity Error.
Feb 9 13:02:30 aegaeon kernel: [1020883.656885] [Hardware Error]: cache level: L1, tx: INSN
Feb 9 13:04:57 aegaeon kernel: [1021031.109135] mce: [Hardware Error]: Machine check events logged
Feb 9 13:04:57 aegaeon kernel: [1021031.109137] [Hardware Error]: Corrected error, no action required.
Feb 9 13:04:57 aegaeon kernel: [1021031.109141] [Hardware Error]: CPU:30 (17:71:0) MC1_STATUS[Over|CE|MiscV|AddrV|-|-|SyndV|-|-|-]: 0xdc20000000070011
Feb 9 13:04:57 aegaeon kernel: [1021031.109145] [Hardware Error]: Error Addr: 0x000055f71dab5000
Feb 9 13:04:57 aegaeon kernel: [1021031.109146] [Hardware Error]: IPID: 0x000100b000000000, Syndrome: 0x000000003a000008
Feb 9 13:04:57 aegaeon kernel: [1021031.109148] [Hardware Error]: Instruction Fetch Unit Ext. Error Code: 7, L2 ITLB Parity Error.
Feb 9 13:04:57 aegaeon kernel: [1021031.109150] [Hardware Error]: cache level: L1, tx: INSN
Feb 9 13:07:41 aegaeon kernel: [1021194.945056] mce: [Hardware Error]: Machine check events logged
Feb 9 13:07:41 aegaeon kernel: [1021194.945058] [Hardware Error]: Corrected error, no action required.
Feb 9 13:07:41 aegaeon kernel: [1021194.945063] [Hardware Error]: CPU:14 (17:71:0) MC1_STATUS[Over|CE|MiscV|AddrV|-|-|SyndV|-|-|-]: 0xdc20000000070011
Feb 9 13:07:41 aegaeon kernel: [1021194.945066] [Hardware Error]: Error Addr: 0x000055f7233ae000
Feb 9 13:07:41 aegaeon kernel: [1021194.945067] [Hardware Error]: IPID: 0x000100b000000000, Syndrome: 0x000000003a000008
Feb 9 13:07:41 aegaeon kernel: [1021194.945069] [Hardware Error]: Instruction Fetch Unit Ext. Error Code: 7, L2 ITLB Parity Error.
Feb 9 13:07:41 aegaeon kernel: [1021194.945070] [Hardware Error]: cache level: L1, tx: INSN
Feb 9 13:10:09 aegaeon kernel: [1021342.397362] mce: [Hardware Error]: Machine check events logged
Feb 9 13:10:09 aegaeon kernel: [1021342.397365] [Hardware Error]: Corrected error, no action required.
Feb 9 13:10:09 aegaeon kernel: [1021342.397369] [Hardware Error]: CPU:30 (17:71:0) MC1_STATUS[Over|CE|MiscV|AddrV|-|-|SyndV|-|-|-]: 0xdc20000000070011
Feb 9 13:10:09 aegaeon kernel: [1021342.397372] [Hardware Error]: Error Addr: 0x000055f71ffd3000
Feb 9 13:10:09 aegaeon kernel: [1021342.397373] [Hardware Error]: IPID: 0x000100b000000000, Syndrome: 0x000000003a000008
Feb 9 13:10:09 aegaeon kernel: [1021342.397375] [Hardware Error]: Instruction Fetch Unit Ext. Error Code: 7, L2 ITLB Parity Error.
Feb 9 13:10:09 aegaeon kernel: [1021342.397377] [Hardware Error]: cache level: L1, tx: INSN
Feb 9 13:12:53 aegaeon kernel: [1021506.233277] mce: [Hardware Error]: Machine check events logged
Feb 9 13:12:53 aegaeon kernel: [1021506.233280] [Hardware Error]: Corrected error, no action required.
Feb 9 13:12:53 aegaeon kernel: [1021506.233284] [Hardware Error]: CPU:14 (17:71:0) MC1_STATUS[Over|CE|MiscV|AddrV|-|-|SyndV|-|-|-]: 0xdc20000000070011
Feb 9 13:12:53 aegaeon kernel: [1021506.233287] [Hardware Error]: Error Addr: 0x00005622c1a81000
Feb 9 13:12:53 aegaeon kernel: [1021506.233288] [Hardware Error]: IPID: 0x000100b000000000, Syndrome: 0x000000003a000008
Feb 9 13:12:53 aegaeon kernel: [1021506.233290] [Hardware Error]: Instruction Fetch Unit Ext. Error Code: 7, L2 ITLB Parity Error.
Feb 9 13:12:53 aegaeon kernel: [1021506.233292] [Hardware Error]: cache level: L1, tx: INSN
Feb 9 13:15:20 aegaeon kernel: [1021653.685612] mce: [Hardware Error]: Machine check events logged
Feb 9 13:15:20 aegaeon kernel: [1021653.685615] [Hardware Error]: Corrected error, no action required.
Feb 9 13:15:20 aegaeon kernel: [1021653.685620] [Hardware Error]: CPU:30 (17:71:0) MC1_STATUS[Over|CE|MiscV|AddrV|-|-|SyndV|-|-|-]: 0xdc20000000070011
Feb 9 13:15:20 aegaeon kernel: [1021653.685623] [Hardware Error]: Error Addr: 0x000055f7236b2000
Feb 9 13:15:20 aegaeon kernel: [1021653.685624] [Hardware Error]: IPID: 0x000100b000000000, Syndrome: 0x000000003a000008
Feb 9 13:15:20 aegaeon kernel: [1021653.685626] [Hardware Error]: Instruction Fetch Unit Ext. Error Code: 7, L2 ITLB Parity Error.
Feb 9 13:15:20 aegaeon kernel: [1021653.685628] [Hardware Error]: cache level: L1, tx: INSN
-------->8--------

 

Now, I don't necessarily expect there to never be an error, but these seem to be happening pretty frequently. I'm wondering if I should RMA the processor or if anyone here can shed any light on what's going on.

 

Thanks!

Outcomes