We are facing an issue with the mcelog service on RHEL, Rocky and Alma 9 vms running on vmware where the mcelog service fails to start and the required edac_mce_amd kernel module cannot be loaded with the error no such device. As per my research and conversations with RHEL support, I was referred to AMD support who referred me to this forum in search of a driver that will allow us to load the kernel module.
mcelog[1102]: mcelog: ERROR: AMD Processor family 25: mcelog does not support this processor. Please use the edac_mce_amd module instead.
sudo insmod /lib/mod
lscpu
Architecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
Address sizes: 43 bits physical, 48 bits virtual
Byte Order: Little Endian
CPU(s): 2
On-line CPU(s) list: 0,1
Vendor ID: AuthenticAMD
Model name: AMD EPYC 7713P 64-Core Processor
CPU family: 25
Model: 1
Thread(s) per core: 1
Core(s) per socket: 1
Socket(s): 2
Stepping: 1
BogoMIPS: 4000.00
Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mc
a cmov pat pse36 clflush mmx fxsr sse sse2 syscall nx m
mxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good
nopl tsc_reliable nonstop_tsc cpuid extd_apicid tsc_kno
wn_freq pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2
x2apic movbe popcnt aes xsave avx f16c rdrand hypervis
or lahf_lm extapic cr8_legacy abm sse4a misalignsse 3dn
owprefetch osvw invpcid_single ibpb vmmcall fsgsbase bm
i1 avx2 smep bmi2 erms invpcid rdseed adx smap clflusho
pt clwb sha_ni xsaveopt xsavec xsaves clzero arat pku o
spke overflow_recov succor
Virtualization features:
Hypervisor vendor: VMware
Virtualization type: full
Caches (sum of all):
L1d: 64 KiB (2 instances)
L1i: 64 KiB (2 instances)
L2: 1 MiB (2 instances)
L3: 512 MiB (2 instances)
NUMA:
NUMA node(s): 1
NUMA node0 CPU(s): 0,1
Vulnerabilities:
Itlb multihit: Not affected
L1tf: Not affected
Mds: Not affected
Meltdown: Not affected
Spec store bypass: Vulnerable
Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer
sanitization
Spectre v2: Mitigation; Retpolines, IBPB conditional, STIBP disable
d, RSB filling
Srbds: Not affected
Tsx async abort: Not affected
ules/5.14.0-70.13.1.el9_0.x86_64/kernel/drivers/edac/edac_mce_amd.ko.xz
insmod: ERROR: could not insert module /lib/modules/5.14.0-70.13.1.el9_0.x86_64/kernel/drivers/edac/edac_mce_amd.ko.xz: No such device