Hello,
I have a new server with an AMD EPYC 7282 that randomly goes to a black screen and freezes every day. We lose IP connectivity, SSH access is not possible, etc. The only way to recover is to force a restart.
The machine is running CentOS 7. /var/log/messages does not show anything; the logs just stop abruptly, as they would in the event of a power loss.
We know it is not a power loss problem because the IPMI module (Lenovo XCC) is still active when this happens.
IPMI does not show any hardware error events.
The server is in a cold room, so temperature is not an issue.
/var/crash does not contain any core dump files.
I would appreciate any help or advice on how to fix this or where to look.
Thanks!