Hello sir,
We have run into a problem with AMD CPUs. We ran KVM on the 3rd generation 7003 series CPUs, and it was very stable. However, since switching to the 4th generation 9004 series CPUs six months ago, we have been experiencing frequent spontaneous reboots. We are using Rocky Linux, and after each reboot we could not find any relevant log entries in the system or in IPMI.
Here are the methods we have tried:
1. Initially, we thought the issue might be overheating with the 9654 CPU. However, we also experienced the same problem with the 9634 CPU, and the temperatures were around 70 degrees Celsius.
2. We tried kernels from different major versions (4.x, 5.x, 6.x), but the issue persisted.
3. We originally used Gigabyte servers and later switched to ASUS, but the situation remained the same.
4. We changed the following settings in the BIOS:
• Power Supply Idle Control --> Typical Current Idle
• Global C-state control --> Disabled
5. We used the following kernel option:
• processor.max_cstate=0
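To rule out the possibility that the kernel option from step 5 was not actually applied at boot, a check along the following lines can be run on the host. This is only a minimal sketch in Python: the helper names `parse_cmdline` and `max_cstate_applied` are hypothetical, and it assumes nothing beyond the standard `/proc/cmdline` file. It splits on whitespace only, so quoted values containing spaces are not handled.

```python
from pathlib import Path


def parse_cmdline(cmdline: str) -> dict:
    """Split a kernel command line into {name: value}; bare flags map to None."""
    params = {}
    for token in cmdline.split():
        name, sep, value = token.partition("=")
        params[name] = value if sep else None
    return params


def max_cstate_applied(cmdline: str, expected: str = "0") -> bool:
    """Return True if processor.max_cstate is present with the expected value."""
    return parse_cmdline(cmdline).get("processor.max_cstate") == expected


if __name__ == "__main__":
    # /proc/cmdline holds the command line the running kernel was booted with.
    proc = Path("/proc/cmdline")
    if proc.exists():
        applied = max_cstate_applied(proc.read_text())
        print("processor.max_cstate=0 applied:", applied)
```

If this reports False, the option was set in the bootloader configuration but never reached the running kernel, so the test in step 5 would not have been valid.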
Do you have any suggestions for our next steps? We haven't been able to find a solution, and the same KVM configuration runs without any issues on the 3rd generation CPUs.