AnsweredAssumed Answered

AMD GPU S7150 x2 Host Instability Issues

Question asked by rklingaman@carroll.edu on Jul 23, 2018
Latest reply on Oct 11, 2018 by rklingaman@carroll.edu

I have a total of 8 of the S7150x2 cards in 4 hosts and I'm experience some host reboots that I'm trying to track down. These logs were located and I was curious if anybody had more information on them if they could be an issue:

 

2018-07-09T17:58:58.924Z cpu8:75342)amdgpuv_log: idle_vf:952: [amdgpuv]: IDLE_GPU failed on VF3, status:0xff

2018-07-09T17:58:58.924Z cpu8:75342)amdgpuv_log: switch_vfs_step_by_step:1130: [amdgpuv]: IDLE_GPU failed on VF3

PCPU 28 locked up. Failed to ack TLB invalidate (total of 1 locked up, PCPU(s): 28).

2018-07-09T17:59:08.404Z cpu24:66013)@BlueScreen: PCPU 28 locked up. Failed to ack TLB invalidate (total of 1 locked up, PCPU(s): 28).

2018-06-22T04:47:44.769Z cpu47:66015)Failed to verify signatures of the following vib(s): [amdgpuv-cim]. All tardisks validated

Outcomes