I have a system with three Radeon Vega XTX Frontier Edition cards that I use for machine learning. A bug in the system is causing a lot of problems during the training phase. I managed to isolate the problem to one of the graphics cards: at some point this card makes a mistake in its calculations ...
Is there a tool to check / diagnose the graphics card and confirm my hypothesis?
Thanks in advance,
Max