I use pytorch to run a model on MI100, I want to know work-group number per computer unit, or some detial information ablout the kernel function.
The profiler in pytorch not support. I don't have much rocm developing exp, so could I get it by python or shell?