Good morning everyone,
Is there a way to call the BLAS implementation from my own kernel?
So, not from the host program, but directly from the OpenCL kernel that I'm developing.
I would like to try and replace my hand-rolled BLAS kernels with the ones provided by AMD and start using them in my kernels (not in my host programs).
Hope my intentions are clear,