I see that clMagma provides BLAS routines, but they expose a conventional host API that requires OpenCL objects. Is there any prospect of that functionality being offered as Bolt-style algorithms? Is there a technical reason this couldn't or shouldn't be done? Would AMD accept this as an open source contribution? In my case, I really need SGEMM.
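
To make the request concrete, here is a minimal sketch of the kind of container-based call I have in mind. The name `sgemm_ref`, its signature, and the row-major layout are my own assumptions for illustration; this is a plain CPU reference loop, not an existing Bolt or clMagma API.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical container-style SGEMM: C = alpha * A * B + beta * C.
// Assumes row-major storage: A is m x k, B is k x n, C is m x n.
// Plain CPU reference loop for illustration only; not part of Bolt or clMagma.
void sgemm_ref(std::size_t m, std::size_t n, std::size_t k,
               float alpha,
               const std::vector<float>& A,
               const std::vector<float>& B,
               float beta,
               std::vector<float>& C)
{
    // The caller passes ordinary containers, so sizes can be checked up front.
    assert(A.size() == m * k);
    assert(B.size() == k * n);
    assert(C.size() == m * n);

    for (std::size_t i = 0; i < m; ++i) {
        for (std::size_t j = 0; j < n; ++j) {
            float acc = 0.0f;
            // Dot product of row i of A with column j of B.
            for (std::size_t p = 0; p < k; ++p) {
                acc += A[i * k + p] * B[p * n + j];
            }
            C[i * n + j] = alpha * acc + beta * C[i * n + j];
        }
    }
}
```

The point is only the calling convention: the caller works with standard containers and never touches an OpenCL buffer or command queue directly.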