I have a kernel that writes results to a global buffer; these results are never read back into the kernel (they are processed by another kernel at a later time).
So, I don't want this data sitting in the L1 cache if I can help it. Is there a way of ensuring that it is not cached?
I need L1 for another array that is frequently read from and written to. This array is around 1kb per work item, so it should stay in the L1 cache.
Thanks.
I have the same problem and also want to know the answer. Using GCN assembly is fine for me.