3 Replies Latest reply on Jun 15, 2010 7:30 PM by MicahVillmow

    Large Allocation Makes Kernel Run Slowly


      I'm using a 5870 with Win7 and the ATI StreamSDK 2.1

      I wrote a matrix multiplication kernel, and if I declare a 64x64 matrix of floats, my kernel takes approximately 3 times longer, even if I don't touch the array at all.  Does anybody know why this is?