    Optimization step in the video "How to Optimize Image Convolution" from "AMD Developer Inside Track"



      In the above video, the speaker, Mr. Bordoloi, optimizes image convolution.

      In the first optimization step, the data is copied from global memory to local memory. This is a step I am very interested in, but I could not figure out how to do it myself.

      It would be very kind if someone could help me.
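
For reference, here is a minimal sketch of what the global-to-local copy step usually looks like in an OpenCL kernel. It is not taken from the video, so the actual code shown there may differ. Everything named here is an assumption for illustration: the kernel name `convolve_local`, the `tile` buffer, the `FILTER_RADIUS` constant, an input image that the host has already padded by `FILTER_RADIUS` pixels on every side, and a global work size equal to the output size (and a multiple of the work-group size). The filter is applied unflipped.

```c
// Hypothetical kernel: names, tile layout, and padding scheme are assumptions.
// The input is assumed to be padded, so inWidth = outWidth + 2 * FILTER_RADIUS.
#define FILTER_RADIUS 1
#define FILTER_SIZE   (2 * FILTER_RADIUS + 1)

__kernel void convolve_local(__global const float *in,
                             __global float *out,
                             __constant float *filter,
                             const int inWidth,
                             const int outWidth,
                             __local float *tile)   // sized by the host
{
    const int lx = get_local_id(0);
    const int ly = get_local_id(1);
    const int lw = get_local_size(0);
    const int lh = get_local_size(1);

    // The tile covers the work-group's pixels plus a halo for the filter.
    const int tileW = lw + 2 * FILTER_RADIUS;
    const int tileH = lh + 2 * FILTER_RADIUS;

    // Top-left corner of this work-group's tile in the padded input.
    const int baseX = get_group_id(0) * lw;
    const int baseY = get_group_id(1) * lh;

    // Step 1: cooperative copy from global to local memory.
    // Each work-item strides over the tile, so the halo is loaded too.
    for (int y = ly; y < tileH; y += lh) {
        for (int x = lx; x < tileW; x += lw) {
            tile[y * tileW + x] = in[(baseY + y) * inWidth + (baseX + x)];
        }
    }

    // Step 2: wait until every work-item in the group has finished loading.
    barrier(CLK_LOCAL_MEM_FENCE);

    // Step 3: convolve, reading neighbours from local memory only.
    float sum = 0.0f;
    for (int fy = 0; fy < FILTER_SIZE; ++fy) {
        for (int fx = 0; fx < FILTER_SIZE; ++fx) {
            sum += filter[fy * FILTER_SIZE + fx]
                 * tile[(ly + fy) * tileW + (lx + fx)];
        }
    }

    out[get_global_id(1) * outWidth + get_global_id(0)] = sum;
}
```

On the host side, the `__local` argument gets its size but no data pointer, for example `clSetKernelArg(kernel, 5, tileW * tileH * sizeof(float), NULL)`, where `tileW` and `tileH` are computed from the chosen work-group size in the same way as in the kernel.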