Hi,
Please refer to Section 4.5 of the PDF below for details about memory objects:
http://developer.amd.com/download/AMD_Accelerated_Parallel_Processing_OpenCL_Programming_Guide.pdf
Yeah, I did that. Thanks.
Now my only concern is that the performance of the attached code, and of many other programs, is quite poor on the AMD CPU (A10-5800K), whereas it's quite good on AMD GPUs and even on the Intel CPU and GPU (i5-3470). Any explanation for that?
Would it be possible for you to share the performance numbers here? Let me see how I can help with this. I don't know the reason for this behaviour either, but I will try to figure it out.