Hi,
I am a college student and new to OpenCL programming.
I was trying to understand the matrix multiplication program given in the ATI samples.
1. My main problem is understanding the blocksize, which is 8 by default. I don't understand why the blocksize matters, what it does, or how changing it would affect the execution time.
2. The kernel file: I have been trying to understand the kernel file, and I would appreciate it if someone could tell me how to go about reading it.
Any help will be greatly appreciated.