Hi,
I am a college student trying to understand the new OpenCL environment.
I am using an Intel Xeon E5405 CPU @ 2.00GHz (quad core) running Linux (Ubuntu).
I have been studying the Matrix Multiplication sample code and have a few questions about it:
1. When I increase blocksize to 64, I get the error "Device does not support requested number of work items". What does this error indicate about my hardware?
The maximum work-group size reported for my device is 1024. Does this mean that my device has 1024 processing elements?
2. When I keep blocksize=32, does this mean that 32*32 elements are computed in parallel on a single compute unit?
3. What exactly is the definition of a processing element in the context of a CPU?
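To make questions 1 and 2 concrete, here is the arithmetic as I currently understand it, written as a small Python sketch. It assumes the sample launches a 2D work-group of blocksize x blocksize work-items, which I have not verified in the sample code; the function name is only mine for illustration.

```python
# Assumption: the matrix multiplication sample uses a square 2D work-group
# of block_size x block_size work-items (not verified against the sample).

MAX_WORK_GROUP_SIZE = 1024  # maximum work-group size reported for my device


def work_items_per_group(block_size):
    """Number of work-items in one square 2D work-group of side block_size."""
    return block_size * block_size


for block_size in (16, 32, 64):
    n = work_items_per_group(block_size)
    status = "fits" if n <= MAX_WORK_GROUP_SIZE else "exceeds the limit"
    print(f"blocksize={block_size}: {n} work-items per work-group, {status}")
```

If that assumption is right, blocksize=64 would request 4096 work-items per work-group, which is above the reported limit of 1024, but I would like confirmation that this is what the error actually means.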
Any kind of help is appreciated.