Matrix Multiplication - queries regarding blocksize and work group size

Discussion created by mohit2710 on Jan 1, 2010
Latest reply on Jan 1, 2010 by nou


I am a college student trying to understand the new OpenCL environment.

I am using a quad-core Intel Xeon E5405 CPU @ 2.00GHz, running Linux (Ubuntu).

I have been studying the Matrix Multiplication code, and I have a few questions about it:

1. When I increase the blocksize, I get the error "Device does not support requested number of work items". What does this error indicate about my hardware?

The maximum work-group size for my device is 1024. Does this mean that my device has 1024 processing elements?


2. When I keep blocksize = 32, does this mean that 32*32 elements are computed in parallel on a single compute unit?
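
To make the arithmetic behind questions 1 and 2 concrete, here is a minimal sketch in C of what I think is going on. It assumes the sample launches a square blocksize x blocksize work-group, which is my guess and not something I have confirmed, and the helper names are hypothetical (they are not from the sample or from the OpenCL API).

```c
#include <stddef.h>

/* Work-items in one work-group, ASSUMING the sample uses a square
   blocksize x blocksize local size (hypothetical helper, my assumption). */
size_t work_items_per_group(size_t blocksize)
{
    return blocksize * blocksize;
}

/* Returns 1 if such a work-group fits within the device limit, i.e. the
   value reported for CL_DEVICE_MAX_WORK_GROUP_SIZE (1024 on my device),
   and 0 otherwise. */
int fits_in_work_group(size_t blocksize, size_t max_work_group_size)
{
    return work_items_per_group(blocksize) <= max_work_group_size;
}
```

Under that assumption, fits_in_work_group(32, 1024) returns 1 because 32*32 = 1024, while fits_in_work_group(64, 1024) returns 0 because 64*64 = 4096. If this is right, it might explain why the error only shows up with a larger blocksize, but I would like confirmation.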


3. What exactly is the definition of a processing element when the device is a CPU?


Any kind of help is appreciated.