Hi, I am confused about how GCN scheduling work.
Let's say, if I define work group size is 32 and global work items is 262144 in OpenCL, then in four cycle clock there will be 32 PE (Processing Element)
( 16 PE in first clock cycle and 16 PE in second clock cycle) that worked and the other PE is idle for single wavefront right??
The total wavefront that worked will be 8192 wavefronts.
How about GCN ?? If I define work group size less than 64, are there idle PE or it is dynamically work for the next wavefront??
Can someone explain to me by using these two images ??
Wavefront Execution Example: SIMD vs. VLIW. Not To Scale - Wavefront Size 16
Copyright to Hiroshige Goto