What is the relationship between Compute Units, Stream Cores, Processing Elements and ALU?
Their definitions have already been covered in
But the description of Stream Cores doesn't match the device parameters. For example, the HD6850 has 12 Compute Units, 192 Stream Cores, and 960 Processing Elements. How can that be explained?
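To make the mismatch concrete, here is a small sketch that only divides the three figures quoted above. The variable names are mine, and the per-unit ratios are plain arithmetic on those figures, not values taken from any AMD document.

```python
# Device parameters quoted above for the HD6850.
compute_units = 12
stream_cores = 192
processing_elements = 960

# Ratios implied by those numbers (the divisions are exact).
stream_cores_per_cu = stream_cores // compute_units        # Stream Cores per Compute Unit
pes_per_stream_core = processing_elements // stream_cores  # Processing Elements per Stream Core
pes_per_cu = processing_elements // compute_units          # Processing Elements per Compute Unit

print(stream_cores_per_cu, pes_per_stream_core, pes_per_cu)  # prints: 16 5 80
```

So the quoted parameters imply 16 Stream Cores per Compute Unit and 5 Processing Elements per Stream Core.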
I'm also a little confused by the wavefront. The documentation says:
That instruction is repeated over four cycles to make the 64-element vector called a wavefront
Is a wavefront built from 16 ALUs (4 PEs) repeating the instruction 4 times, or from 64 ALUs (16 PEs)?
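For clarity, this is how I picture the first reading, as a minimal sketch. It assumes the quoted "four cycles" means the same instruction is issued to 16 lanes per cycle (16 is just 64 / 4), and the assignment of work-items to cycles is hypothetical, chosen only for illustration.

```python
WAVEFRONT_SIZE = 64  # elements in a wavefront, from the quoted sentence
CYCLES = 4           # "repeated over four cycles", from the quoted sentence

# Assumption: the wavefront is split evenly across the cycles.
lanes_per_cycle = WAVEFRONT_SIZE // CYCLES

# Hypothetical mapping: cycle c handles work-items c*16 .. c*16 + 15.
schedule = [
    list(range(c * lanes_per_cycle, (c + 1) * lanes_per_cycle))
    for c in range(CYCLES)
]

for cycle, work_items in enumerate(schedule):
    print(f"cycle {cycle}: work-items {work_items[0]}..{work_items[-1]}")
```

In the second reading there would be no such loop: all 64 elements would be processed at once by 64 separate lanes.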
Also, since the PEs come from the vector unit, does the scalar unit do any work in GPGPU? How do the two work?