guys, i still cannot definitely to imagine HOW gcn engine works as united unit, i mean dynamic view of wavefront, work-group, loading data to (from) registers. all i have read cannot be assembled into united alive dynamic movie. if anybody knows where i can look such movie (or picture sequence) with simple kernel example where instructions and data are flowing between global memory, LDS, registers and ALUs.
Best regards to all!