You should consider that the underlying architecture might not be an APU but CPU+GPU hence one should be able to hide memory transfer latency some other way ( through "stream" like processing for instance ). I don't think some things should be hidden.