tomhammo
Journeyman III

vector vs scalar memory operations

From the performance guide:

"The GPU memory subsystem can coalesce multiple concurrent accesses to global memory, provided the memory addresses increase sequentially across the work-items in the wavefront and start on a 128-byte alignment boundary."

So code like the following, where each work-item accesses the element at its own global ID, should be the most efficient:

float* data = ...
data[get_global_id(0)] = ...
... = data[get_global_id(0)]

However, does this also apply to vector data, where each work-item accesses a float4 instead of a float?

 

float4* data = ...
data[get_global_id(0)] = ...
... = data[get_global_id(0)]
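To make the two cases concrete, here is a rough host-side C sketch that models the byte addresses one wavefront would touch when every work-item accesses data[get_global_id(0)]. It is only a model of the sentence quoted from the guide, not of any real memory controller. The wavefront size of 64, the helper names fill_addresses, meets_quoted_conditions and wavefront_bytes, and the reading of "increase sequentially" as "each address directly follows the previous element" are all assumptions made for illustration.

```c
#include <stddef.h>
#include <stdint.h>

/* Assumption: 64 work-items per wavefront (not stated in the post). */
#define WAVEFRONT_SIZE 64
/* Alignment boundary taken from the quoted performance-guide sentence. */
#define ALIGNMENT 128

/* Hypothetical helper: fill addr[i] with the byte address that work-item
   (first_gid + i) touches when it accesses data[get_global_id(0)].
   elem_size is 4 for float and 16 for float4. */
static void fill_addresses(uintptr_t *addr, uintptr_t base,
                           size_t first_gid, size_t elem_size) {
    for (size_t i = 0; i < WAVEFRONT_SIZE; ++i) {
        addr[i] = base + (first_gid + i) * elem_size;
    }
}

/* Hypothetical check of the two quoted conditions:
   1. the first address sits on a 128-byte boundary, and
   2. each address directly follows the previous element
      (one interpretation of "increase sequentially").
   Returns 1 if both hold, 0 otherwise. */
static int meets_quoted_conditions(const uintptr_t *addr, size_t elem_size) {
    if (addr[0] % ALIGNMENT != 0) {
        return 0;
    }
    for (size_t i = 1; i < WAVEFRONT_SIZE; ++i) {
        if (addr[i] != addr[i - 1] + elem_size) {
            return 0;
        }
    }
    return 1;
}

/* Total number of bytes one wavefront touches for a given element size. */
static size_t wavefront_bytes(size_t elem_size) {
    return WAVEFRONT_SIZE * elem_size;
}
```

Under these assumptions both patterns pass the check when the buffer starts on a 128-byte boundary. The difference is the footprint: 64 float accesses cover 256 bytes, while 64 float4 accesses cover 1024 bytes. Whether the hardware services the larger float4 footprint as efficiently is exactly the open question, and this model does not answer it.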



regards,

- Tom

12 Replies