I am a beginner with OpenCL and GPU, and have some questions:
1. what is the connection between different work-groups, are they executed one by one or sometimes synchronous?
2. I know a wavefront have 32 or 64 work-items, what is essentially wavefront? a wavefront execute these work-items always synchronous?
3. here is a example about time complexity: I has 10 work-groups, each with 32 work-items, every work-item runs in O(n), what is total time complexity?
is O(10*n) or still O(n)? what about every group has 256 work-items?