Do I need to worry about atomic operations if work group size is <= 64 ? From previous discussion,
I do not need memory barriers for this size of work group.
My situation is: I have all work items ORing the same location in a local memory array.