    GCN: local memory barrier and work group size


      I have a kernel with a work-group size of 64, i.e. a single wavefront, running on the GCN architecture.

      Can I dispense with local memory barriers for this kernel?

      I realize that this may not work on future microarchitectures, but for GCN up to and including Fury, is it advisable to remove the barriers?