Hi,
Is there an AMD-specific, faster way to do a bit rotation on the GPU than the generic rotate(x,y)? I'm doing a LOT of these on scalar uint values, and they're one big bottleneck.
Thanks
-- edit -- Never mind, amd_bitalign ftw!
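
For anyone landing here later, a rough host-side sketch in plain C of why amd_bitalign works as a rotate. It assumes the semantics as I understand them from the cl_amd_media_ops extension: the two 32-bit inputs are joined into one 64-bit value, shifted right by the low 5 bits of the third argument, and the low 32 bits are returned. The helper names (bitalign_emulated, rotl_generic, rotl_via_bitalign, check_rotl_equivalence) are made up for illustration and are not part of any AMD API.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical host-side emulation of amd_bitalign(hi, lo, shift):
   join hi:lo into 64 bits, shift right by (shift & 31), keep the low 32 bits. */
static uint32_t bitalign_emulated(uint32_t hi, uint32_t lo, uint32_t shift) {
    uint64_t wide = ((uint64_t)hi << 32) | lo;
    return (uint32_t)(wide >> (shift & 31u));
}

/* Generic rotate-left, the same result rotate(x, n) gives for a scalar uint. */
static uint32_t rotl_generic(uint32_t x, uint32_t n) {
    n &= 31u;
    return (x << n) | (x >> ((32u - n) & 31u));
}

/* Rotate-left by n expressed as a bitalign of x with itself:
   shifting x:x right by (32 - n) leaves rotl(x, n) in the low word. */
static uint32_t rotl_via_bitalign(uint32_t x, uint32_t n) {
    return bitalign_emulated(x, x, 32u - n);
}

/* Self-check: both forms agree for every rotate amount 0..31 on one sample value. */
static void check_rotl_equivalence(void) {
    for (uint32_t n = 0; n < 32; ++n) {
        assert(rotl_via_bitalign(0xDEADBEEFu, n) == rotl_generic(0xDEADBEEFu, n));
    }
}
```

In an actual kernel, and assuming the cl_amd_media_ops extension is available and enabled on the device, the equivalent of rotate(x, n) would be amd_bitalign(x, x, 32 - n). Whether it is actually faster depends on the hardware and compiler version, so measure it on your own setup.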