Fantastic article by Sebastian Aaltonen on optimizing VGPR usage on GCN:
https://gpuopen.com/optimizing-gpu-occupancy-resource-usage-large-thread-groups/
Does anyone have any other tricks to add here ?