Subject, how can I setup register usage by kernel? Also, am I right that Evegreen has 16000 vector registers in each SIMD?
And what is maximum number of concurent work groups per SIMD engine?
So register allocation is handled automaticaly. Programmer does not need to worry. And what is the place of register spilling? Is it global memory like on Geforce? If I specify for example 256 group size with 100 registers in each thread, I will run out of register file.
And how can I obtain information about spilling registers? Via stream kernell analizer? And I think I am correct that maximum register number per work item is 256, am I?
Retrieving data ...