Originally posted by: MicahVillmow
Raistmer,
The problem is just that the array is to large and it fails some heuristic check that our cal compiler does on if it should attempt to use registers or not. |
That is, I should use 128 different variable names to take advantage of so many possible registers per workitem. Pity, indeed
Maybe some compiler switches (look at NV's compiler - it has option to limit/set number of registers per thread) that could replace default compiler behavior when needed?
In general, compiler can't be clever enough to cover all possible cases, right? Letting some manual control on its desisions can be very useful.
Surely my case not very suitable for GPU, but it
can be handled much better with already available hardware.