Reported ALU:Fetch = 2.92
Reported ALU:Fetch = 4.67
There are no loops in either kernel.
Kernel 1 is running faster. Does this make any sense at all? It doesn't to me. The only explanation I can think of is that both kernels are FETCH bound.
Both kernels run in about the same time, so that's my conclusion.
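To make that reasoning concrete, here is a toy model of it. This is only a sketch under my own assumptions, not how the SKA computes anything: it assumes ALU and fetch work overlap perfectly, so the slower of the two sets the run time. The function name, the per-instruction costs, and the instruction counts are all hypothetical and are not taken from the kernels above.

```python
def estimated_time(alu_ops, fetch_ops, alu_cost=1.0, fetch_cost=4.0):
    """Toy model: ALU and fetch work overlap, so the slower one sets the time.

    alu_cost and fetch_cost are made-up relative costs per instruction.
    """
    alu_time = alu_ops * alu_cost
    fetch_time = fetch_ops * fetch_cost
    bound = "ALU" if alu_time > fetch_time else "FETCH"
    return max(alu_time, fetch_time), bound


# Hypothetical instruction counts: same fetch work, different ALU work.
kernel_a = estimated_time(alu_ops=30, fetch_ops=10)
kernel_b = estimated_time(alu_ops=38, fetch_ops=10)
print(kernel_a, kernel_b)
```

In this model both kernels come out with the same estimated time and are FETCH bound, even though their ALU:Fetch ratios differ (3.0 vs 3.8). Extra ALU work only shows up in the run time once it exceeds the fetch time.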
MY QUESTION IS THIS: Why is the SKA reporting seemingly incorrect ALU:Fetch ratios? I could see this if the bottleneck was FETCH or Global Read/Write, but it reports it as ALU (even though it's obviously not).
I'm just curious about the strange ALU:Fetch ratio reporting. Any ideas, anyone?