cancel
Showing results for 
Search instead for 
Did you mean: 

Archives Discussions

landmann
Journeyman III

SKA Statistics - Different methods for different architectures

The following picture of a simple stockham radix-2 made me think about what is going on in the SKA...

After compilation, the kernel's loops are unrolled for all architectures (I used constants in an #ifdef SKA - #endif block just for this purpose).

The numbers I get for the 4xxx series are more or less sane, but all numbers for the 5xxx series completely depend on the settings for avg. loop count. I could understand this in case the compiler was unable to unroll loops, but this happened. So why do these settings have any influence at all?

I think it is important to get proper feedback from these tools, otherwise they are quite useless. Atm I would rather measure the performance than using SKA if I had some more HW around 😕

Same kernel, all loops known at compile time (unrolled asm)

 

0 Likes
1 Reply
himanshu_gautam
Grandmaster

Landman ,

Can you please provide this kernel.

0 Likes