This content has been marked as final. Show 2 replies
Originally posted by: Fuxianjun I test an addtion of two arrays with long length(for example 40000,e.g. a+b), and loop the same addtion many times(for example 100000 times). but this two kernels cost the same time, why ?
my gpu is ATI Radeon HD 5700 Series
Are you measuring kernel only time? What is your work group size?