By chance, I created a device buffer for my kernel but never accessed it.
Merely creating this unused buffer changed the performance of my kernels,
even though the buffer was small.
Could this be a bug in the driver? I am using the latest Crimson driver
on Windows 7 64-bit.