I want to measure my kernel performance (kernel execution time) vs. number of compute unit. I have 5870 card (20 cores). How to use only 10 of these compute units to execute the kernel?.
I have read it months ago in this forum, but I can not find it. It's like I have to set an environment variable or some setting in Visual Studio, Windows?