Just wanted to share...

I took a quick look at running the SimpleConvolution sample on my Intel-based system with varying workgroup sizes, and compared its performance against the regular CPU implementation: http://bit.ly/Y0WZu


