I think OpenCL should have a scan/reduce/sort library like the ones that exist for CUDA. For example, there are:
But I have seen nothing for OpenCL!
So I think that together we could create a project on Google Code (for example) and provide this kind of library.
What do you think? Is anyone interested in collaborating?
You can also simply join the mailing list: http://groups.google.com/group/cl-pp
We are also looking for companies to support us: to hire the right people, develop kernels, test on different hardware, and more. This could also be part of a thesis project, for example.
Feel free to contact us.