commands in command queue never work until "clFinish()" ?
I use a multi-core CPU to run OPENCL programs.
I try to "hide latency" by setting clEnqueReadBuffer()'s 3rd param "blocking_read" to CL_FLASE. but it seems that it never really execute until the "clFinish()" is called.
So I wonder if only the kernels run parallelly?