Hi, I was wondering how does ATI implement the CL_MEM_ALLOC_HOST_PTR flag in the clCreateBuffer() function (i.e. where does it allocate the buffer?). This is supposed to allocate the memory on the host, but Iam guessing that as some stage it has to be transferred to the device when the kernel executes.
Is this correct? If so how fast/slow is this implementation, and when should it be used?