We have a problem with the 11.4 driver (SDK2.4/CCC 11.3 is ok) in that it cause a specific kernel of ours. The problem is that it causes the driver to either freeze the computer/screen or causing a BSOD in ATIKMPAG.SYS.
The kernel in question is attached and is a transpose kernel that takes from src and puts into dst.
The local workgroupsize is 32,8,0 (2 dimensional) and the global size is 256,256.
The first time we ran this kernel, it ran just fine and produced the correct result. The next invocation of the same exe file causes windows 7 to lock up (or it bluescreens).
double extension is enabled using:
#pragma OPENCL EXTENSION cl_amd_fp64: enable
Original mem pointers are src and dst and from those, clCreateSubBuffer is used to make an offset into the array (the offset for this test is 0): // create pointer to subbuffer subSRC = createSubBuffer(src, CL_MEM_READ_ONLY); subDST = createSubBuffer(dst, CL_MEM_READ_WRITE); // set kernel arguments ccSetKernelArgs(kernel, 4, "cl_mem", &subDST, "cl_uint", dst.lda, "cl_mem", &subSRC, "cl_uint", src.lda);