I went through the prefix sum but did not understand the kernel.
I have a 3D stream that is the output of another kernel, and I want to run a prefix sum on it. Since the stream comes from that other kernel, I don't want to copy it to the host and then back to the device just to run the prefix sum.
I don't know how to write the prefix sum kernel for three dimensions.