I am working on some fairly large 3d volumes and generate gradients in both x, y and z at the same time. I have an iterative solution working on it, but am having some trouble fitting it all on the card.
The input volume is 184 by 220 by 184 in float (28.4MB). The working volumes are the same resolution, but in float4 (nearly 113.7MB a piece, 227.3MB total). Total: 255.7MB. Adding any extra volumes fails which seems to indicate a 256MB memory limit for OpenCL.
Is it possible to increase this? It is kind of small. :-P