I would have a question.
I have a PC with two HD4870X2 cards, for a total of 4 RV770 GPUs in the machine. I have written a kernel for parallel soft tissue simulation, and as expected, the required frequent data exchange can be a bottleneck.
On the software side, I use an architecture very close to diagram 3.11 in the Stream Computing User Guide (rev. 1.4.0), with one CPU-thread per GPU.
So the question is: is there a way to copy a memory resource resident in one video memory directly to the other video memory on the same card, without ever going across the PCIe-bus?
Likewise, is there a way to copy a resource on one card directly to the other card via PCIe-bus, without going through system memory?
Thanks a lot