Sorry, I could not find any other documents. Looks like related old links are broken now. Here is a forum discussion which may be helpful to get some insights on DirectGMA.
https://community.amd.com/t5/archives-discussions/directgma-between-a-fpga-and-gpu/m-p/304709
To interact with FirePro/RadeonPro GPU via DirectGMA, the 3rd party device/card needs to support DirectGMA as well. You may check with the card vendor if they can provide some samples and other references that demonstrate how to use DirectGMA for their card.
If your target platform is ROCm, I would suggest to check the documents available here: https://github.com/RadeonOpenCompute/ROCm
Please note, ROCm related support is provided at it's GitHub site. Please use the below link to post any query/issue.
https://github.com/RadeonOpenCompute/ROCm/issues
Does the GPU-framework allow for GPU-exclusive-loops that are infinite in time?
Sorry, the question is not clear to me. A typical program flow for GPU related host-code may look like below.
while(1) {
1. Wait for the frame data to be written in buffer
2. Launch a kernel to process the frame data in buffer
3. Once the processing is done, copy the result as required.
4. When the frame buffer is ready to reuse, send a signal to the frame grabber unit to load next frame
5. Display the result or send it to a target unit
}
P.S: To improve the performance, two buffers can be used as ping-pong buffers. While GPU is busy with one buffer, the other buffer can be used by the frame grabber to store the next frame data.
Thanks.