Asynchronous DMA and Computation

I haven't found much recent information on this subject on the forum. I saw that asynchronous DMA and kernel execution are supported in v2.7, so I am unsure how relevant the older (i.e., ~1 year old) posts on this subject still are.


To perform asynchronous reads, writes, and kernel execution with APP v2.9, do I need three command queues, or can I do this with one (out-of-order?) command queue?


Are out-of-order command queues supported on AMD GPUs at this point?