First of all I'd like to thank you for releasing the APU (gpu-compute) model for the gem5 simulator.
I've been trying to run some benchmarks with it following the MICRO slides you made available here GPU Models - gem5 , but I am having problems getting the kernels to read memory locations allocated by the host.
I have used the compiler toolchain and the simplified OpenCL 2.0 runtime API you provide to create the binary.
I've compiled the runtime with debug symbols and everything appears to be going OK, but it seems the kernel always gets the arguments to memory allocated by the host as null pointers:
The reduced API you provide has no implementation of clSVMAlloc or clSetKernelArgSVMPointer, so I am allocating the memory in the host with a normal malloc call, and passing the argument with clSetKernelArg.
Checking the debug output of the OpenCL runtime, it seems like its getting everything OK: