I am trying to run the ACML6 / HPL benchmark on an A10-7850K APU, and I seem to have hit some brick walls...
I am using the HPL.dat calculator at http://www.advancedclustering.com/faq/how-do-i-tune-my-hpldat-file.html to generate my input file.
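For reference, here is a rough Python sketch of the sizing rule that HPL.dat calculators commonly apply, as far as I understand it: the N x N matrix of 8-byte doubles should fill a fraction of the available memory, and N is rounded down to a multiple of the block size NB. The 80% memory fraction, NB = 192, and the function name are my own assumptions for illustration, not values confirmed by the calculator.

```python
import math

def hpl_problem_size(mem_mb, nodes=1, mem_fraction=0.80, nb=192):
    """Estimate an HPL problem size N for a given amount of memory.

    mem_mb       -- memory per node in MB
    nodes        -- number of nodes
    mem_fraction -- share of memory given to the matrix (assumed 80%)
    nb           -- HPL block size; N is rounded down to a multiple of it (assumed 192)
    """
    total_bytes = mem_mb * 1024 * 1024 * nodes
    # The matrix holds N*N double-precision values of 8 bytes each.
    n_max = int(math.sqrt(total_bytes * mem_fraction / 8))
    # Round down so that N is an exact multiple of NB.
    return (n_max // nb) * nb

if __name__ == "__main__":
    # My case: 1 node with 1024 MB.
    print(hpl_problem_size(1024))
```

If the GPU can only work on data that fits in its own allocated memory, the same arithmetic with that smaller size would give the largest N that fits there.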
The first problem is memory usage. I am not able to go over the memory allocated to the Spectre GPU. Shouldn't ACML6 be able to detect the GPU model and use host memory instead? Does it have to copy data around?
The second problem is performance. I used a case created with the HPL calculator (link above) with 1 node, 1 core/node and 1024 MB of RAM. If I run 1 process, I get about 19 GFLOPS, and if I run 4 processes I get 21.6 GFLOPS. If I run 1 process with the GPU inaccessible, I get 18 GFLOPS, and 4 processes give 18.5 GFLOPS. I tried setting ACML_LOG_FILTER=1, and the log entries seem to contain usegpu( 1 ), although it is not 1 in all entries.
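To put those four numbers side by side, here is a small Python sketch that computes the relative gains. The dictionary keys and helper names are just my own labels, and the figures are the approximate ones I quoted above.

```python
# Approximate HPL results in GFLOPS, keyed by (GPU setting, process count).
results = {
    ("gpu", 1): 19.0,
    ("gpu", 4): 21.6,
    ("cpu_only", 1): 18.0,
    ("cpu_only", 4): 18.5,
}

def gain_percent(new, base):
    """Relative improvement of `new` over `base`, in percent."""
    return (new / base - 1.0) * 100.0

def summarize(r):
    """Compare process scaling and GPU benefit across the four runs."""
    return {
        "gpu_1_to_4_procs": gain_percent(r[("gpu", 4)], r[("gpu", 1)]),
        "cpu_1_to_4_procs": gain_percent(r[("cpu_only", 4)], r[("cpu_only", 1)]),
        "gpu_vs_cpu_1_proc": gain_percent(r[("gpu", 1)], r[("cpu_only", 1)]),
        "gpu_vs_cpu_4_procs": gain_percent(r[("gpu", 4)], r[("cpu_only", 4)]),
    }

if __name__ == "__main__":
    for name, pct in summarize(results).items():
        print(f"{name}: {pct:+.1f}%")
```

In other words, going from 1 to 4 processes barely helps without the GPU, and having the GPU accessible adds only a modest amount on top, which is why I suspect most of the work is not actually being offloaded.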
Anyway, what is the best way to get good results? Does anybody have better HPL results?