I have a simple program in OpenCL which is using the Bolt 1.3 library. The code is following:
using ctrl = bolt::cl::control; ctrl bolt_control; bolt_control.setForceRunMode(ctrl::OpenCL); int N = 1024; bolt::cl::device_vector<int> devV(N, 0, CL_MEM_READ_WRITE, false, bolt_control);
I have executed this code in CodeXL. To my surprise I found that the function clReleaseContext takes 96% of the execution time. (please take a look at attached picture).
Could anyone please tell me why it takes so much time?
I've attached my clinfo log to show you how my OpenCL system looks like. In addition GPU displays the window manager at the same time, can it be the root cause of the issue?
Thank you in advance for your help.