eduardo, The header file has the API along with comments on the correct arguments, error codes, etc... This feature also is for CPU only and does not apply to running multiple kernels concurrently on a single GPU.
Say you have M CPU cores, this extension lets you assign kernels to N of your M CPU cores. This way a program can execute a different kernel for each CPU core.