The OpenCL compiler uses LLVM code generation for the x86[_64] architectures. You can see which optimizations are applied by looking at the LLVM project.
Could you please give some more suggestions?
I am using Windows 7 and Visual Studio, and when I build the CL program I do not pass any build options through the CL API; I just use the defaults. Will the AMD APP driver automatically make the LLVM compiler generate x64-optimized code? How do I know whether the generated CL program/kernel is optimized for x86 or x64?
More information can be found here:
We only compile for your host: if your host is x86_64, the kernel is compiled for x64; if your CPU only supports x86, it is compiled for x86.