cancel
Showing results for 
Search instead for 
Did you mean: 

Archives Discussions

zhuzxy
Journeyman III

How to utilize the auto vectorizing advantage for OpenCL on CPU?

when I use vector in the opencl code, like int4/char 4/uchar 8, is it guaranteed to be translated using SSE instructions on CPU?  It seems not in my cl code after check the amd app kernel analyzer. What shall I pay attention to in my cl code when I want to use the vectorize abilities for my opencl on CPU?

0 Likes
1 Reply
genaganna
Journeyman III

Originally posted by: zhuzxy when I use vector in the opencl code, like int4/char 4/uchar 8, is it guaranteed to be translated using SSE instructions on CPU?  It seems not in my cl code after check the amd app kernel analyzer. What shall I pay attention to in my cl code when I want to use the vectorize abilities for my opencl on CPU?


It will generate SSE instructions if you use vector types and there is a equivalent SSE instructions exist.

Please paste your kernel code there which allows us to verify whether it is generated or not

0 Likes