cancel
Showing results for 
Search instead for 
Did you mean: 

Archives Discussions

kbrafford
Adept II

float4 int4 and Phenom II x4 and x6

With a CPU context, if I use float4 and int4 data types, if all I am doing are literal loads,  additions, subtractions and multiplications, will my code be compiled into SIMD instructions that use MMX/SSE/etc?

0 Likes
2 Replies
nou
Exemplar

yes

0 Likes

So when people say that AMD has a slightly lesser SSE implementation than Intel (thus explaining why some video codecs perform better on Intel than AMD), what are they talking about? and what are the ramifications for OpenCL kernels?

Here's part of a table from AMD's website.  I am guessing the difference is SSE4a vs. SSE4.  Are there OpenCL features that will work better on Intel than AMD because of this?

AMD Phenom Processors

 

Intel Core 2 Quad

 

3D & Multimedia instructions

3DNow!™ technology, SSE, SSE2, SSE3, SSE4a

SSE, SSE2, SSE3, SSE4

0 Likes