
Emitting dmad in OpenCL

Discussion created by rick.weber on Aug 20, 2010
Latest reply on Aug 21, 2010 by rick.weber

Is there a way to get the compiler to emit dmad instructions without calling mad() or fma()? I looked at the fma macro and it does the following:

mdef(358)_out(1)_in(3)
mov r0, in0
mov r1, in1
mov r2, in2
dmad r0.xy__, r0.xy, r1.xy, r2.xy
mov out0, r0
mend

It seems the 4 mov instructions are extraneous: in0, in1, in2, and out0 could be fed directly into the dmad instruction.
