zeland

incorect calculation of FLOPS in time_dgemm.f

Discussion created by zeland on Sep 7, 2010
Latest reply on Sep 20, 2010 by zeland

size of problem for DGEMM in case alfa and beta not equal 1 or 0 is 3*N*K*M but in time_dgemm.f ACML-GPU Linux x64 DNFLOP = 2.0D-6*DBLE(M)*DBLE(N)*DBLE(K)

I suppose  that right is DNFLOP = 3.0D-6*DBLE(M)*DBLE(N)*DBLE(K)

Outcomes