2x AMD EPYC 7351 system only gets 61 GFlop/s with hpl
I am currently benchmarking a few AMD systems in our infrastructure.
I compiled the hpl 2.3 linpack benchmark using amd-blis-1.3 and I only get about 61 GFlops/s.
I already had a look at the HPC tuning guide for EPYC Processors and changed the HPL.dat accordingly but in this case the benchmark runs forever (ok, well, didn't returned a result after at least 3 hours)
Is there something I'm missing?
HPLinpack benchmark input file Innovative Computing Laboratory, University of Tennessee HPL.out output file name (if any) 6 device out (6=stdout,7=stderr,file) 1 # of problems sizes (N) 240000 Ns 1 # of NBs 191 NBs 0 PMAP process mapping (0=Row-,1=Column-major) 1 # of process grids (P x Q) 8 Ps 4 Qs 16.0 threshold 1 # of panel fact 2 PFACTs (0=left, 1=Crout, 2=Right) 1 # of recursive stopping criterium 4 NBMINs (>= 1) 1 # of panels in recursion 2 NDIVs 1 # of recursive panel fact. 2 RFACTs (0=left, 1=Crout, 2=Right) 1 # of broadcast 0 BCASTs (0=1rg,1=1rM,2=2rg,3=2rM,4=Lng,5=LnM) 1 # of lookahead depth 1 DEPTHs (>=0) 2 SWAP (0=bin-exch,1=long,2=mix) 8 swapping threshold 0 L1 in (0=transposed,1=no-transposed) form 0 U in (0=transposed,1=no-transposed) form 1 Equilibration (0=no,1=yes) 8 memory alignment in double (> 0)