[Wien] Quad Core Benchmark

Laurence Marks L-marks at northwestern.edu
Mon Jan 7 01:29:31 CET 2008


Xeon X3210 2.13GHz Quad Core (2x2Duo Core)
ifort 10.1.008 cmkl 10.0.1.014
FOPT =  -FR -mp1 -w -prec_div -pc80 -pad -ip -DINTEL_VML -xT
-mtune=core2 -O3 -thread -fminshared
LDFLAGS = $(FOPT) -L/opt/intel/mkl/10.0.1.014/lib/em64t -static
R_LIBS = -lmkl_lapack -lguide -pthread

1 Job  1 Thread            140 Secs
2 Jobs 1 Thread Each   150 Secs Each
1 Job   2 Threads           88 Secs
2 Jobs 2 Threads Each  112 Secs Each
4 Jobs 1 Thread Each    228 Secs Each

MPI Performance
Times only (Note: CPU Times for 2 Threads are not correct, they are a
sum over threads)
1 MPI, 1 Node, 1 Thread     1423 HAMILT (CPU )  =   223.0, HNS =
174.1, DIAG =  1021.4
1 MPI, 1 Node, 2 Threads   1038 HAMILT (CPU )  =   385.4, HNS =
194.3, DIAG =  1430.8
2 MPI, 1 Node, 1 Thread     1242 HAMILT (WALL) =   120.3, HNS =
129.8, DIAG =   988.4
2 MPI, 1 Node, 2 Threads   1105 HAMILT (WALL) =   130.6, HNS =
112.1, DIAG =   859.5
4 MPI, 1 Node, 1 Thread     1175 HAMILT (WALL) =    80.1, HNS =
116.8, DIAG =   977.9

Comments:
1) Intel has introduced a host of new environmental parameters so it
might be better to do better than this with the "right" options, but
probably not by much.
2) Even though OMP_NUM_THREADS=1 or 2 the documentation indicates that
this may not be honored.
3) 1 Job with 4 Threads is unstable. At best perhaps 80 seconds, at
worse it crashes.


-- 
Laurence Marks
Department of Materials Science and Engineering
MSE Rm 2036 Cook Hall
2220 N Campus Drive
Northwestern University
Evanston, IL 60208, USA
Tel: (847) 491-3996 Fax: (847) 491-7820
email: L-marks at northwestern dot edu
Web: www.numis.northwestern.edu
Commission on Electron Diffraction of IUCR
www.numis.northwestern.edu/IUCR_CED


More information about the Wien mailing list