[Wien] wien in parallel mode is slower than serial mode
nilton@ufba.br
nilton at ufba.br
Fri Dec 21 21:25:38 CET 2007
Dear Stefaan Cottenier
Thank very much for your answer.
> 1) I agree with Florent that it does not make much sense to base
> conclusions on such small jobs. Try something that has at least one
> minute execution time for lapw1.
I did that calculation for one large system (please see below) and the
results do not changed. As you can see my system have matrix size of
almost 4000 and the cpu time is almost 16 min to lapw1 what satisfy
your remark. I am not using mpi because according wien user's guide is
not necessary to shared memory system(as I said in my first e-mail I
have core 2 quad system).
.....
> compiled lapw1 with OMP_NUM_TRHEADS=1, while for lapw2 it is 4, and
> you probably run this on a quadcore cpu...? In that case, lapw2 would
> be somewhat parallellized even in a serial run, while lapw1 is not.
I afraid I not get your point. Please, could you tell where I can find
this variable OMP_NUM_THREADS? I searched in Makefile and in lapw1
files and don't found it.
regards,
Nilton
Here is the output of my system. As you can see it has the same
behavior as before, so, to solve my problem I need understand why
lapw2 spends long time even when compared with serial mode. In
parallel mode each instance of lapw1 and 2 need work with small number
of k-points than in serial so that do not make sense lapw2 take more
time in each operation.
------------------run in parallel mode-----------------------------
:RKM : MATRIX SIZE 3957LOs: 360 RKM= 8.99 WEIGHT= 1.00 PGR:
running lapw0 in single mode
81.944u 0.645s 0:39.98 206.5% 0+0k 0+7952io 0pf+0w
> lapw1 -p (15:30:36) starting parallel lapw1 at Fri Dec 21
> 15:30:36 BRT 2007
-> starting parallel LAPW1 jobs at Fri Dec 21 15:30:36 BRT 2007
running LAPW1 in parallel mode (using .machines)
4 number_of_parallel_jobs
localhost(6) 725.214u 2.050s 16:34.90 73.0% 0+0k 0+159168io 0pf+0w
localhost(6) 684.668u 2.080s 15:57.46 71.7% 0+0k 0+152816io 0pf+0w
localhost(6) 699.967u 2.097s 16:10.79 72.3% 0+0k 8+153688io 0pf+0w
localhost(6) 688.346u 1.890s 16:02.18 71.7% 0+0k 8+154800io 0pf+0w
Summary of lapw1para:
localhost k=24 user=2798.2 wallclock=3885.33
2799.483u 9.766s 16:36.02 282.0% 0+0k 16+621200io 0pf+0w
> lapw2 -p (15:47:12) running LAPW2 in parallel mode
localhost 1246.379u 41.656s 19:13.26 111.6% 0+0k 0+11392io 0pf+0w
localhost 952.876u 34.233s 16:58.17 96.9% 0+0k 0+11392io 0pf+0w
localhost 552.495u 19.128s 11:48.03 80.7% 0+0k 0+11392io 0pf+0w
localhost 781.333u 27.519s 15:09.69 88.9% 0+0k 0+11392io 0pf+0w
Summary of lapw2para:
localhost user=3533.08 wallclock=3789.15
3533.914u 122.942s 19:15.69 316.4% 0+0k 8+58696io 0pf+0w
> lcore (16:06:28) 0.119u 0.061s 0:00.24 70.8% 0+0k 0+4248io 0pf+0w
> mixer (16:06:28) 1.053u 0.180s 0:00.98 125.5% 0+0k 0+14200io 0pf+0w
:ENERGY convergence: 1 0.0001 .0000100000000000
:CHARGE convergence: 0 0.0000 .0005089
ec cc and fc_conv 1 1 1
> stop
----------------------------------------------------------------
Universidade Federal da Bahia - http://www.portal.ufba.br
More information about the Wien
mailing list