[Wien] Problem with k-parallel in version 24.1?

Peter Blaha peter.blaha at tuwien.ac.at
Wed Oct 16 09:17:13 CEST 2024


> In case.dayfile, it may show something like this at the cycle where it 
> crashes:
> 
>        bb4u14c1 10.023u 17.606s 28.77 96.02% 0+0k 0+0io 0pf+0w

A timing statement like this tells you immediately that the job is not 
running well.
Your program (lapw2) needs 17 sec cpu time, but there is an additional 
10 second !!! system time (probably waiting for I/O).
The system time should always be much smaller.

....
> 
>        bb4u20c1 8.505u 1.509s 10.77 92.90% 0+0k 0+0io 0pf+0w
> 
>     Summary of lapw2para:
> 
>     bb4u14c1      user=452.054    wallclock=84644
> 
>     bb4u15c1      user=8.589      wallclock=712.33
> 
> **  LAPW2 crashed!
> 
> 3.440u 20.832s 1:12.41 33.5%    0+0k 0+67272io 12pf+0w

And this is even worse: Your (failed) job needed 72 seconds, although 
the cpu time of one step is only 17 seconds.

Always check the timing in the dayfile.   MORE parallelization 
(k-points, omp, mpi-cores) does NOT NECESSARILY mean, that it will run 
faster, but it can even dramatically SLOW DOWN.

PS: Often, the speed is also limited by memory access, so it could be 
that on a 64 core machine you should actually use only 32 .... But all 
these things are really hardware specific and you MUST do your personal 
timing tests to find out the best setup.

> 
> error: command   /home/yz155/WIEN2k_24.1/lapw2cpara -dn -c dnlapw2.def   
> failed
> 
> 
>  >   stop error
> 
> 
> As one can see for example, some cores on node bb4u14c1 did not run its 
> lapw2 -dn job, therefore no corresponding scf2dn file produced. The 
> dnlapw2 def files were produced though.
> 
> 
> 
> Best regards
> 
> Yichen
> 
> 
> _______________________________________________
> Wien mailing list
> Wien at zeus.theochem.tuwien.ac.at
> http://zeus.theochem.tuwien.ac.at/mailman/listinfo/wien
> SEARCH the MAILING-LIST at:  http://www.mail-archive.com/wien@zeus.theochem.tuwien.ac.at/index.html

-- 
-----------------------------------------------------------------------
Peter Blaha,  Inst. f. Materials Chemistry, TU Vienna, A-1060 Vienna
Phone: +43-158801165300
Email: peter.blaha at tuwien.ac.at
WWW:   http://www.imc.tuwien.ac.at      WIEN2k: http://www.wien2k.at
-------------------------------------------------------------------------



More information about the Wien mailing list