[Wien] lapw1 vs lapwso speed

sjalali at sci.ui.ac.ir sjalali at sci.ui.ac.ir
Sun Jun 11 15:35:32 CEST 2023


  Dear Lukasz,

  The difference in computation time between lapw1 and lapwso  
calculations is expected in band calculations. The lapwso step  
involves the calculation of spin-orbit coupling, which can be  
computationally more demanding compared to the lapw1 step that  
calculates the bands without spin-orbit coupling.

  The lapwso calculation includes additional interactions between the  
spin of the electron and its orbital motion, which requires more  
computational resources and time. Therefore, it is normal to observe a  
longer runtime for lapwso compared to lapw1.

  In your case, the lapwso process is utilizing a significant portion  
of the CPU resources, as indicated by the high CPU usage percentages  
(%CPU) in the top output. The memory usage (%MEM) is also relatively  
high for the lapwso processes.

  It appears that you have allocated sufficient resources (OMP=2) for  
the lapwso step, and your system has ample memory available.  
Therefore, the longer runtime can be attributed to the inherent  
complexity of the spin-orbit coupling calculations rather than a  
resource limitation.

  If you need to optimize the performance further, you may consider  
adjusting the OMP settings or exploring parallelization options with  
k-points or MPI to distribute the workload across multiple cores or  
processors. However, it's important to note that the total runtime for  
the lapwso step will inherently be longer due to the nature of the  
calculations involved.

Suggestions for trying:
omp_global:4
#omp_lapw1:2
#omp_lapw2:2
#omp_lapwso:2

In WIEN2k, k-points parallelization can be more efficient. You can use  
the testpara_lapw command to assess if increasing the number of  
"1:localhost" lines in your .machines file is necessary. testpara_lapw  
is a utility program in the WIEN2k package that helps determine the  
optimal number of lines (k-points) needed for accurate calculations.

Compiling the code with appropriate optimization flags can  
significantly improve the performance and speed of calculations. Here  
are some additional suggestions related to code compilation:
Experiment with different optimization levels. Most compilers provide  
different optimization levels, such as -O1, -O2, -O3. Higher  
optimization levels generally provide better performance but may  
increase compilation time. Use the suggestion of the siteconfig_lapw  
for the right balance of the code.
Ensure that you are using the latest version of the code, i.e.,  
WIEN2k_23.2. The respectful Developers have recently released  
significant updates and bug fixes that can improve performance, super  
thanks to Peter Blaha and all the developers.

Best regards,
Saeid


Quoting pluto via Wien <wien at zeus.theochem.tuwien.ac.at>:

> Dear All,
>
> When calculating bands for a large slab I have following sequence:
>
> Sun May 14 12:33:03 PM CEST 2023> (x) lapw1 -band -up -p
> Sun May 14 02:25:26 PM CEST 2023> (x) lapw1 -band -dn -p
> Sun May 14 04:17:22 PM CEST 2023> (x) lapwso -up -p
> Mon May 15 01:30:05 AM CEST 2023> (x) qtl -up -p -band -so
> Mon May 15 01:30:05 AM CEST 2023> (x) lapw2 -p -fermi -so -up
> Mon May 15 01:51:51 AM CEST 2023> (x) qtl -dn -p -band -so
> Mon May 15 01:51:51 AM CEST 2023> (x) lapw2 -p -fermi -so -dn
>
> As you can see lapwso takes much longer than lapw1 (approx. 9h vs  
> 2h). Is this normal for band calculations?
>
> I have 128 GB of RAM in this computer, so this is not a RAM issue.  
> Here is what top shows for the lapwso calculation (I have 4 parallel  
> localhost processes in .machines, OMP=2 and no mpi):
>
> Tasks: 505 total,   2 running, 503 sleeping,   0 stopped,   0 zombie
> %Cpu(s): 24.0 us,  0.2 sy,  0.0 ni, 75.7 id,  0.0 wa,  0.1 hi,  0.0  
> si,  0.0 st
> MiB Mem : 128047.1 total,   1845.8 free,  16809.0 used, 111158.6 buff/cache
> MiB Swap:  32088.0 total,  31471.5 free,    616.5 used. 111238.1 avail Mem
>
>    PID USER      PR  NI    VIRT    RES    SHR S  %CPU  %MEM     TIME+ COMMAND
> 1336417 lplucin   20   0 6417856   4.8g  15840 R 199.3   3.8   1294:13 lapwso
> 1336392 lplucin   20   0 2848204   2.3g  15880 S 146.8   1.9   1295:30 lapwso
> 1336391 lplucin   20   0 2848188   2.4g  15916 S 130.6   1.9   1304:23 lapwso
> 1336396 lplucin   20   0 2848060   2.3g  15816 S  99.7   1.9   1288:06 lapwso
>
> .machines file:
>
> omp_global:8
> omp_lapw1:2
> omp_lapw2:2
> omp_lapwso:2
> 1:localhost
> 1:localhost
> 1:localhost
> 1:localhost
> granularity:1
>
> Best,
> Lukasz
> _______________________________________________
> Wien mailing list
> Wien at zeus.theochem.tuwien.ac.at
> http://zeus.theochem.tuwien.ac.at/mailman/listinfo/wienSEARCH the  
> MAILING-LIST at:   
> http://www.mail-archive.com/wien@zeus.theochem.tuwien.ac.at/index.html
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://zeus.theochem.tuwien.ac.at/pipermail/wien/attachments/20230611/abca7a5c/attachment.htm>


More information about the Wien mailing list