[Wien] lapw1 vs lapwso speed
sjalali at sci.ui.ac.ir
Sun Jun 11 15:35:32 CEST 2023
Dear Lukasz,
The difference in computation time between lapw1 and lapwso
calculations is expected in band calculations. The lapwso step
involves the calculation of spin-orbit coupling, which can be
computationally more demanding compared to the lapw1 step that
calculates the bands without spin-orbit coupling.
The lapwso calculation includes additional interactions between the
spin of the electron and its orbital motion, which requires more
computational resources and time. Therefore, it is normal to observe a
longer runtime for lapwso compared to lapw1.
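To put numbers on the "approx. 9h vs 2h" estimate, the duration of each step can be computed from the start times in the quoted log. The following is only a minimal sketch: the timestamps are copied from your message with the time zone token dropped (all are CEST), and it assumes each step runs until the next one starts.

```python
from datetime import datetime

# Start times copied from the quoted log (weekday and "CEST" dropped).
starts = [
    ("lapw1 -band -up", "May 14 12:33:03 PM 2023"),
    ("lapw1 -band -dn", "May 14 02:25:26 PM 2023"),
    ("lapwso -up",      "May 14 04:17:22 PM 2023"),
    ("qtl -up",         "May 15 01:30:05 AM 2023"),
]

fmt = "%b %d %I:%M:%S %p %Y"
times = [(name, datetime.strptime(stamp, fmt)) for name, stamp in starts]

# Assumption: a step lasts until the next step in the log starts.
durations = {
    name: (times[i + 1][1] - t).total_seconds()
    for i, (name, t) in enumerate(times[:-1])
}

for name, seconds in durations.items():
    print(f"{name:16s} {seconds / 3600:5.2f} h")

ratio = durations["lapwso -up"] / durations["lapw1 -band -up"]
print(f"lapwso / lapw1 (up) = {ratio:.1f}")
```

With these timestamps, each lapw1 run takes a little under 1.9 h and lapwso about 9.2 h, so lapwso is roughly 4.9 times slower than a single lapw1 run.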
In your case, each of the four lapwso processes uses between about
100% and 200% CPU in the top output, which is consistent with OMP=2,
while the machine as a whole is about 76% idle. The memory usage
(%MEM) is modest, at 1.9 to 3.8% per lapwso process.
It appears that you have allocated sufficient resources (OMP=2) for
the lapwso step, and your system has ample memory available.
Therefore, the longer runtime can be attributed to the inherent
complexity of the spin-orbit coupling calculations rather than a
resource limitation.
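As a rough check of how well the requested threads are being used, the %CPU values from the quoted top output can be compared with the ceiling of 4 jobs times 2 OpenMP threads. This is only a sketch based on a single top snapshot, so the numbers will fluctuate during the run.

```python
# %CPU values copied from the quoted top output, one per lapwso process.
cpu_percent = [199.3, 146.8, 130.6, 99.7]

n_jobs = 4       # "1:localhost" lines in the quoted .machines file
omp_threads = 2  # omp_lapwso:2 in the quoted .machines file

used = sum(cpu_percent)                    # total %CPU used by lapwso
requested = 100.0 * n_jobs * omp_threads   # ceiling if every thread were busy
thread_utilisation = used / requested

print(f"lapwso uses {used:.1f}% CPU of {requested:.0f}% requested "
      f"({thread_utilisation:.0%})")
```

In this snapshot the four processes together use about 576% of the 800% they could use, i.e. roughly 72% of the requested threads are busy.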
If you need to optimize the performance further, you may consider
adjusting the OMP settings or exploring parallelization options with
k-points or MPI to distribute the workload across multiple cores or
processors. However, it's important to note that the total runtime for
the lapwso step will inherently be longer due to the nature of the
calculations involved.
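Suggested settings to try in .machines (the lines starting with # are
commented out, so that omp_global applies to every program):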
omp_global:4
#omp_lapw1:2
#omp_lapw2:2
#omp_lapwso:2
In WIEN2k, k-point parallelization is often more efficient than adding
OpenMP threads. You can use the testpara_lapw command to check whether
increasing the number of "1:localhost" lines in your .machines file
makes sense. testpara_lapw is a utility script in the WIEN2k package
that reads .machines and shows how the k-points will be distributed
over the listed parallel jobs.
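As an illustration of trading OpenMP threads for more k-point parallel jobs, here is a minimal sketch that builds the text of a .machines file. The function machines_text is a hypothetical helper, not part of WIEN2k; it only reproduces the line types that appear in your quoted file, and the total of 16 cores is an assumption, since the thread does not state the core count.

```python
def machines_text(n_jobs: int, omp_threads: int, host: str = "localhost") -> str:
    """Build .machines content for k-point parallel jobs on one host.

    Hypothetical helper: it only emits the line types seen in the quoted
    .machines file (omp_global, "1:host", granularity).
    """
    lines = [f"omp_global:{omp_threads}"]
    lines += [f"1:{host}"] * n_jobs   # one line per k-point parallel job
    lines.append("granularity:1")
    return "\n".join(lines) + "\n"


# Assumed example: 16 cores split into 8 k-point jobs with 2 threads each.
total_cores = 16   # assumption, not taken from the thread
omp_threads = 2
n_jobs = total_cores // omp_threads

print(machines_text(n_jobs, omp_threads), end="")
```

The idea is to keep n_jobs times omp_threads at or below the number of physical cores, and then to run testpara_lapw on the resulting file before starting the calculation.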
Compiling the code with appropriate optimization flags can
significantly improve the performance and speed of calculations. Here
are some additional suggestions related to code compilation:
Experiment with different optimization levels. Most compilers provide
several levels, such as -O1, -O2, and -O3. Higher optimization levels
generally give better performance but may increase compilation time.
The compiler options suggested by siteconfig_lapw are a good starting
point for balancing speed and reliability.
Ensure that you are using the latest version of the code, i.e.,
WIEN2k_23.2. The developers have recently released significant updates
and bug fixes that can improve performance. Many thanks to Peter Blaha
and all the developers.
Best regards,
Saeid
Quoting pluto via Wien <wien at zeus.theochem.tuwien.ac.at>:
> Dear All,
>
> When calculating bands for a large slab I have following sequence:
>
> Sun May 14 12:33:03 PM CEST 2023> (x) lapw1 -band -up -p
> Sun May 14 02:25:26 PM CEST 2023> (x) lapw1 -band -dn -p
> Sun May 14 04:17:22 PM CEST 2023> (x) lapwso -up -p
> Mon May 15 01:30:05 AM CEST 2023> (x) qtl -up -p -band -so
> Mon May 15 01:30:05 AM CEST 2023> (x) lapw2 -p -fermi -so -up
> Mon May 15 01:51:51 AM CEST 2023> (x) qtl -dn -p -band -so
> Mon May 15 01:51:51 AM CEST 2023> (x) lapw2 -p -fermi -so -dn
>
> As you can see lapwso takes much longer than lapw1 (approx. 9h vs
> 2h). Is this normal for band calculations?
>
> I have 128 GB of RAM in this computer, so this is not a RAM issue.
> Here is what top shows for the lapwso calculation (I have 4 parallel
> localhost processes in .machines, OMP=2 and no mpi):
>
> Tasks: 505 total, 2 running, 503 sleeping, 0 stopped, 0 zombie
> %Cpu(s): 24.0 us, 0.2 sy, 0.0 ni, 75.7 id, 0.0 wa, 0.1 hi, 0.0 si, 0.0 st
> MiB Mem : 128047.1 total, 1845.8 free, 16809.0 used, 111158.6 buff/cache
> MiB Swap: 32088.0 total, 31471.5 free, 616.5 used. 111238.1 avail Mem
>
> PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
> 1336417 lplucin 20 0 6417856 4.8g 15840 R 199.3 3.8 1294:13 lapwso
> 1336392 lplucin 20 0 2848204 2.3g 15880 S 146.8 1.9 1295:30 lapwso
> 1336391 lplucin 20 0 2848188 2.4g 15916 S 130.6 1.9 1304:23 lapwso
> 1336396 lplucin 20 0 2848060 2.3g 15816 S 99.7 1.9 1288:06 lapwso
>
> .machines file:
>
> omp_global:8
> omp_lapw1:2
> omp_lapw2:2
> omp_lapwso:2
> 1:localhost
> 1:localhost
> 1:localhost
> 1:localhost
> granularity:1
>
> Best,
> Lukasz
> _______________________________________________
> Wien mailing list
> Wien at zeus.theochem.tuwien.ac.at
> http://zeus.theochem.tuwien.ac.at/mailman/listinfo/wien
> SEARCH the MAILING-LIST at:
> http://www.mail-archive.com/wien@zeus.theochem.tuwien.ac.at/index.html