[Wien] speedup with mpi
pluto
pluto at physics.ucdavis.edu
Fri Nov 15 01:08:06 CET 2024
Dear All,
Does anyone has experience in running WIEN2k on a cluster or
supercomputer with both k-parallel and mpi? My interest is in band
structure calculations of large slabs (e.g. >20 atoms). I would
appreciate any comment/remark.
- I am wondering if there is a realistic speedup when using mpi? Can I
have e.g. 10x speedup only from mpi, compared to single core? Will
speedup then multiply with k-parallel?
- Does mpi on a single node also suffer from the memory bandwidth
(related to the number of memory channels on the chipset/mainboard)?
- Has anyone been able to find a workaround for the passwordless ssh for
running on a cluster/supercomputer?
- Is the memory bandwidth problem intrinsic to LAPW, or specific to
WIEN2k? With two memory channels on desktop machines, the k-parallel
speedup is only up to something like 4 cores (depending on OMP a bit).
Actually WIEN2k speed does increase quite a lot with the RAM speed (e.g.
DDR5 7200) -- this is the case on i9-14900 that I am using in the
office.
For few years I have been using an older slurm cluster, and jobs
typically crash when using more that 8 or 10 nodes with k-parallel (also
with small OMP, I think this has been discussed in the mailing list). In
my case each node has 8 cores and 4 memory channels, so it can do 8
k-parallel jobs with practically linear speedup. I would say that in
general extra speedup with OMP is not significant, and the real speedup
is only with k-parallel (but as motioned it is limited by the effective
number of nodes, probably because slurm does not like too many
passwordless ssh connections).
I am asking because I am wondering if an effort to setup WIEN2k on a
supercomputer makes any sense at all. Having a bit faster single core
might not be worth the effort.
Best,
Lukasz
More information about the Wien
mailing list