[Wien] Mpi &ssh ssue (not a bug)

Laurence Marks L-marks at northwestern.edu
Thu Nov 14 14:31:04 CET 2013


I am posting this for general information only. In some cases (rare) the
mpi versions of Wien2k can hang forever when ssh is being used as a
launcher because one of the ssh process has become a zombie. This can occur
with impi and mvapich, perhaps others as well.

One reason (there may be others) is a hardware problem, in the case I can
reproduce a heating problem on one node, which showed up in
/var/log/messages. The actions taken by kernel left the ssh connection as a
zombie, and it appears that current mpi versions do not trap this.

---------------------------
Professor Laurence Marks
Department of Materials Science and Engineering
Northwestern University
www.numis.northwestern.edu 1-847-491-3996
"Research is to see what everybody else has seen, and to think what nobody
else has thought"
Albert Szent-Gyorgi
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://zeus.theochem.tuwien.ac.at/pipermail/wien/attachments/20131114/78b04920/attachment.htm>


More information about the Wien mailing list