[Wien] dstart_mpi error
Gavin Abo
gsabo at crimson.ua.edu
Thu Jul 19 04:23:25 CEST 2018
As the error message says, one possible cause is the connection being
blocked by a firewall.
Another possible cause is a problem with passwordless SSH access:
https://stackoverflow.com/questions/19565795/unable-to-execute-mpich2-on-multiple-machines-on-ubuntu-12-04-hydu-sock-connect
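One way to test passwordless access from the master is sketched below. It is only a minimal sketch: the helper names are made up for illustration, and it assumes the standard OpenSSH client is installed. With BatchMode=yes, ssh fails instead of prompting for a password, so a non-zero exit code points to a key or login problem.

```python
import subprocess

def ssh_check_command(host, timeout=5):
    """Build an ssh command that fails instead of prompting for a password."""
    return ["ssh", "-o", "BatchMode=yes",
            "-o", f"ConnectTimeout={timeout}", host, "true"]

def can_login_without_password(host, timeout=5):
    """Return True if 'ssh host true' succeeds with no interactive prompt."""
    try:
        result = subprocess.run(ssh_check_command(host, timeout),
                                stdout=subprocess.DEVNULL,
                                stderr=subprocess.DEVNULL,
                                timeout=timeout + 5)
    except (OSError, subprocess.TimeoutExpired):
        # ssh is missing or the connection hung: treat as a failed check.
        return False
    return result.returncode == 0
```

Calling can_login_without_password("node1") on the master, and likewise for each node in the machines file, should return True. Since the MPI job also connects between nodes, it may be worth repeating the check from node1 and node2 back to the master.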
Yet another possible cause is a hostname (DNS) resolution problem:
https://forums.suse.com/archive/index.php/t-6057.html
https://www.slothparadise.com/running-mpi-common-mpi-troubleshooting-problems/
Since /etc/hosts usually cannot be edited by a user, the cluster
administrator would have to fix the hosts file if that happens to be the
source of the problem.
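To see which names fail to resolve, something like the following could be run on the master and on each node. It is only a sketch: the function names are made up for illustration, and the parser handles just the kinds of lines shown in the quoted machines file (lapw0:, dstart: and "1:host:12" entries), not every keyword a .machines file can contain.

```python
import socket

def hosts_from_machines(text):
    """Collect unique host names from lines such as
    'lapw0:master:12 node1:12' or '1:node1:12'."""
    hosts = []
    for line in text.splitlines():
        line = line.strip()
        if ":" not in line or line.startswith("#"):
            continue
        key, rest = line.split(":", 1)
        # Only handle the line types from the quoted file; skip other keywords.
        if not (key.isdigit() or key in ("lapw0", "dstart")):
            continue
        for entry in rest.split():
            host = entry.split(":", 1)[0]  # strip the ':12' core count
            if host and host not in hosts:
                hosts.append(host)
    return hosts

def unresolved(hosts, resolve=socket.gethostbyname):
    """Return the hosts whose names cannot be resolved on this machine."""
    bad = []
    for host in hosts:
        try:
            resolve(host)
        except OSError:  # socket.gaierror is a subclass of OSError
            bad.append(host)
    return bad
```

For example, unresolved(hosts_from_machines(open(".machines").read()) + ["calcul.local"]) lists the names that the current machine cannot resolve. Including calcul.local is probably worthwhile, since the error suggests that the nodes are trying to reach the master under that name.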
On 7/18/2018 6:07 PM, karima Physique wrote:
> Dear wien2k users:
>
> Using the following machines file:
> lapw0:master:12
> dstart:master:12
> 1:master:12
> 1:node1:12
> 1:node2:12
> ......
> the calculation works very well, but using the following machines file:
> lapw0:master:12 node1:12 node2:12
> dstart:master:12 node1:12 node2:12
> 1:master:12
> 1:node1:12
> 1:node2:12
> .......
> I got the following error:
>
> unable to get host adress calcul.local for (1)
> unable to connect to server calcul.local at port 44295 (chek for
> firewalls!)
> Note that calcul.local is the host we use to connect to w2web.
> Any suggestions for solving this problem would be appreciated.