[Wien] dstart_mpi error

Gavin Abo gsabo at crimson.ua.edu
Thu Jul 19 04:23:25 CEST 2018


As the error message says, one possible cause is the connection being 
blocked by a firewall.

Another possible cause is a problem with passwordless ssh access:

https://stackoverflow.com/questions/19565795/unable-to-execute-mpich2-on-multiple-machines-on-ubuntu-12-04-hydu-sock-connect
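
A quick way to test the passwordless access is to try a non-interactive 
ssh login to each node from the master. The following is only a minimal 
sketch in Python, not something shipped with WIEN2k: the function names 
are hypothetical, and the host names are simply taken from the machines 
file in the quoted message. It relies on the standard OpenSSH options 
BatchMode and ConnectTimeout, so ssh fails instead of prompting for a 
password.

```python
import subprocess


def ssh_check_command(host, timeout_s=5):
    """Build an ssh command that fails instead of asking for a password."""
    return [
        "ssh",
        "-o", "BatchMode=yes",                 # never prompt for a password
        "-o", f"ConnectTimeout={timeout_s}",   # give up on unreachable hosts
        host,
        "true",                                # harmless remote command
    ]


def passwordless_ok(host, run=subprocess.run):
    """Return True if the non-interactive ssh login to host succeeds."""
    try:
        proc = run(
            ssh_check_command(host),
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
            timeout=15,
        )
    except (OSError, subprocess.TimeoutExpired):
        # ssh is not installed, or the connection hung.
        return False
    return proc.returncode == 0


if __name__ == "__main__":
    # Host names assumed from the machines file in the quoted message.
    for host in ["master", "node1", "node2"]:
        status = "ok" if passwordless_ok(host) else "FAILED"
        print(f"{host}: passwordless ssh {status}")
```

If a node reports FAILED here, the ssh keys for that node are the first 
thing to check. Note that a failure can also come from a name resolution 
or firewall problem, so this test does not separate the three causes by 
itself.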

Yet another possible cause is a problem resolving the DNS hostname:

https://forums.suse.com/archive/index.php/t-6057.html
https://www.slothparadise.com/running-mpi-common-mpi-troubleshooting-problems/

Since /etc/hosts usually cannot be edited by a user, the cluster 
administrator would have to fix the hosts file if that happens to be the 
source of the problem.
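
To see whether name resolution is the culprit before contacting the 
administrator, one can check that every host name involved resolves to 
an address. The sketch below is an assumption-laden illustration, not a 
WIEN2k tool: the helper names are hypothetical, and the parser only 
handles the line shapes shown in the quoted machines file (it skips the 
granularity and extrafine keywords). It also checks the name reported by 
the local machine itself, since that is the kind of name (calcul.local 
in the error) that the other nodes are trying to reach.

```python
import socket

# .machines keywords that are not followed by host names.
NON_HOST_KEYS = {"granularity", "extrafine"}


def hosts_from_machines(text):
    """Collect unique host names from lines such as
    'lapw0:master:12 node1:12' or '1:node1:12'."""
    hosts = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or ":" not in line:
            continue
        key, rest = line.split(":", 1)
        if key in NON_HOST_KEYS:
            continue
        for entry in rest.split():
            host = entry.split(":")[0]   # drop the ':12' core count
            if host and host not in hosts:
                hosts.append(host)
    return hosts


def check_resolution(hosts, resolve=socket.gethostbyname):
    """Map each host name to its address, or to None if lookup fails."""
    result = {}
    for host in hosts:
        try:
            result[host] = resolve(host)
        except OSError:   # socket.gaierror is a subclass of OSError
            result[host] = None
    return result


if __name__ == "__main__":
    example = """lapw0:master:12 node1:12 node2:12
dstart:master:12 node1:12 node2:12
1:master:12
1:node1:12
1:node2:12
"""
    names = hosts_from_machines(example)
    local = socket.gethostname()   # the name this machine reports for itself
    if local not in names:
        names.append(local)
    for host, addr in check_resolution(names).items():
        print(host, "->", addr if addr else "NOT RESOLVED")
```

Running this on the master and on each node shows which machine cannot 
resolve which name. Any NOT RESOLVED entry points to the hosts file or 
DNS setup on that machine.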

On 7/18/2018 6:07 PM, karima Physique wrote:
> Dear wien2k users:
>
> Using the following machines file:
> lapw0:master:12
> dstart:master:12
> 1:master:12
> 1:node1:12
> 1:node2:12
> ......
> the calculation works very well, but using the following machines file:
> lapw0:master:12 node1:12 node2:12
> dstart:master:12 node1:12 node2:12
> 1:master:12
> 1:node1:12
> 1:node2:12
> .......
> I got the following error:
>
> unable to get host adress calcul.local for (1)
> unable to connect to server calcul.local at port 44295 (chek for 
> firewalls!)
> We note that calcul.local is the host used to connect to w2web.
> I would appreciate any suggestions to solve this problem.