[Wien] configuring parallel options using ssh

Luc Fruchter luc.fruchter at u-psud.fr
Mon Sep 10 20:56:56 CEST 2018


Dear users,

I failed configuring the parallel options to run cases on several 
machines, each of them with several CPUs, driven by ssh protocol.

* Configuring the parallel options with: shared memory, MPI = 0, ssh 
protocol, allows to run parallel jobs using several CPUs on the same 
machine. However, a .machines file with several machines will run using 
all required CPUs on the machine where launched (ignoring hosts).

- Configuring with: no shared memory, MPI = 0, ssh protocol, will run no 
parallel jobs, either on the same or different machines (Below is the 
output for the error in this case).

All machines communicate without problem with ssh and no password, and 
have identical file paths.

Thanks for helping

------------------------------------------------------------------

 >   lapw0  -p	(20:33:36) starting parallel lapw0 at Mon Sep 10 20:33:36 
CEST 2018
-------- .machine0 : processors
running lapw0 in single mode
6.793u 0.073s 0:06.86 100.0%	0+0k 0+5152io 0pf+0w
 >   lapw1  -p    	(20:33:43) starting parallel lapw1 at Mon Sep 10 
20:33:43 CEST 2018
->  starting parallel LAPW1 jobs at Mon Sep 10 20:33:43 CEST 2018
running LAPW1 in parallel mode (using .machines)
1 number_of_parallel_jobs
      localhost(48)    Summary of lapw1para:
    localhost	 k=48	 user=0	 wallclock=0
0.112u 0.158s 0:02.28 11.4%	0+0k 0+224io 0pf+0w
 >   lapw2 -p     	(20:33:45) running LAPW2 in parallel mode
**  LAPW2 crashed!
0.085u 0.062s 0:00.13 107.6%	0+0k 0+872io 0pf+0w
error: command   /root/Documents/WIEN2KROOT/lapw2para lapw2.def   failed

 >   stop error


More information about the Wien mailing list