[Wien] A problem in a parallel execution of WIEN2k_8.1

Peter Blaha pblaha at theochem.tuwien.ac.at
Fri Jan 25 09:32:50 CET 2008


The relevant output is in the following lines (with remotemachine):

...
if ( 0 > 1 ) echo running parallel lapw2
touch .lock_xps011
echo -n xps01
set ttt= ( `echo $mpirun | sed -e "s^_NP_^$number_per_job2[$loop]^" -e "s^_EXEC_^$WIENROOT/${exe}_mpi ${def}_$loop.def $loop^" -e "s^_HOSTS_^.machine$mach[$loop]^"` )
echo mpirun -np _NP_ -machinefile _HOSTS_ _EXEC_
sed -e s^_NP_^4^ -e s^_EXEC_^/usr/local/WIEN2k_08.1/lapw2_mpi lapw2_1.def 1^ -e s^_HOSTS_^.machine1^
if ( 1 == 1 ) then
set remote = rsh
remotemachine: Undefined variable.
else
...

If I compare this with the source of lapw2para_lapw, at first I cannot locate this in the
present version of lapw2para_lapw.

However, the problem is during installation due to siteconfig_lapw. siteconfig_lapw
changes the line:

	         set remotemachine = `head -1 .machine$mach[$loop]`
to
                 set remote = rsh

leading to your problems.
--------------------------------------------------
Fix:

copy the attached  siteconfig_lapw file into $WIENROOT
cd $WIENROOT
cp SRC/lapw2para_lapw $WIENROOT

either:
  edit lapw2para_lapw  and change   set remote = ssh     to  rsh   (only if you are using rsh)
or:
./siteconfig
   configure Parallel execution
       and select again the necessary options (shared memory y/n;  rsh or ssh;  mpi  y/n)
-----------

Since this is quite serious, I've updated the sources on the web (still keeping Version-Nr. WIEN2k_08.1)




> In addition, I did the -xf test. A result, I could get a message in
> STDOUT (attached in this mail).   
> But I don't know what I can do for this message.
> 
>>>>> I have a problem in a parallel execution of lapw2. It looks like lapw0
>>>>> and lapw1 were completed in parallel environment without any problem,
>>>>> but when it proceeds to lapw2, it aborts with an error message:
>>>>> --
>>>>> LAPW2 - FERMI; weighs written
>>>>> remotemachine: Undefined variable.
>>>>> remotemachine: Undefined variable.
>>>>> remotemachine: Undefined variable.
>>>>> remotemachine: Undefined variable.
>>>>> cp: cannot stat `.in.tmp': No such file or directory
>>>>> rm: cannot remove `.in.tmp': No such file or directory
>>>>> rm: cannot remove `.in.tmp1': No such file or directory
>>>>> --
-- 

                                      P.Blaha
--------------------------------------------------------------------------
Peter BLAHA, Inst.f. Materials Chemistry, TU Vienna, A-1060 Vienna
Phone: +43-1-58801-15671             FAX: +43-1-58801-15698
Email: blaha at theochem.tuwien.ac.at    WWW: http://info.tuwien.ac.at/theochem/
--------------------------------------------------------------------------
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: siteconfig_lapw
Url: http://zeus.theochem.tuwien.ac.at/pipermail/wien/attachments/20080125/3a4617cb/siteconfig_lapw.ksh


More information about the Wien mailing list