[Wien] parallel job error

jadhikari@clarku.edu jadhikari at clarku.edu
Mon Feb 26 17:47:25 CET 2007


Dear Wien users,

The calculation seems to stop with "( cd $PWD; $t $exe ${def}_$loop.def;
rm -f .lock_$lockfile[$p] ) >>  ..." as the last line in the dayfile.
Single mode runs fine but with 4 processors it never works.

I understand this error but cannot fix it. Please let me know about this.

Thank you.

Subin


Following is the dayfile for a parallel job-
_______________________________________________________________________
    start       (Mon Feb 26 09:59:35 EST 2007) with lapw0 (80/20 to go)

    cycle 1     (Mon Feb 26 09:59:35 EST 2007)  (80/20 to go)

>   lapw0 -p    (09:59:35) starting parallel lapw0 at Mon Feb 26 09:59:35
EST 2007
--------
running lapw0 in single mode
80.915u 0.224s 1:21.45 99.6%    0+0k 0+0io 5pf+0w
>   lapw1  -p   (10:00:56) starting parallel lapw1 at Mon Feb 26 10:00:56
EST 2007
->  starting parallel LAPW1 jobs at Mon Feb 26 10:00:56 EST 2007
running LAPW1 in parallel mode (using .machines)
4 number_of_parallel_jobs
[1] 20280
[2] 20295
[3] 20310
[4] 20325
[2]    Done                          ( cd $PWD; $t $exe ${def}_$loop.def;
rm -f .lock_$lockfile[$p] ) >>  ...
_____________________________________________________________________


More information about the Wien mailing list