[Wien] parallel job error
jadhikari@clarku.edu
jadhikari at clarku.edu
Mon Feb 26 17:47:25 CET 2007
Dear Wien users,
The calculation seems to stop with "( cd $PWD; $t $exe ${def}_$loop.def;
rm -f .lock_$lockfile[$p] ) >> ..." as the last line in the dayfile.
Single mode runs fine but with 4 processors it never works.
I understand this error but cannot fix it. Please let me know about this.
Thank you.
Subin
Following is the dayfile for a parallel job-
_______________________________________________________________________
start (Mon Feb 26 09:59:35 EST 2007) with lapw0 (80/20 to go)
cycle 1 (Mon Feb 26 09:59:35 EST 2007) (80/20 to go)
> lapw0 -p (09:59:35) starting parallel lapw0 at Mon Feb 26 09:59:35
EST 2007
--------
running lapw0 in single mode
80.915u 0.224s 1:21.45 99.6% 0+0k 0+0io 5pf+0w
> lapw1 -p (10:00:56) starting parallel lapw1 at Mon Feb 26 10:00:56
EST 2007
-> starting parallel LAPW1 jobs at Mon Feb 26 10:00:56 EST 2007
running LAPW1 in parallel mode (using .machines)
4 number_of_parallel_jobs
[1] 20280
[2] 20295
[3] 20310
[4] 20325
[2] Done ( cd $PWD; $t $exe ${def}_$loop.def;
rm -f .lock_$lockfile[$p] ) >> ...
_____________________________________________________________________
More information about the Wien
mailing list