[Wien] lapwso_mpi error
Md. Fhokrul Islam
fislam at hotmail.com
Thu Dec 8 16:56:21 CET 2016
Hi Prof Blaha,
I am trying to run an MPI job in 2 nodes each with 20 cores. But the job crashes
with the following error messages. I have tried with both USE_REMOTE 0 and
USE_REMOTE 1 in parallel_options file but didn't make much of a deference.
Our system administrator told me it is not probably not a hardware issue and
suggested me to contact Wien2k. So could you please let me know if I need to
make any change in MPI setting and recompile Wien2k.
By the way, the same job runs fine if I use only 1 node with 20 cores.
Error message:
case.dayfile
cycle 1 (Thu Dec 8 15:44:06 CET 2016) (100/99 to go)
> lapw0 -p (15:44:06) starting parallel lapw0 at Thu Dec 8 15:44:07 CET 2016
-------- .machine0 : 40 processors
9872.562u 20.276s 8:20.46 1976.7% 0+0k 220752+386840io 332pf+0w
> lapw1 -up -p -c (15:52:27) starting parallel lapw1 at Thu Dec 8 15:52:27 CET 2016
-> starting parallel LAPW1 jobs at Thu Dec 8 15:52:27 CET 2016
running LAPW1 in parallel mode (using .machines)
1 number_of_parallel_jobs
au039 au039 au039 au039 au039 au039 au039 au039 au039 au039 au039 au039 au039 au039 au039 au039 au039 au039 au039 au039 au042 au042 au042 au042 au042 au042 au042 au042 au042 au042 au042 au042 au042 au042 au042 au042 au042 au042 au042 au042(1) --------------------------------------------------------------------------
MPI_ABORT was invoked on rank 8 in communicator MPI_COMM_WORLD
with errorcode -726817712.
Output error file:
LAPW0 END
w2k_dispatch_signal(): received: Terminated
w2k_dispatch_signal(): received: Terminated
forrtl: Interrupted system call
w2k_dispatch_signal(): received: Terminated
w2k_dispatch_signal(): received: Terminated
Thanks,
Fhokrul
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://zeus.theochem.tuwien.ac.at/pipermail/wien/attachments/20161208/3fb22082/attachment.html>
More information about the Wien
mailing list