[Wien] Problems with lapw1_mpi in 64-bit
Roberto Iglesias
roberto.iglesias at psi.ch
Wed Jun 28 10:11:25 CEST 2006
Hello all!
We're experiencing problems when trying to run our Wien2k installation
on a cluster of dual core AMD Opteron processors, with 64-bit PGI 6.1
compilers, MPICH-1.2.6, Scientific Linux 4 (Red Hat 4) and AMD
libraries. The version of the program is Wien2k_06.2.
The build was practically clean:
Compile time errors (if any) were:
SRC_balsac-utils/compile.msg:make: [clean] Error 1 (ignored)
SRC_elast/compile.msg:make: [clean] Error 1 (ignored)
SRC_eosfit/compile.msg:make: [clean] Error 1 (ignored)
SRC_fsgen/compile.msg:make: [clean] Error 1 (ignored)
SRC_optimize/compile.msg:make: [clean] Error 1 (ignored)
SRC_phonon/compile.msg:make: [clean] Error 1 (ignored)
SRC_trig/compile.msg:make: [clean] Error 1 (ignored)
We could build and run a test F90 MPICH code in the same environment.
But when running a test, it crashes in lapw1_mpi in the following
sequence of calls:
run_lapw -p -cc 0.0001
...
/home3/iglesias/w2k_par/x -f TiC lapw1 -p
...
/home3/iglesias/w2k_par/lapw1para lapw1.def
...
mpirun -np 4 -machinefile .machine1 /home3/iglesias/w2k_par/lapw1_mpi
lapw1_1.def
Using 4 processors, My ID = 0
2 - <NO ERROR MESSAGE> : Could not convert index 1140850688 into a pointer
The index may be an incorrect argument.
Possible sources of this problem are a missing "include 'mpif.h'",
a misspelled MPI object (e.g., MPI_COM_WORLD instead of MPI_COMM_WORLD)
or a misspelled user variable for an MPI object (e.g.,
com instead of comm).
p0_10792: p4_error: interrupt SIGx: 13
Killed by signal 2.
I have not been able to find any solution to this in the mailing list,
neither in the FAQ. There were messages about compilation errors with a
64-bit build, but has someone ever suffered similar problems with lapw1_mpi?
Thanks a lot
Roberto Iglesias
More information about the Wien
mailing list