[Wien] Problems with lapw1_mpi in 64-bit
Iglesias Roberto
roberto.iglesias at psi.ch
Wed Jun 28 14:10:32 CEST 2006
Hi!
SCALAPACK comes from the AMD library amdscalapack-3-1-0-pgi-64bit.
RP RP_LIB(SCALAPACK+PBLAS): -L/opt/scalapack/amdscalapack-3.1-pgi-64/pgi64/lib -lscalapack -lblacsCinit -lblacs -lblacsCinit
/opt/acml/acml-3.1-pgi-64/pgi64/lib/libacml.a -L/opt/mpi/mpich-1.2.7-pgi-6.1-opteron/lib -lmpich
FP FPOPT(par.comp.options): -Mfreeform -fast -Kieee
MP MPIRUN commando : mpirun -np _NP_ -machinefile _HOSTS_ _EXEC_
What was your version?
Kind regards
Roberto
________________________________
De: wien-bounces at zeus.theochem.tuwien.ac.at en nombre de Peter Blaha
Enviado el: mié 28/06/2006 10:31
Para: A Mailing list for WIEN2k users
Asunto: Re: [Wien] Problems with lapw1_mpi in 64-bit
Hi,
I run lapw1_mpi in 64bit Xeon machines without problems.
Have you tried a "smaller problem" ? Could be a memory problem.
mpi seems to work (as also your test say, or try lapw0_mpi), but
maybe there is a problem with SCALAPACK/BLACS. Where does your
SCALAPCK come from ??
Regards
> We're experiencing problems when trying to run our Wien2k installation
> on a cluster of dual core AMD Opteron processors, with 64-bit PGI 6.1
> compilers, MPICH-1.2.6, Scientific Linux 4 (Red Hat 4) and AMD
> libraries. The version of the program is Wien2k_06.2.
>
> We could build and run a test F90 MPICH code in the same environment.
>
> But when running a test, it crashes in lapw1_mpi in the following
> sequence of calls:
>
> run_lapw -p -cc 0.0001
> ...
> /home3/iglesias/w2k_par/x -f TiC lapw1 -p
> ...
> /home3/iglesias/w2k_par/lapw1para lapw1.def
> ...
> mpirun -np 4 -machinefile .machine1 /home3/iglesias/w2k_par/lapw1_mpi
> lapw1_1.def
>
> Using 4 processors, My ID = 0
> 2 - <NO ERROR MESSAGE> : Could not convert index 1140850688 into a pointer
> The index may be an incorrect argument.
> Possible sources of this problem are a missing "include 'mpif.h'",
> a misspelled MPI object (e.g., MPI_COM_WORLD instead of MPI_COMM_WORLD)
> or a misspelled user variable for an MPI object (e.g.,
> com instead of comm).
> p0_10792: p4_error: interrupt SIGx: 13
> Killed by signal 2.
>
> I have not been able to find any solution to this in the mailing list,
> neither in the FAQ. There were messages about compilation errors with a
> 64-bit build, but has someone ever suffered similar problems with lapw1_mpi?
>
> Thanks a lot
>
> Roberto Iglesias
> _______________________________________________
> Wien mailing list
> Wien at zeus.theochem.tuwien.ac.at
> http://zeus.theochem.tuwien.ac.at/mailman/listinfo/wien
>
P.Blaha
--------------------------------------------------------------------------
Peter BLAHA, Inst.f. Materials Chemistry, TU Vienna, A-1060 Vienna
Phone: +43-1-58801-15671 FAX: +43-1-58801-15698
Email: blaha at theochem.tuwien.ac.at WWW: http://info.tuwien.ac.at/theochem/
--------------------------------------------------------------------------
_______________________________________________
Wien mailing list
Wien at zeus.theochem.tuwien.ac.at
http://zeus.theochem.tuwien.ac.at/mailman/listinfo/wien
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/ms-tnef
Size: 7089 bytes
Desc: not available
Url : http://zeus.theochem.tuwien.ac.at/pipermail/wien/attachments/20060628/ee6abd91/attachment.bin
More information about the Wien
mailing list