[Wien] Segfault in lapw1_mpi (SL_INIT)

Elias Assmann elias.assmann at gmail.com
Tue Jul 3 10:25:24 CEST 2012


Hello,

When I execute lapw1_mpi, it dies on me immediately:

	$ ./lapw1_mpi
	w2k_dispatch_signal(): received: Segmentation fault
	 Child id           0 SIGSEGV, contact developers
	--------------------------------------------------------------------------
	MPI_ABORT was invoked on rank 0 in communicator MPI_COMM_WORLD
	with errorcode 6.
	
	NOTE: invoking MPI_ABORT causes Open MPI to kill all MPI processes.
	You may or may not see output from other processes, depending on
	exactly when Open MPI kills them.
	--------------------------------------------------------------------------

It turns out that the offending line is the first call to SL_INIT in
INIT_PARALLEL (SRC_lapw1/modules.F),

	        SUBROUTINE INIT_PARALLEL
	          IMPLICIT NONE
	#ifdef Parallel
	          include 'mpif.h'
	          INTEGER :: IERR,i,j
	          call MPI_INIT(IERR)
	          call MPI_COMM_SIZE( MPI_COMM_WORLD, NPE, IERR)
	          call MPI_COMM_RANK( MPI_COMM_WORLD, MYID, IERR)
	          CALL BARRIER
->	          CALL SL_INIT(ICTXTALL, 1, NPE)

which is called eventually via GTFNAM at the top of the main program
LAPW1.

I used ifort version 11.1 (specifically, I tried two revisions: 046
and 072) and the corresponding MKL libraries (including ScaLAPACK).
The MPI version is openmpi-1.3.2-icc, in case that matters.  Neither
lapw0_mpi nor lapw2_mpi have this problem (then again, they do not
seem to use SL_INIT).

Any pointers how I should proceed?

Thanks,

	Elias



More information about the Wien mailing list