[Wien] a parallel error of lapw0 with MBJLDA potential (updated)

wanxiang feng fengwanxiang at gmail.com
Thu Jun 10 05:21:01 CEST 2010


Honorable Professor Blaha,

I calculate GaAs and Ge with MBJLDA potential following the steps in
section 4.5.8 of userguide, but I have in trouble with the parallel of
lapw0.
First, the bandgap is different with or without the lapw0 parallel.

bandgap of Ge is:
0.85eV (without lapw0 parallel)
0.71eV (with lapw0 parallel)

bandgap of GaAs is:
1.61eV (without lapw0 parallel)
???  (with lapw0 parallel)


Second, there is an error when I calculate GaAs using lapw0 parallel,
the error files are:

=========== GaAs.dayfile
===========================================================================

    start       (Thu Jun 10 00:03:22 CST 2010) with lapw0 (40/99 to go)

    cycle 1     (Thu Jun 10 00:03:22 CST 2010)  (40/99 to go)

>   lapw0 -grr -p       (00:03:22) starting parallel lapw0 at Thu Jun 10 00:03:23 CST 2010
-------- .machine0 : 8 processors
1.522u 0.702s 0:07.17 30.9%     0+0k 0+0io 0pf+0w
>   lapw0 -p    (00:03:30) starting parallel lapw0 at Thu Jun 10 00:03:30 CST 2010
-------- .machine0 : 8 processors
rm_l_1_8923: (1.867188) net_send: could not write to fd=6, errno = 9
rm_l_1_8923:  p4_error: net_send write: -1
p4_6838:  p4_error: net_recv read:  probable EOF on socket: 1
rm_l_4_6885: (1.066406) net_send: could not write to fd=5, errno = 32
p5_6889:  p4_error: net_recv read:  probable EOF on socket: 1
rm_l_5_6936: (0.804688) net_send: could not write to fd=5, errno = 32
p7_6991:  p4_error: net_recv read:  probable EOF on socket: 1
rm_l_7_7038: (0.281250) net_send: could not write to fd=5, errno = 32
p6_6940:  p4_error: net_recv read:  probable EOF on socket: 1
rm_l_6_6987: (0.546875) net_send: could not write to fd=5, errno = 32
p2_8929:  p4_error: net_recv read:  probable EOF on socket: 1
rm_l_2_8977: (1.597656) net_send: could not write to fd=5, errno = 32
p3_8983:  p4_error: net_recv read:  probable EOF on socket: 1
rm_l_3_9030: (1.332031) net_send: could not write to fd=5, errno = 32
**  lapw0 crashed!
0.214u 0.256s 0:03.69 12.4%     0+0k 0+0io 0pf+0w
error: command   /home/wxfeng/apps/WIEN2k_10.1/lapw0para -c lapw0.def   failed

>   stop error


======== lapw0.error
=================================================================================

**  Error in Parallel lapw0
**  lapw0 STOPPED at Thu Jun 10 00:03:33 CST 2010
**  check ERROR FILES!


========= standard output
=============================================================================

 LAPW0 END
 LAPW0 END
 LAPW0 END
 LAPW0 END
 LAPW0 END
 LAPW0 END
 LAPW0 END
 LAPW0 END
forrtl: severe (104): incorrect POSITION= specifier value for
connected file, unit 11, file /pub/wxfeng/WIEN2k/GaAs/GaAs.r2v
Image              PC                Routine            Line
Source
lapw0_mpi          00000000005A6981  Unknown               Unknown  Unknown
lapw0_mpi          00000000005A5955  Unknown               Unknown  Unknown
lapw0_mpi          0000000000555BFA  Unknown               Unknown  Unknown
lapw0_mpi          00000000005179E5  Unknown               Unknown  Unknown
lapw0_mpi          00000000005170D2  Unknown               Unknown  Unknown
lapw0_mpi          0000000000524840  Unknown               Unknown  Unknown
lapw0_mpi          000000000043AEB4  MAIN__                   1636  lapw0.F
lapw0_mpi          0000000000405B3C  Unknown               Unknown  Unknown
libc.so.6          0000002A9707D3FB  Unknown               Unknown  Unknown
lapw0_mpi          0000000000405A6A  Unknown               Unknown  Unknown
    p4_error: latest msg from perror: Bad file descriptor
cat: No match.

>   stop error
===================================================================================================================




My input files are:

======== GaAs.in0 ===================================================

TOT   28    (5...CA-LDA, 13...PBE-GGA, 11...WC-GGA)
R2V      IFFT      (R2V)
  48  48  48    2.00    min IFFT-parameters, enhancement factor


======== GaAs.in0_grr ===============================================

TOT   50    (5...CA-LDA, 13...PBE-GGA, 11...WC-GGA)
R2V      IFFT      (R2V)
  48  48  48    2.00    min IFFT-parameters, enhancement factor


======== .machines ==================================================

#mpi-para for lapw0, kpoint-para for others.
lapw0:alpha1:4 alpha2:4
1:alpha1:1
1:alpha1:1
1:alpha1:1
1:alpha1:1
1:alpha2:1
1:alpha2:1
1:alpha2:1
1:alpha2:1



My complier, mathlib and make options are:

cc
ifort  (intel 11.0,include mkl)
mpif90 (mpich-1.2.7)
fftw2.1.5

current:FOPT:-FR -mp1 -w -prec_div -pc80 -pad -align -DINTEL_VML -traceback
current:FPOPT:$(FOPT)
current:LDFLAGS:$(FOPT)
-L/home/wxfeng/intel/Compiler/11.0/069/mkl/lib/em64t -pthread
-i-static
current:DPARALLEL:'-DParallel'
current:R_LIBS:-lmkl_intel_lp64 -lmkl_sequential -lmkl_core -lguide
current:RP_LIBS:-lmkl_scalapack_lp64 -lmkl_blacs_intelmpi_lp64
-L/home/wxfeng/apps/fftw2.1.5-mpich/lib -lfftw_mpi -lfftw $(R_LIBS)
current:MPIRUN:mpirun -np _NP_ -machinefile _HOSTS_ _EXEC_



What cause these problem, and how to handle? Thanks for your help!

Feng.


More information about the Wien mailing list