[Wien] forrtl: severe (41): insufficient virtual memory (file attached!!)

Laurence Marks L-marks at northwestern.edu
Sun Apr 15 03:15:14 CEST 2012


It is exactly what it says. You are trying to run more tasks on a single
cpu than you have memory for. The idea of mpi is to share cpu and memory.
If you have a cpu with 24 cores (unlikely) you might run (for instance) 3
tasks each using 8 cores, e.g. with three lines of node:8.

You probably only have 8 cores, so for a large job you might use node:8

Please do a little google searching on the principles of mpi, much better
than any email response.

---------------------------
Professor Laurence Marks
Department of Materials Science and Engineering
Northwestern University
www.numis.northwestern.edu 1-847-491-3996
"Research is to see what everybody else has seen, and to think what nobody
else has thought"
Albert Szent-Gyorgi
 On Apr 14, 2012 7:49 PM, "hyunjung kim" <angpangmokjang at hanmail.net> wrote:

> Dear all,
>
> (I'm sorry, I forgot to attach file which including error message and job
> script files)
>
> I constantly got following error messages when the parallel job was
> submitted.
>
> I attach it.
> Also the generated .machines file is attached, please check whether it is
> properly generated or not. I intended to do 24 k-point parallelized job.
>
> The compiler version is
> fortran : ifort, 12.0 (2011.3.174), mpif90 [ I got same error message
> within ifort 11.1 version, so I guess that fortran version is not the
> origin of this problem..]
> openmpi : 1.4.5
> FFTW2   : 2.1.5
> CC      : icc, 12.0 (2011.3.174)
> compiler option
>  O   Compiler options:        -FR -mp1 -w -prec_div -pc80 -pad -ip
> -DINTEL_VML -mcmodel=medium -i-dynamic -traceback -I$(MKLROOT)/include
>  L   Linker Flags:            $(FOPT) -L$(MKLROOT)/lib/$(MKL_TARGET_ARCH)
> -pthread
>  P   Preprocessor flags       '-DParallel'
>  R   R_LIB (LAPACK+BLAS):     -lmkl_lapack95_lp64 -lmkl_intel_lp64
> -lmkl_intel_thread -lmkl_core -openmp -lpthread
>
>  RP  RP_LIB(SCALAPACK+PBLAS): -lmkl_scalapack_lp64 -lmkl_solver_lp64
> -lmkl_blacs_lp64 -L$(FFTWPATH)/lib -lfftw_mpi -lfftw $(R_LIBS)
>  FP  FPOPT(par.comp.options): -FR -mp1 -w -prec_div -pc80 -pad -ip
> -DINTEL_VML -mcmodel=medium -i-dynamic -traceback -I$(MKLROOT)/include
>  MP  MPIRUN commando        : mpirun -mca btl self,openib -mca
> plm_rsh_num_concurrent 400 -mca oob_tcp_listen_mode listen_thread -mca
> plm_rsh_tree_spawn 1 -np _NP_ -machinefile _HOSTS_ _EXEC_
>
>
> The error messages is:
> ~~~~~~~~~~ abbreviation ~~~~~
>  LAPW0 END
>  LAPW0 END
>  LAPW0 END
>  LAPW0 END
>  LAPW0 END
>  LAPW0 END
>  LAPW0 END
>  LAPW0 END
>  LAPW0 END
>  LAPW0 END
>  LAPW0 END
>  LAPW0 END
>  LAPW1 END
>  LAPW1 END
>  LAPW1 END
>  LAPW1 END
>  LAPW1 END
>  LAPW1 END
>  LAPW1 END
>  LAPW1 END
>  LAPW1 END
>  LAPW1 END
>  LAPW1 END
>  LAPW1 END
>  LAPW1 END
>  LAPW1 END
>  LAPW1 END
>  LAPW1 END
>  LAPW1 END
>  LAPW1 END
>  LAPW1 END
>  LAPW1 END
>  LAPW1 END
>  LAPW1 END
>  LAPW1 END
>  LAPW1 END
> forrtl: severe (41): insufficient virtual memory
> Image              PC                Routine            Line        Source
> libintlc.so.5      00002B0540E88F7A  Unknown               Unknown  Unknown
> libintlc.so.5      00002B0540E87AF5  Unknown               Unknown  Unknown
> libifcoremt.so.5   00002B0540058CF2  Unknown               Unknown  Unknown
> libifcoremt.so.5   00002B053FFCAAAB  Unknown               Unknown  Unknown
> libifcoremt.so.5   00002B054001AFBA  Unknown               Unknown  Unknown
> libifcoremt.so.5   00002B054001AE11  Unknown               Unknown  Unknown
> lapwso             00000000004281C0  MAIN__                    131
>  lapwso.f
> lapwso             0000000000402A9C  Unknown               Unknown  Unknown
> libc.so.6          0000003CFA61D974  Unknown               Unknown  Unknown
> lapwso             00000000004029A9  Unknown               Unknown  Unknown
> forrtl: severe (41): insufficient virtual memory
> Image              PC                Routine            Line        Source
> libintlc.so.5      00002B5D32256F7A  Unknown               Unknown  Unknown
> libintlc.so.5      00002B5D32255AF5  Unknown               Unknown  Unknown
> libifcoremt.so.5   00002B5D31426CF2  Unknown               Unknown  Unknown
> libifcoremt.so.5   00002B5D31398AAB  Unknown               Unknown  Unknown
> libifcoremt.so.5   00002B5D313E8FBA  Unknown               Unknown  Unknown
> libifcoremt.so.5   00002B5D313E8E11  Unknown               Unknown  Unknown
> lapwso             0000000000409A6A  hmsout_mp_init_hm          78
>  modules.f
> lapwso             00000000004280E2  MAIN__                    130
>  lapwso.f
> lapwso             0000000000402A9C  Unknown               Unknown  Unknown
> libc.so.6          0000003CFA61D974  Unknown               Unknown  Unknown
> ~~~~~~ abbreviation ~~~~~~
>
> I note that the compilation was done without any error messages.
>
> Any advice will be greatly appreciated!
>
> ------------------------------------------------------------------------
> Hyun-Jung Kim (Ph.D student) | phone : ++82 10 7335 7889
> Department of Physics |
> Hanyang University | e-mail: angpangmokjang at h <hyunjung at fhi-berlin.mpg.de>
> anmail.net
> 17 Haengdang-Dong |
> 133-791 Seongdong-Ku,Seoul/Korea|
> ------------------------------------------------------------------------
>     www: http://physics.hanyang.ac.kr/~sst/
> ------------------------------------------------------------------------
>
>
>
>
>
>
>
>
>
>
>
> _______________________________________________
> Wien mailing list
> Wien at zeus.theochem.tuwien.ac.at
> http://zeus.theochem.tuwien.ac.at/mailman/listinfo/wien
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://zeus.theochem.tuwien.ac.at/pipermail/wien/attachments/20120414/4f4e48c4/attachment.htm>


More information about the Wien mailing list