<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
</head>
<body text="#000000" bgcolor="#FFFFFF">
<p>The "ssh cn308 ldd $WIENROOT/lapw0_mpi" is finding files for your
ifort installation like
/THFS/opt/intel/composer_xe_2013_sp1.3.174/mkl/lib/intel64/libmkl_scalapack_lp64.so
just fine. So your environmental variables seem to be setup and
working fine on both nodes. It looks like the
/opt/intel/impi/5.0.2.044/intel64/lib/libmpifort.so.12 exists on
the renwei node but not on the cn308 node. It looks to me that
Intel MPI (impi) is not installed on the cn308 node.</p>
<p>Perhaps the cn308 node is using a different partition or
different shared drive. I have read that there are different
possible solutions for the slurm cluster problem you seem to have
which depend on how it is configured [
<a class="moz-txt-link-freetext" href="https://lists.schedmd.com/pipermail/slurm-users/2017-December/000272.html">https://lists.schedmd.com/pipermail/slurm-users/2017-December/000272.html</a>
]. You might be able to check which partition the renwei node and
cn308 node are using with sinfo [
<a class="moz-txt-link-freetext" href="https://slurm.schedmd.com/sinfo.html">https://slurm.schedmd.com/sinfo.html</a> ].<br>
</p>
<p>Maybe you just have to have your cluster manager (administrator,
help desk, ...) install impi like what you did for ifort. To
remove the "manpath: command not found", the cluster manager
probably just has to install the man or man-db package on the
cn308 node (they should be able to check the documentation or
forums for the OS that their cluster is using on how to install
manpath, typically for example: yum install man or apt-get install
man-db). I have never performed administration functions of a
slurm cluster, so for additional help with your problem you may
have to ask a slurm expert (e.g., your cluster manager or the
slurm mailing list [ <a class="moz-txt-link-freetext" href="https://slurm.schedmd.com/mail.html">https://slurm.schedmd.com/mail.html</a> ]).<br>
</p>
<div class="moz-cite-prefix">On 6/16/2018 4:28 AM, venkatesh
chandragiri wrote:<br>
</div>
<blockquote type="cite"
cite="mid:CACS+pXMiyHc0-cEtuPOrDPA-rd9gMqJG7q4mLnTQb7GejKn54Q@mail.gmail.com">
<div dir="ltr">
<div><br>
</div>
<div>Dear Prof. Marks,</div>
<div><br>
</div>
<div>I did "ssh othernode ldd $WIENROOT/lapw0_mpi".</div>
<div><br>
</div>
<div>=========</div>
<div><br>
</div>
<div>[renwei@ln3 ~]$ ssh cn308 ldd $WIENROOT/lapw0_mpi<br>
/THFS/opt/intel/composer_xe_2013_sp1.3.174/mkl/bin/mklvars.sh:
line 118: manpath: command not found<br>
linux-vdso.so.1 => (0x00007fffd8fff000)<br>
libfftw3_mpi.so.3 =>
/THFS/home/renwei/venky/soft/fftw/lib/libfftw3_mpi.so.3
(0x00007fd41621d000)<br>
libmkl_scalapack_lp64.so =>
/THFS/opt/intel/composer_xe_2013_sp1.3.174/mkl/lib/intel64/libmkl_scalapack_lp64.so
(0x00007fd415947000)<br>
libmkl_blacs_intelmpi_lp64.so =>
/THFS/opt/intel/composer_xe_2013_sp1.3.174/mkl/lib/intel64/libmkl_blacs_intelmpi_lp64.so
(0x00007fd41570a000)<br>
libfftw3.so.3 =>
/THFS/home/renwei/venky/soft/fftw/lib/libfftw3.so.3
(0x00007fd4153fe000)<br>
libmkl_intel_lp64.so =>
/THFS/opt/intel/composer_xe_2013_sp1.3.174/mkl/lib/intel64/libmkl_intel_lp64.so
(0x00007fd414cb0000)<br>
libmkl_intel_thread.so =>
/THFS/opt/intel/composer_xe_2013_sp1.3.174/mkl/lib/intel64/libmkl_intel_thread.so
(0x00007fd413c90000)<br>
libmkl_core.so =>
/THFS/opt/intel/composer_xe_2013_sp1.3.174/mkl/lib/intel64/libmkl_core.so
(0x00007fd41259c000)<br>
libpthread.so.0 => /lib64/libpthread.so.0
(0x00007fd412380000)<br>
<strong><u> libmpifort.so.12 => not found<br>
libmpi.so.12 => not found</u></strong><br>
libdl.so.2 => /lib64/libdl.so.2
(0x00007fd412172000)<br>
librt.so.1 => /lib64/librt.so.1
(0x00007fd411f69000)<br>
libm.so.6 => /lib64/libm.so.6 (0x00007fd411ce5000)<br>
libiomp5.so =>
/opt/intel/composer_xe_2013_sp1.3.174/compiler/lib/intel64/libiomp5.so
(0x00007fd4119ca000)<br>
libc.so.6 => /lib64/libc.so.6 (0x00007fd411628000)<br>
libgcc_s.so.1 =>
/THFS/home/sh-hzw2/software/Matlab2014a//sys/os/glnxa64/libgcc_s.so.1
(0x00007fd411413000)<br>
libimf.so =>
/opt/intel/composer_xe_2013_sp1.3.174/compiler/lib/intel64/libimf.so
(0x00007fd410f50000)<br>
libsvml.so =>
/opt/intel/composer_xe_2013_sp1.3.174/compiler/lib/intel64/libsvml.so
(0x00007fd410354000)<br>
libirng.so =>
/opt/intel/composer_xe_2013_sp1.3.174/compiler/lib/intel64/libirng.so
(0x00007fd41014d000)<br>
libintlc.so.5 =>
/opt/intel/composer_xe_2013_sp1.3.174/compiler/lib/intel64/libintlc.so.5
(0x00007fd40fef7000)<br>
/lib64/ld-linux-x86-64.so.2 (0x00007fd416436000)<br>
<br>
</div>
<div>=========</div>
<div><br>
</div>
<div>As it is shown here <strong><u> libmpifort.so.12
=> not found, libmpi.so.12 => not found when
I run in cn308 node</u></strong></div>
<div><br>
</div>
<div>But these have well defined paths when run ldd at "renwei"</div>
<div><br>
</div>
<div> libmpifort.so.12 => /opt/intel/impi/<a
href="http://5.0.2.044/intel64/lib/libmpifort.so.12"
moz-do-not-send="true">5.0.2.044/intel64/lib/libmpifort.so.12</a>
(0x00002b3a37c98000)<br>
libmpi.so.12 => /opt/intel/impi/<a
href="http://5.0.2.044/intel64/lib/libmpi.so.12"
moz-do-not-send="true">5.0.2.044/intel64/lib/libmpi.so.12</a>
(0x00002b3a37f21000)<br>
<br>
</div>
<div>===================<br>
</div>
<div><br>
</div>
<div>[renwei@ln3 ~]$ ssh cn308 $WIENROOT/lapw0_mpi<br>
/THFS/opt/intel/composer_xe_2013_sp1.3.174/mkl/bin/mklvars.sh:
line 118: manpath: command not found<br>
/THFS/home/renwei/venky/soft/wien2k/lapw0_mpi: error while
loading shared libraries: libmpifort.so.12: cannot open shared
object file: No such file or directory<br>
[renwei@ln3 ~]$</div>
<div><br>
</div>
<div><br>
</div>
<div>===============</div>
<div><br>
</div>
<div>[renwei@ln3 ~]$ ssh cn308<br>
Last login: Sat Jun 16 17:59:04 2018 from ln3-gn0<br>
-bash: manpath: command not found<br>
[renwei@cn308 ~]$ $WIENROOT/lapw0_mpi<br>
/THFS/home/renwei/venky/soft/wien2k/lapw0_mpi: error while
loading shared libraries: libmpifort.so.12: cannot open shared
object file: No such file or directory<br>
</div>
<p><strong><br>
</strong></p>
<div><strong>========================</strong></div>
<div><br>
</div>
<div>You also mentioned to use " use static compilation". I
don't understand this. do you meant to be static compilation
of wien2k..? how I can do it (I am sorry to ask this, as I
belongs to experimental background I don't come across these
kind of issues). </div>
<div><br>
</div>
<div><br>
</div>
<div>thank you.</div>
<div><br>
</div>
<div>venkatesh<br>
</div>
</div>
</blockquote>
</body>
</html>