<div dir="ltr"><div><div><div><div><div>Dear Professor Blaha,<br><br></div> Thank you !<br></div> My .machines file is 0K.<br></div> I will ask the administrator to follow your other suggestions (users do not have privileges).<br></div> All the best,<br></div> Luis<br><br></div><div class="gmail_extra"><br><div class="gmail_quote">2015-09-21 10:22 GMT-03:00 Peter Blaha <span dir="ltr"><<a href="mailto:pblaha@theochem.tuwien.ac.at" target="_blank">pblaha@theochem.tuwien.ac.at</a>></span>:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">a) Check your .machines file. DFoes it meet your expectations, or has this node too large load.<br>
<br>
b) Can you interactively login into these nodes while your job is running ?<br>
If yes, login on 2 nodes (in two windows) and run top<br>
<br>
c) If nothing obvious is wrong so far, test the network by doing some bigger copying from/to these nodes from your $home (or $scratch) to see if file-io is killing you.<span class=""><br>
<br>
<br>
On 09/21/2015 02:51 PM, Luis Ogando wrote:<br>
</span><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><span class="">
Dear Prof. Marks,<br>
<br>
Many thanks for your help.<br>
The administrators said that everything is 0K, the software is the<br>
problem (the easy answer) : no zombies, no other jobs in the node, ... !!<br>
Let me give you more information to see if you can imagine other<br>
possibilities:<br>
<br>
1) Intel Xeon Six Core 5680, 3.33GHz<br>
<br>
2) Intel(R) Fortran/CC/OpenMPI Intel(R) 64 Compiler XE for applications<br>
running on Intel(R) 64, Version 12.1.1.256 Build 20111011<br>
<br>
3) OpenMPI 1.6.5<br>
<br>
4) PBS Pro 11.0.2<br>
<br>
5) OpenMPI built using --with-tm due to prohibited ssh among nodes (<br>
<a href="http://www.open-mpi.org/faq/?category=building#build-rte-tm" rel="noreferrer" target="_blank">http://www.open-mpi.org/faq/?category=building#build-rte-tm</a> )<br>
<br>
6) Wien2k 14.2<br>
<br>
7) The mystery : two weeks ago, everything was working properly !!<br>
<br>
Many thanks again !<br>
All the best,<br>
Luis<br>
<br>
2015-09-18 23:24 GMT-03:00 Laurence Marks <<a href="mailto:laurence.marks@gmail.com" target="_blank">laurence.marks@gmail.com</a><br></span>
<mailto:<a href="mailto:laurence.marks@gmail.com" target="_blank">laurence.marks@gmail.com</a>>>:<span class=""><br>
<br>
Almost certainly one or more of:<br>
* Other jobs on the node<br>
* Zombie process(es)<br>
* Too many mpi<br>
* Bad memory<br>
* Full disc<br>
* Too hot<br>
<br>
If you have it use ganglia, if not ssh in and use top/ps or whatever<br>
SGI has. If you cannot sudo get help from someone who can.<br>
<br>
On Sep 18, 2015 8:58 PM, "Luis Ogando" <<a href="mailto:lcodacal@gmail.com" target="_blank">lcodacal@gmail.com</a><br></span><span class="">
<mailto:<a href="mailto:lcodacal@gmail.com" target="_blank">lcodacal@gmail.com</a>>> wrote:<br>
<br>
Dear Wien2k community,<br>
<br>
I am using Wien2k in a SGI cluster with 32 nodes. My<br>
calculation is running in 4 nodes that have the same<br>
characteristics and only my job is running in these 4 nodes.<br>
I noticed that one of these 4 nodes is spending more than 20<br>
times the time spent by the other 3 nodes in the run_lapw execution.<br>
Could someone imagine a reason for this ? Any advice ?<br>
All the best,<br>
Luis<br>
<br>
<br>
_______________________________________________<br>
Wien mailing list<br></span>
<a href="mailto:Wien@zeus.theochem.tuwien.ac.at" target="_blank">Wien@zeus.theochem.tuwien.ac.at</a> <mailto:<a href="mailto:Wien@zeus.theochem.tuwien.ac.at" target="_blank">Wien@zeus.theochem.tuwien.ac.at</a>><span class=""><br>
<a href="http://zeus.theochem.tuwien.ac.at/mailman/listinfo/wien" rel="noreferrer" target="_blank">http://zeus.theochem.tuwien.ac.at/mailman/listinfo/wien</a><br>
SEARCH the MAILING-LIST at:<br>
<a href="http://www.mail-archive.com/wien@zeus.theochem.tuwien.ac.at/index.html" rel="noreferrer" target="_blank">http://www.mail-archive.com/wien@zeus.theochem.tuwien.ac.at/index.html</a><br>
<br>
<br>
<br>
<br>
_______________________________________________<br>
Wien mailing list<br>
<a href="mailto:Wien@zeus.theochem.tuwien.ac.at" target="_blank">Wien@zeus.theochem.tuwien.ac.at</a><br>
<a href="http://zeus.theochem.tuwien.ac.at/mailman/listinfo/wien" rel="noreferrer" target="_blank">http://zeus.theochem.tuwien.ac.at/mailman/listinfo/wien</a><br>
SEARCH the MAILING-LIST at: <a href="http://www.mail-archive.com/wien@zeus.theochem.tuwien.ac.at/index.html" rel="noreferrer" target="_blank">http://www.mail-archive.com/wien@zeus.theochem.tuwien.ac.at/index.html</a><br>
<br>
</span></blockquote><span class="HOEnZb"><font color="#888888">
<br>
-- <br>
<br>
P.Blaha<br>
--------------------------------------------------------------------------<br>
Peter BLAHA, Inst.f. Materials Chemistry, TU Vienna, A-1060 Vienna<br>
Phone: <a href="tel:%2B43-1-58801-165300" value="+43158801165300" target="_blank">+43-1-58801-165300</a> FAX: <a href="tel:%2B43-1-58801-165982" value="+43158801165982" target="_blank">+43-1-58801-165982</a><br>
Email: <a href="mailto:blaha@theochem.tuwien.ac.at" target="_blank">blaha@theochem.tuwien.ac.at</a> WIEN2k: <a href="http://www.wien2k.at" rel="noreferrer" target="_blank">http://www.wien2k.at</a><br>
WWW: <a href="http://www.imc.tuwien.ac.at/staff/tc_group_e.php" rel="noreferrer" target="_blank">http://www.imc.tuwien.ac.at/staff/tc_group_e.php</a><br>
--------------------------------------------------------------------------</font></span><div class="HOEnZb"><div class="h5"><br>
_______________________________________________<br>
Wien mailing list<br>
<a href="mailto:Wien@zeus.theochem.tuwien.ac.at" target="_blank">Wien@zeus.theochem.tuwien.ac.at</a><br>
<a href="http://zeus.theochem.tuwien.ac.at/mailman/listinfo/wien" rel="noreferrer" target="_blank">http://zeus.theochem.tuwien.ac.at/mailman/listinfo/wien</a><br>
SEARCH the MAILING-LIST at: <a href="http://www.mail-archive.com/wien@zeus.theochem.tuwien.ac.at/index.html" rel="noreferrer" target="_blank">http://www.mail-archive.com/wien@zeus.theochem.tuwien.ac.at/index.html</a><br>
</div></div></blockquote></div><br></div>