[Wien] Why is "sleep $delay" commented out in lapw1para_lapw?
David Olmsted
olmsted at berkeley.edu
Mon Apr 6 21:37:09 CEST 2015
Laurence,
Thank you for the response. As I mentioned in my first try at this issue last week, I have put the "sleep $delay" back in, and it does seem to have helped. I think I have fewer failures when the lapw1 processes are being started, but I still have some, so I am not certain. I am still working with the cluster's consultants on this.
Nonetheless, the other scripts do have "sleep $delay", and at the top of lapw1para_lapw it does say
#In this section use 0 to turn of an option, 1 to turn it on,
#respectively choose a value
set useremote = 1 # using remote shell to launch processes
set mpiremote = 1 # using remote shell to launch mpi
set delay = 0.1 # delay launching of processes by n seconds
set sleepy = 1.0 # additional sleep before checking
set debug = 0 # verbosity of debugging output
set taskset0
set taskset=no
Given those two things, it seems to me that it would be more appropriate for the delay to actually exist in lapw1para_lapw. But not my call.
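For illustration only, the staggered launch that "sleep $delay" is meant to provide can be sketched as below. This is a POSIX sh sketch, not the csh code in lapw1para_lapw; the function name launch_staggered and the echo commands are hypothetical stand-ins for the real remote launches, and a fractional delay such as 0.1 assumes a sleep that accepts fractions (GNU coreutils does).

```shell
#!/bin/sh
# Illustrative sketch only; NOT the WIEN2k lapw1para_lapw code (which is csh).
# launch_staggered DELAY CMD...
#   starts each CMD in the background, pausing DELAY seconds after each
#   launch, then waits until all of them have finished.
launch_staggered() {
    delay=$1
    shift
    for cmd in "$@"; do
        sh -c "$cmd" &     # hypothetical stand-in for a remote lapw1 launch
        sleep "$delay"     # the pause that is commented out in lapw1para_lapw
    done
    wait                   # block until every background job has exited
}

# Example: three dummy jobs started 0.1 s apart.
launch_staggered 0.1 "echo job 1" "echo job 2" "echo job 3"
```

With the sleep removed, all jobs start essentially at once, which is the situation in which a heavily loaded cluster may refuse or drop some of the connections.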
Thank you for your help.
Cheers,
David
-----Original Message-----
From: wien-bounces at zeus.theochem.tuwien.ac.at [mailto:wien-bounces at zeus.theochem.tuwien.ac.at] On Behalf Of Laurence Marks
Sent: Monday, April 06, 2015 12:14 PM
To: A Mailing list for WIEN2k users
Subject: Re: [Wien] Why is "sleep $delay" commented out in lapw1para_lapw?
Dear David,
I think the answer to your question "why" is "because".
Often for things like this it is some combination of "seat of the pants" gut instinct and KISS. I am not certain why I used 0.25 in my version, and I think I have recently reduced it to 0.1. I will admit that I never tested in great detail whether 0.25 was better or worse; it really will depend heavily upon the cluster.
Similarly I suspect the delay between launching ssh was probably removed as it did not seem to matter. My suggestion would be to put it back and see if it helps.
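A minimal sketch of "putting it back", assuming the line in question looks like "# sleep $delay" with optional leading whitespace (the exact spacing in the installed script is an assumption, and uncomment_sleep is a hypothetical helper name). It filters stdin to stdout, so it can be tried on a copy of the script before anything is overwritten.

```shell
#!/bin/sh
# Sketch only: re-enable a commented-out "sleep $delay" line.
# Reads text on stdin, writes the result on stdout; other lines pass through.
uncomment_sleep() {
    # \1 keeps the original indentation, \2 is the literal "sleep $delay"
    sed 's/^\([[:space:]]*\)#[[:space:]]*\(sleep \$delay\)/\1\2/'
}

# Example on a single sample line (not the real script):
printf '%s\n' '    # sleep $delay' | uncomment_sleep
# prints "    sleep $delay"
```

On a real installation one would run it as, for example, uncomment_sleep < lapw1para_lapw > lapw1para_lapw.new and compare the two files with diff before replacing the original.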
I agree that it would be better to have this (and various other things) set in parallel_options.
Not the clearest answer, sorry.
On Mon, Apr 6, 2015 at 11:32 AM, David Olmsted <olmsted at berkeley.edu> wrote:
> Hi,
>
> There has been no response to my suggestion that in lapw1para_lapw, the
> line “# sleep $delay” be changed to “sleep $delay”. Perhaps I have not
> given enough information. In the userguide there is no mention of “delay”.
> In the archive I find nothing explaining why the line is commented
> out. (Or even explaining that it is commented out.) In
> lapw2para_lapw, for example, the “sleep $delay” line is actually in
> use, rather than commented out. The same is true in some of the other
> scripts. Why the difference in lapw1para_lapw?
>
> I am using version 14.2 on a large linux cluster with TORQUE. I was using
> a revised version of a parallel_options file from a post by Laurence Marks
> which included “set delay = 0.25”, and was surprised to discover this did
> not actually take effect in lapw1para_lapw.
>
> Thanks,
>
> David
>
> David Olmsted
> Assistant Research Engineer
> Materials Science and Engineering
> 210 Hearst Memorial Mining Building
> University of California
> Berkeley, CA 94720-1760
>
--
Professor Laurence Marks
Department of Materials Science and Engineering
Northwestern University
www.numis.northwestern.edu
Corrosion in 4D: MURI4D.numis.northwestern.edu
Co-Editor, Acta Cryst A
"Research is to see what everybody else has seen, and to think what nobody else has thought"
Albert Szent-Györgyi
_______________________________________________
Wien mailing list
Wien at zeus.theochem.tuwien.ac.at
http://zeus.theochem.tuwien.ac.at/mailman/listinfo/wien
SEARCH the MAILING-LIST at: http://www.mail-archive.com/wien@zeus.theochem.tuwien.ac.at/index.html