Well, after bugging the poor sysadmin, we came up with a solution which, while not very advisable, works as a patch for the moment: opening up the write permissions of my home directory on the AFS system. At least now I can run my job on 45 processors :)<div>
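A minimal sketch of how the symptom could be checked from a node, assuming a shell with mktemp available: the hypothetical helper can_write_dir (not part of WIEN2k) tries to create and remove a scratch file in the case directory, which is roughly what lapw1para needs in order to write its *.error files. On AFS, access is decided by directory ACLs and the AFS token held by the calling process rather than by chmod mode bits alone, which may be why a shell started through ssh or the queue system is refused where an interactive login succeeds.<br>

```shell
# Hypothetical helper, not part of WIEN2k.
# Return 0 if a file can be created and removed in directory $1, else 1.
can_write_dir() {
    dir=$1
    # mktemp fails if the directory is missing, not searchable, or not writable
    probe=$(mktemp "$dir/.write_probe.XXXXXX" 2>/dev/null) || return 1
    rm -f "$probe"
}
```

Since the remote shell here is /bin/bash, the function could be shipped to a node from a bash session with something like ssh node01 "$(declare -f can_write_dir); can_write_dir /path/to/case", where node01 and /path/to/case are placeholders.<br>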
<br></div><div>Cheers,</div><div><br></div><div>Marcos<br><br><div class="gmail_quote">2010/7/27 Marcos Veríssimo Alves <span dir="ltr"><<a href="mailto:marcos.verissimo.alves@gmail.com">marcos.verissimo.alves@gmail.com</a>></span><br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex;">Worst of all is that the disks are correctly mounted, and from the command line I can do things like ls, and even create and remove files. Only from within lapw1para do I get an error. I am starting to insert calls to Unix utilities such as whoami to see what weird thing is going on there...<div>
<br></div><div>Thanks all for the suggestions. If I track this bug down I'll let you know.</div><div><br></div><div>Cheers,</div><div><br></div><div><font color="#888888">Marcos</font><div><div></div><div class="h5">
<br><br><div class="gmail_quote">On Tue, Jul 27, 2010 at 5:26 PM, Laurence Marks <span dir="ltr"><<a href="mailto:L-marks@northwestern.edu" target="_blank">L-marks@northwestern.edu</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">It is a system problem. Maybe the relevant disc is not mounted on the<br>
remote node or something? Try doing a simple ssh to the node and test<br>
things like ls, cd etc. Too many possibilities to list here. Good<br>
luck, just try computer experiments until you track it down.....<br>
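The checks suggested above could be scripted along these lines. This is only a sketch: the function names machines_hosts and probe_nodes are made up for illustration, it assumes k-point lines in .machines have the form weight:hostname (for example 1:node01), and it assumes passwordless ssh to every node is already working.<br>

```shell
# List the hosts named in a WIEN2k-style .machines file.
# Assumption: k-point lines look like "weight:hostname" (e.g. "1:node01");
# keyword lines such as "granularity:1" or "lapw0:..." are skipped.
machines_hosts() {
    awk -F: '/^[0-9]+:/ { split($2, h, " "); print h[1] }' "$1" | sort -u
}

# Repeat the interactive checks (cd, ls) on every host through ssh.
# Assumption: passwordless ssh to each host is already set up.
probe_nodes() {
    machines=$1
    workdir=$2
    for host in $(machines_hosts "$machines"); do
        if ssh "$host" "cd '$workdir' && ls >/dev/null"; then
            echo "$host: ok"
        else
            echo "$host: FAILED"
        fi
    done
}
```

It would be called as probe_nodes .machines /path/to/case from the job script, so that the test runs in the same environment as lapw1para rather than in an interactive login.<br>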
<div><div></div><div><br>
2010/7/27 Marcos Veríssimo Alves <<a href="mailto:marcos.verissimo.alves@gmail.com" target="_blank">marcos.verissimo.alves@gmail.com</a>>:<br>
> Hi Laurence,<br>
> I am not running mpi, only using rsh/ssh for the plain k-point<br>
> parallelization. I couldn't really figure out how to make a .machines file<br>
> to run parallel over k-points on mpi, with one processor per machine.<br>
> However, I think Stefaan's tip went right to the point: in my job error<br>
> file I get the following errors:<br>
> LAPW0 END<br>
> .machinetmp222: No such file or directory<br>
> bash: line 0: cd: /afs/<a href="http://atc.unican.es/u/m/mverissi/WIEN2k/sro1sto6" target="_blank">atc.unican.es/u/m/mverissi/WIEN2k/sro1sto6</a>:<br>
> Permission denied<br>
> Cannot open error-file<br>
> ERRFLG - couldn't open errorflag-file.<br>
> The fact that from inside lapw1para the ssh command cannot cd to my home<br>
> directory puzzles me... it seems to be a system problem, then. However, if<br>
> you have any suggestions, they will be more than welcome!<br>
> Thanks,<br>
> Marcos<br>
><br>
> On Tue, Jul 27, 2010 at 4:27 PM, Laurence Marks <<a href="mailto:L-marks@northwestern.edu" target="_blank">L-marks@northwestern.edu</a>> wrote:<br>
>><br>
>> I doubt (although I may be wrong) that this has anything to do with<br>
>> the OS. Do you have -traceback in your compile options? This will give<br>
>> information as to which program this is happening in. Also, are you<br>
>> running mpi or not?<br>
>><br>
>> 2010/7/27 Marcos Veríssimo Alves <<a href="mailto:marcos.verissimo.alves@gmail.com" target="_blank">marcos.verissimo.alves@gmail.com</a>>:<br>
>> > Hi Stefaan and Laurence,<br>
>> > @Stefaan: I will try it.<br>
>> > @Laurence: it's the latest version, which I downloaded about two weeks<br>
>> > ago. Hope this helps.<br>
>> > Thanks,<br>
>> > Marcos<br>
>> > On Tue, Jul 27, 2010 at 3:47 PM, Laurence Marks <<a href="mailto:L-marks@northwestern.edu" target="_blank">L-marks@northwestern.edu</a>> wrote:<br>
>> >><br>
>> >> Is this the latest version, or an older one? Some changes were made in<br>
>> >> the error file access in the latest version for mpi reasons.<br>
>> >><br>
>> >> 2010/7/27 Marcos Veríssimo Alves <<a href="mailto:marcos.verissimo.alves@gmail.com" target="_blank">marcos.verissimo.alves@gmail.com</a>>:<br>
>> >> > Hi all,<br>
>> >> ><br>
>> >> > I am having a problem running in parallel over k-points.<br>
>> >> ><br>
>> >> > I have compiled the code successfully on a cluster running Debian Linux<br>
>> >> > with SGEEE as the queue system, using ssh to launch the instances on<br>
>> >> > the remote nodes and /bin/bash as the shell. My script successfully<br>
>> >> > creates a .machines file, but when I run runsp_lapw -p -NI -cc 0.0001,<br>
>> >> > the process dies. This is because, for some reason, lapw1para is not<br>
>> >> > able to write to the up(dn)lapw1_*.error files:<br>
>> >> ><br>
>> >> > forrtl: severe (47): write to READONLY file, unit 99, file<br>
>> >> > /afs/<a href="http://atc.unican.es/u/m/mverissi/WIEN2k/sro1sto6/uplapw1_1.error" target="_blank">atc.unican.es/u/m/mverissi/WIEN2k/sro1sto6/uplapw1_1.error</a><br>
>> >> ><br>
>> >> > And the same happens to the dnlapw1_*.error files.<br>
>> >> ><br>
>> >> > lapw0, on the other hand, runs fine. I have set up parallel execution<br>
>> >> > successfully on my dual-core desktop using ssh, with pretty much the same<br>
>> >> > setup, and it runs perfectly well.<br>
>> >> ><br>
>> >> > Now, I have changed the write permissions of the directory (and all the<br>
>> >> > files) with chmod -R ugo+rw /afs/<a href="http://atc.unican.es/u.." target="_blank">atc.unican.es/u..</a>., but to no avail.<br>
>> >> > Has anyone experienced a problem like this before? Could there be any known<br>
>> >> > (but obscure) reason why lapw1para would not be able to write to its files,<br>
>> >> > but lapw0para would?<br>
>> >> ><br>
>> >> > Best regards,<br>
>> >> ><br>
>> >> > Marcos<br>
>> >> ><br>
>> >> > _______________________________________________<br>
>> >> > Wien mailing list<br>
>> >> > <a href="mailto:Wien@zeus.theochem.tuwien.ac.at" target="_blank">Wien@zeus.theochem.tuwien.ac.at</a><br>
>> >> > <a href="http://zeus.theochem.tuwien.ac.at/mailman/listinfo/wien" target="_blank">http://zeus.theochem.tuwien.ac.at/mailman/listinfo/wien</a><br>
>> >> ><br>
>> >> ><br>
>> >><br>
>> >><br>
>> >><br>
>> >> --<br>
>> >> Laurence Marks<br>
>> >> Department of Materials Science and Engineering<br>
>> >> MSE Rm 2036 Cook Hall<br>
>> >> 2220 N Campus Drive<br>
>> >> Northwestern University<br>
>> >> Evanston, IL 60208, USA<br>
>> >> Tel: (847) 491-3996 Fax: (847) 491-7820<br>
>> >> email: L-marks at northwestern dot edu<br>
>> >> Web: <a href="http://www.numis.northwestern.edu" target="_blank">www.numis.northwestern.edu</a><br>
>> >> Chair, Commission on Electron Crystallography of IUCR<br>
>> >> <a href="http://www.numis.northwestern.edu/" target="_blank">www.numis.northwestern.edu/</a><br>
>> >> Electron crystallography is the branch of science that uses electron<br>
>> >> scattering and imaging to study the structure of matter.<br>
</div></div></blockquote></div><br></div></div></div>
</blockquote></div><br></div>