[Wien] parallel wien2k
Zhiyong Zhang
zyzhang at stanford.edu
Tue Feb 23 06:15:44 CET 2010
Hello Ricardo and All,
Thank you for the information. I think you are right that part of the problem is because no forces printed. The example I am using is the TiC in the user guide. when I used "run_lapw -i 40 0.001 -I" in serial mode it worked fine.
The problem "/home/zzhang/wien2k/lapw1para lapw1.def" seems to be due to the .machines file definition. If I remove the "lapw1:nx1 nx1 nx62 nx62" from the .machines file ans use the following .machines file,
lapw0:nx1 nx1 nx62 nx62
1:nx1
1:nx1
1:nx62
1:nx62
granularity:1
extrafine:1
Then the LAPW1 can run in parallel.
Does this mean that lapw1/2 can only be run in k-point parallel mode, not fine grain MPI mode?
How ever, I still got the following error in TiC.dayfile:
4 number_of_parallel_jobs
nx1(11) 0.226u 0.017s 0.31 76.18% 0+0k 0+0io 0pf+0w
nx1(11) 0.224u 0.009s 0.31 73.04% 0+0k 0+0io 0pf+0w
nx62(11) 0.222u 0.008s 0.32 71.21% 0+0k 0+0io 0pf+0w
nx62(11) 0.222u 0.010s 0.26 88.21% 0+0k 0+0io 0pf+0w
nx1(1) 0.224u 0.008s 0.26 88.89% 0+0k 0+0io 0pf+0w
nx1(1) 0.223u 0.008s 0.26 88.17% 0+0k 0+0io 0pf+0w
nx62(1) 0.222u 0.009s 0.26 86.19% 0+0k 0+0io 0pf+0w
** LAPW1 crashed!
0.062u 0.436s 0:11.45 4.2% 0+0k 0+0io 0pf+0w
error: command /home/zzhang/wien2k/lapw1para lapw1.def failed
Which files should I read to find possible causes of the crash? I looked the *.error files but can't seem to find anything useful.
Best,
Zhiyong
----- Original Message -----
From: "Ricardo Faccio" <rfaccio at fq.edu.uy>
To: "A Mailing list for WIEN2k users" <wien at zeus.theochem.tuwien.ac.at>
Sent: Monday, February 22, 2010 8:28:35 PM GMT -08:00 US/Canada Pacific
Subject: Re: [Wien] parallel wien2k
Hi Zhiyong
What is your test case? remember that forces are printed if you have atoms
located in general positions. For example, Fe in the bcc space group, will
not print forces, since all atoms have the same symmetric environment.
Regards
Ricardo
--
-------------------------------------------------------------------------
----- Dr. Ricardo Faccio
Mail: Cryssmat-Lab., Cátedra de Física, DETEMA
Facultad de Química, Universidad de la República
Av. Gral. Flores 2124, C.C. 1157
C.P. 11800, Montevideo, Uruguay.
E-mail: rfaccio at fq.edu.uy
Phone: 598 2 9241860 Int. 109
598 2 9290705
Fax: 598 2 9241906
Web: http://cryssmat.fq.edu.uy/ricardo/ricardo.htm
> Dear All,
>
>
>
> I am trying to test wien2k in parallel mode and I got into some problem. I
> am using
>
>
>
> run_lapw -p -i 40 -fc 0.001 -I
>
>
>
> If I use a number of 0.001 for the option fc above, I got the following
> error:
>
>
>
> Force-convergence not possible. Forces not present.
>
>
>
> If I do not use a number for the -fc option, but use "run_lapw -p -i 40
> -fc
> -I" instead
>
>
>
> Then lapw0 finishes without a problem but the program doesn't branch to
> lapw1. An error message is generated when doing the test
>
>
>
> "if ($fcut == "0") goto lapw1
>
>
>
> I was able to do "run_lapw -p -i 40 -I", without the "-fc" option at all
> and
> was able to finish "lapw0 -p" and then start "lapw1 -p" but got into the
> following error:
>
>
>
> error: command /home/zzhang/wien2k/lapw1para lapw1.def failed
>
>
>
> Does anybody have similar problems and know how to fix this?
>
>
>
> It does the following:
>
>
>
> running LAPW1 in parallel mode (using .machines)
>
>
>
> and the .machines file is as follows:
>
>
>
> #
>
> lapw0:nx1 nx1 nx62 nx62
>
> lapw1:nx1 nx1 nx62 nx62
>
> lapw2:nx1 nx1 nx62 nx62
>
> 1:nx1
>
> 1:nx1
>
> 1:nx62
>
> 1:nx62
>
> granularity:1
>
> extrafine:1
>
>
>
> Thanks,
>
> Zhiyong
>
>
>
> _______________________________________________
> Wien mailing list
> Wien at zeus.theochem.tuwien.ac.at
> http://zeus.theochem.tuwien.ac.at/mailman/listinfo/wien
>
_______________________________________________
Wien mailing list
Wien at zeus.theochem.tuwien.ac.at
http://zeus.theochem.tuwien.ac.at/mailman/listinfo/wien
More information about the Wien
mailing list