[Wien] k-parallel execution II
Torsten Andersen
thor at physik.uni-kl.de
Wed Nov 26 08:46:34 CET 2003
Bertoni Giovanni wrote:
> First: Sorry gelfand, I have done an auto-reply and I sent you directly
> the mail istead to the wien-list!
>
> second: Dear users,
>
> putting in the .machine:
>
> 1:localhost
> 1:localhost
>
> for a 2 processors machine, I have the error
>
> STDOUT
> ----------------------------------
> LAPW0 END
> localhost: Connection refused
> localhost: Connection refused
> -----------------------------------
Do you have the loopback device activated? In /etc/hosts should be a
line with 127.0.0.1 localhost.
Does your host accept rsh or ssh connections?
Have you set your machine up as a shared-memory machine in siteconfig?
Then it should be started in the background, not via rsh, I think...
>
> is it a probem of configuration?
> please tell me.
> thank you
> giovanni
>
> gelfand a écrit :
>
>>This is very easy to do.
>>In the .machines file, just put
>>1:localhost
>>1:localhost
>>1:localhost
>><<as many lines as you have processors>>
>>This only makes sense if you are using single-threaded BLAS/LAPACK
>>libraries. For dual-CPU machines it is definitely the way to go.
>>For a 16-way machine, I'm not sure, it depends on how much RAM
>>is available per CPU (you certainly don't want to cause the machine
>>to swap to disk).
>>
>>
>>>===== Original Message From Bertoni Giovanni <giovanni at cemes.fr> =====
>>>Dear users,
>>>
>>>I have a question about k-parallelization:
>>>
>>>is it possible to do a k-parallelization on a single machine with
>>>multiple processors?
>>>for exemple, I try on a machine with >= 8 processors.
>>>I have tried with the TiC example (47 k-points) in interactive mode (no
>>>scripts or batch files...)
>>>
>>>this is the .machine file
>>>-----------------------------------------------
>>># .machines is the control file for parallel execution. Add lines like
>>>#
>>># speed:machine_name
>>>#
>>># for each machine specifying there relative speed. For mpi
>>>parallelization use
>>>#
>>># speed:machine_name:1 machine_name:1
>>># lapw0:machine_name:1 machine_name:1
>>>#
>>># further options are:
>>>#
>>># granularity:number (for loadbalancing on irregularly used machines)
>>># residue:machine_name (on shared memory machines)
>>># extrafine (to distribute the remaining k-points one after
>>>the other)
>>>#
>>># granularity sets the number of files that will be approximately
>>># be generated by each processor; this is used for load-balancing.
>>># On very homogeneous systems set number to 1
>>># if after distributing the k-points to the various machines residual
>>># k-points are left, they will be distributed to the
>>>residual-machine_name.
>>>#
>>>1:localhost:8
>>>lapw0:localhost:8
>>>--------------------------------------------------
>>>
>>>..very simple. and this is the TiC.dayfile
>>>
>>>--------------------------------------------------
>>>...
>>> start (Tue Nov 25 18:04:11 MET 2003) with lapw0 (20/20 to go)
>>>
>>>> lapw0 -p (18:04:11) starting parallel lapw0 at Tue Nov 25 18:04:12 MET
>>
>>2003
>>
>>>-------- .machine1 : 8 processors
>>>localhost:8
>>>--------
>>>** lapw0 crashed!
>>>0.0u 0.3s 0:01 24% 0+0k 10+3io 0pf+0w
>>>
>>>
>>>> stop error
>>>
>>>---------------------------------------------------
>>>
>>>
>>>if i don't use parallel lapw0, I have the crash in lapw1. the test was
>>>done on a machine that has the mpirun.
>>>please tell me.
>>>if is not possible to do a k-parallelization on a single machine, I will
>>>use a fine grained parallelization, and I will have another question for
>>>you... (next time)
>>>Thank you.
>>>giovanni.
>>>_______________________________________________
>>>Wien mailing list
>>>Wien at zeus.theochem.tuwien.ac.at
>>>http://zeus.theochem.tuwien.ac.at/mailman/listinfo/wien
>>
>>Martin Gelfand
>>
>>Associate Professor.............Phone: 970 491 5263
>>Department of Physics.............Fax: 970 491 7947
>>Colorado State University.......Email: gelfand at lamar.colostate.edu
>>Fort Collins CO 80523-1875
>
> _______________________________________________
> Wien mailing list
> Wien at zeus.theochem.tuwien.ac.at
> http://zeus.theochem.tuwien.ac.at/mailman/listinfo/wien
>
--
Dr. Torsten Andersen TA-web: http://deep.at/myspace/
AG Hübner, Department of Physics, Kaiserslautern University, and
Condensed Matter Theory Group, Department of Physics, Uppsala University
Web: http://www.fysik4.fysik.uu.se/ http://www.physik.uni-kl.de/
More information about the Wien
mailing list