[Wien] k-parallel execution II

Bertoni Giovanni giovanni at cemes.fr
Tue Nov 25 19:54:33 CET 2003


First: Sorry gelfand, I have done an auto-reply and I sent you directly
the mail istead to the wien-list!

second: Dear users, 

putting in the .machine:

1:localhost
1:localhost

for a 2 processors machine, I have the error

STDOUT 
----------------------------------
LAPW0 END
localhost: Connection refused
localhost: Connection refused
-----------------------------------

is it a probem of configuration?
please tell me.
thank you
giovanni

gelfand a écrit :
> 
> This is very easy to do.
> In the .machines file, just put
> 1:localhost
> 1:localhost
> 1:localhost
> <<as many lines as you have processors>>
> This only makes sense if you are using single-threaded BLAS/LAPACK
> libraries.  For dual-CPU machines it is definitely the way to go.
> For a 16-way machine, I'm not sure, it depends on how much RAM
> is available per CPU (you certainly don't want to cause the machine
> to swap to disk).
> 
> >===== Original Message From Bertoni Giovanni <giovanni at cemes.fr> =====
> >Dear users,
> >
> >I have a question about k-parallelization:
> >
> >is it possible to do a k-parallelization on a single machine with
> >multiple processors?
> >for exemple, I try on a machine with >= 8 processors.
> >I have tried with the TiC example (47 k-points) in interactive mode (no
> >scripts or batch files...)
> >
> >this is the .machine file
> >-----------------------------------------------
> ># .machines is the control file for parallel execution. Add lines like
> >#
> >#   speed:machine_name
> >#
> ># for each machine specifying there relative speed. For mpi
> >parallelization use
> >#
> >#   speed:machine_name:1 machine_name:1
> >#   lapw0:machine_name:1 machine_name:1
> >#
> ># further options are:
> >#
> >#   granularity:number (for loadbalancing on irregularly used machines)
> >#   residue:machine_name  (on shared memory machines)
> >#   extrafine         (to distribute the remaining k-points one after
> >the other)
> >#
> ># granularity sets the number of files that will be approximately
> ># be generated by each processor; this is used for load-balancing.
> ># On very homogeneous systems set number to 1
> ># if after distributing the k-points to the various machines residual
> ># k-points are left, they will be distributed to the
> >residual-machine_name.
> >#
> >1:localhost:8
> >lapw0:localhost:8
> >--------------------------------------------------
> >
> >..very simple. and this is the TiC.dayfile
> >
> >--------------------------------------------------
> >...
> >    start      (Tue Nov 25 18:04:11 MET 2003) with lapw0 (20/20 to go)
> >>   lapw0 -p   (18:04:11) starting parallel lapw0 at Tue Nov 25 18:04:12 MET
> 2003
> >-------- .machine1 : 8 processors
> >localhost:8
> >--------
> >**  lapw0 crashed!
> >0.0u 0.3s 0:01 24% 0+0k 10+3io 0pf+0w
> >
> >>   stop error
> >---------------------------------------------------
> >
> >
> >if i don't use parallel lapw0, I have the crash in lapw1. the test was
> >done on a machine that has the mpirun.
> >please tell me.
> >if is not possible to do a k-parallelization on a single machine, I will
> >use a fine grained parallelization, and I will have another question for
> >you... (next time)
> >Thank you.
> >giovanni.
> >_______________________________________________
> >Wien mailing list
> >Wien at zeus.theochem.tuwien.ac.at
> >http://zeus.theochem.tuwien.ac.at/mailman/listinfo/wien
> 
> Martin Gelfand
> 
> Associate Professor.............Phone: 970 491 5263
> Department of Physics.............Fax: 970 491 7947
> Colorado State University.......Email: gelfand at lamar.colostate.edu
> Fort Collins CO 80523-1875



More information about the Wien mailing list