[Wien] Hardware Benchmarks for lapw1c

Florent Boucher Florent.Boucher at cnrs-imn.fr
Fri Jan 26 12:00:15 CET 2007


Normally, even if the MPI version is compiled you can run a parallel 
lapw1 calculation without any recompilation. You just have to be careful 
in the way you prepare the machine file (one line per process with 
1:localhost), the option you use in $WIENROOT/parallel_options, the 
definition of usermote in the lapw1para script.


            if($number_per_job[$p] == 1 ) then
              if ($useremote == 1) then
                    ($remote  $machine[$p] $t $exe ${def}_$loop.def;rm 
-f .lock_$lockfile[$p]) >>.time1_$loop &
              else
                  (cd $PWD;$t $exe ${def}_$loop.def;rm -f 
.lock_$lockfile[$p]) >>.time1_$loop &
              endif
            else
              set ttt=(`echo $mpirun | sed -e 
"s^_NP_^$number_per_job[$p]^" -e "s^_EXEC_^$WIENROOT/${exe}_mpi ${def
}_$loop.def^" -e "s^_HOSTS_^.machine[$p]^"`)
              (cd $PWD;$t $ttt;rm -f .lock_$lockfile[$p]) >>.time1_$loop &

If you have 8 k points, I am expecting something like 130-140% total CPU 
time on your 8 cores by running 8 para process with OMP_BUM_THRERADS=1 
that is at least much better than using OMP_NUM_THREADS=8
Regards
Florent

 

-- 
 -------------------------------------------------------------------------
| Florent BOUCHER                    |                                    |
| Institut des Matériaux Jean Rouxel | Mailto:Florent.Boucher at cnrs-imn.fr |
| 2, rue de la Houssinière           | Phone: (33) 2 40 37 39 24          |
| BP 32229                           | Fax:   (33) 2 40 37 39 95          |
| 44322 NANTES CEDEX 3 (FRANCE)      | http://www.cnrs-imn.fr             |
 -------------------------------------------------------------------------



More information about the Wien mailing list