[Wien] error in LAPW1 (parallel calculation)

Khuong P. Ong ongpk at ihpc.a-star.edu.sg
Mon Jun 6 12:56:36 CEST 2005


Dear Wien users,

  Could someone help me to overcome the following error:

     cycle 1     (Mon Jun  6 13:13:48 SGT 2005)  (40/40 to go)

 >   lapw0 -p    (13:13:48) starting parallel lapw0 at Mon Jun  6 13:13:49 
SGT 2005
--------
running lapw0 in single mode
33.120u 0.220s 0:35.84 93.0%    0+0k 0+0io 2323pf+0w
 >   lapw1  -up -p       (13:14:24) starting parallel lapw1 at Mon Jun  6 
13:14:24 SGT 2005
->  starting parallel LAPW1 jobs at Mon Jun  6 13:14:24 SGT 2005
running LAPW1 in parallel mode (using .machines)
8 number_of_parallel_jobs
**  LAPW1 crashed!
0.570u 0.920s 3:39:01.13 0.0%   0+0k 0+0io 122254pf+0w

 >   stop error

I checked the error files and obtained:
**  Error in Parallel LAPW1
**  LAPW1 STOPPED at Mon Jun 6 16:53:25 SGT 2005
**  check ERROR FILES!
  Cholesky INFO =  4903
  'SECLR4' - POTRF (Scalapack/LAPACK) failed.

This error happen at only one of eight machines with error

  Cholesky INFO =  4903
  'SECLR4' - POTRF (Scalapack/LAPACK) failed.

No error were reported for the last 7 machines.

This error sometime appears at Cycle 3, sometime at cycle 1 as in this case.

Many thanks for help.

Regards,

Khuong



-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://zeus.theochem.tuwien.ac.at/pipermail/wien/attachments/20050606/ba9a21d6/attachment.html


More information about the Wien mailing list