[Wien] Running Parallel Jobs
jadhikari@clarku.edu
Wed Oct 18 00:46:15 CEST 2006
Hello,
Thank you very much for the previous help.
We are trying to run a job in parallel mode for the first time,
but it never gets past the LAPW1 step.
Here is the PBS script:
________________________________________
#!/bin/sh
#PBS -l nodes=2:ppn=2:myrinet
#PBS -j oe
cd /home/lsmith/rut
rm -f .machines
awk '{print "1:"$1}' $PBS_NODEFILE > .machines
echo "granularity:1" >> .machines
/usr/local/mpich/bin/mpirun -np 4 -machinefile $PBS_NODEFILE
/usr/opt/WIEN2k/run_lapw -p
________________________________________
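For reference, here is a minimal sketch of what the two .machines lines in the script produce. It assumes a hypothetical node file with hosts node01 and node02, two slots each, standing in for $PBS_NODEFILE (the real host names come from PBS), and it writes to a scratch directory instead of /home/lsmith/rut:

```shell
#!/bin/sh
# Hypothetical stand-in for $PBS_NODEFILE with nodes=2:ppn=2:
# each host is listed once per requested processor.
workdir=$(mktemp -d)
nodefile="$workdir/nodefile"
printf 'node01\nnode01\nnode02\nnode02\n' > "$nodefile"

# Same two commands as in the PBS script, redirected to the scratch directory.
awk '{print "1:"$1}' "$nodefile" > "$workdir/.machines"
echo "granularity:1" >> "$workdir/.machines"

# Show the generated file: one "1:<host>" line per slot, then the granularity line.
cat "$workdir/.machines"
```

With this assumed node file, .machines ends up with four "1:<host>" lines followed by "granularity:1".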
Error message:
Missing: program name
LAPW0 END
LAPW1 END
LAPW1 END
LAPW1 END
LAPW1 END
LAPW1 END
We also got another error, as follows:
--Cholesky INFO = 895
'SECLR4' - POTRF (Scalapack/LAPACK) failed.
Is this an error in case.in1 or in the script? We could not figure it out.
The nodes run for 3-4 minutes and then stop. TESTPARA1 shows this error.
Regards,
Subin Adhikari