[Wien] I/O buffer

Peter Blaha pblaha at zeus.theochem.tuwien.ac.at
Wed Oct 20 15:23:29 CEST 2004


This is a problem with the machine, not with the code. Most likely it is
NFS, or possibly a failing hard disk.

Check your NFS and put longer "sleep" times in lapw1para and lapw2para.
As a check, put the line
   cp case.vns case.vns_save
after lapw0 in run_lapw. When the crash occurs while reading case.vns,
compare the two files.
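
The comparison suggested above can be done with cmp or diff. As a minimal
sketch, the Python script below does the same and also lists the positions of
corrupted bytes. The function names are hypothetical and not part of WIEN2k,
and the script assumes that a formatted file such as case.vns should contain
only printable ASCII characters, tabs and line ends.

```python
import sys


def first_difference(path_a, path_b, chunk_size=1 << 20):
    """Return the byte offset of the first difference, or None if identical."""
    offset = 0
    with open(path_a, "rb") as fa, open(path_b, "rb") as fb:
        while True:
            a = fa.read(chunk_size)
            b = fb.read(chunk_size)
            if a != b:
                for i, (x, y) in enumerate(zip(a, b)):
                    if x != y:
                        return offset + i
                # One file is a prefix of the other.
                return offset + min(len(a), len(b))
            if not a:
                return None  # both files ended without a difference
            offset += len(a)


def invalid_bytes(path):
    """Yield (line, column, byte) for every byte that is not printable ASCII.

    Assumption: tabs are allowed, everything else outside 32..126 is suspect.
    """
    with open(path, "rb") as f:
        for lineno, line in enumerate(f, start=1):
            for col, byte in enumerate(line.rstrip(b"\r\n"), start=1):
                if byte != 9 and not 32 <= byte <= 126:
                    yield lineno, col, byte


def report(saved, current):
    """Print a short summary comparing the saved copy with the current file."""
    offset = first_difference(saved, current)
    if offset is None:
        print("files are identical")
    else:
        print("files differ, first difference at byte offset", offset)
    for lineno, col, byte in invalid_bytes(current):
        print("invalid byte 0x%02x at line %d, column %d" % (byte, lineno, col))


# Only runs when called with exactly two file names.
if __name__ == "__main__" and len(sys.argv) == 3:
    report(sys.argv[1], sys.argv[2])
```

If the script is saved under a name such as check_vns.py (a hypothetical
name), it would be called as
   python check_vns.py case.vns_save case.vns
If the saved copy is clean and the current file is not, the file was damaged
after lapw0 wrote it, which points to the file system and not to the code.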

> I compiled the WIEN2k code (WIEN2k_04.6, Release
> 21/06/2004) on a supercomputer (Red Hat 9.0). The compilers
> are f95i -v 7.1 and cc. There were no compiler errors,
> and it worked perfectly well on a single CPU. There
> were also no errors when running big jobs in parallel without
> a queue system. But when I submitted a job to the queueing
> system (SEG), the job ran in parallel without any
> errors for the first few iterations and then
> crashed in different parts of an iteration, with "LAPW1
> CRASHED" or "LAPW2 CRASHED".
> We cannot predict in which iteration the job will crash.
> 
> 1) For the first case, "LAPW1 CRASHED", we checked the
> information from the queueing system:
> 
> Input/Output Error 148: Invalid character
> 
>    In Procedure: atpar
>         At Line: 350
> 
>       Statement: Formatted READ
>            Unit: 19
>    Connected To: 113.vns
>            Form: Formatted
>          Access: Sequential
> Records Read   : 83398
> Records Written: 0
> 
> Current I/O Buffer:
> 
>     0.429403044260E-1`³lÄ456758936985E-11
> 0.486249676219E-11 0.517302673686E-11
>                      !
> 
> 
> End of diagnostics
> 
> 
> 2) For the second case, "LAPW2 CRASHED", the
> information is:
> 
> 
> Input/Output Error 148: Invalid character
> 
>    In Procedure: main program
>         At Line: 251
> 
>       Statement: Formatted READ
>            Unit: 17
>    Connected To: 113.vrespval_8
>            Form: Formatted
>          Access: Sequential
> Records Read   : 22958
> Records Written: 0
> 
> Current I/O Buffer:
> 
>     0.782512321234E-06 0.836779422558E-06
> 0.894572956491E-06 0.9561`³lÄ9228E-06
>                                                       
>             !
> 
> 
> End of diagnostics
> 
> cp: cannot stat `.in.tmp': No such file or directory
> rm: cannot remove `.in.tmp': No such file or directory
> rm: cannot remove `.in.tmp1': No such file or directory
> 
> I don't know why some files contain invalid
> characters. Is it a problem with the machine or a problem
> with the code?
> 
> Thanks
> 
> Zhang
> 
> 


                                      P.Blaha
--------------------------------------------------------------------------
Peter BLAHA, Inst.f. Materials Chemistry, TU Vienna, A-1060 Vienna
Phone: +43-1-58801-15671             FAX: +43-1-58801-15698
Email: blaha at theochem.tuwien.ac.at    WWW: http://info.tuwien.ac.at/theochem/
--------------------------------------------------------------------------



