[Wien] running jobs without $SCRATCH
Stefaan Cottenier
Stefaan.Cottenier at fys.kuleuven.be
Fri Apr 6 22:03:01 CEST 2007
You probably misunderstood the answer. With granularity:4 you cannot
use $SCRATCH, which is why you ran into trouble. If you set
granularity:1 you can continue to use $SCRATCH; the only drawback is
that the load balancing on your system might not be optimal.
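
To make the arithmetic concrete, here is a rough Python sketch. It assumes
that each line of the .machines file receives
n_k * weight / (sum of weights * granularity) k-points per job (integer
division, at least one). The function name chunk_sizes is made up for this
illustration and is not part of WIEN2k.

```python
def chunk_sizes(n_kpoints, weights, granularity):
    """K-points handed to each .machines entry per job (assumed formula)."""
    total_weight = sum(weights)
    return [max(1, (n_kpoints * w) // (total_weight * granularity))
            for w in weights]


# Six "1:nodeN" lines, as in the .machines file quoted in this thread.
weights = [1] * 6
for g in (4, 1):
    sizes = chunk_sizes(24, weights, g)
    print(f"granularity:{g} -> chunk sizes {sizes}, {24 // sizes[0]} jobs")
```

Under this assumption, granularity:4 splits the 24 k-points into 24
single-k-point jobs that are handed out as machines become free, so the
assignment of k-points to nodes is not fixed in advance. That matches the
node(1) lines in the log quoted below. With granularity:1 each entry gets
its 4 k-points in a single job.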
Stefaan
Quoting jadhikari at clarku.edu:
> Prof. P Blaha,
>
> Thank you very much for the answer.
>
> We have a distributed-memory system in our cluster, with a master node and
> about 48 child nodes with 2 processors each. We use $SCRATCH on the nodes
> so that no jobs run on the master node (master node alone is not allowed).
>
> I have no idea how to run jobs on the nodes without using $SCRATCH
> space. I would be very grateful for any suggestions.
>
> Waiting for the reply.
> Subin
>
>
>
>> Put granularity:1.
>> This will distribute the k-points evenly, all at once (rather than one
>> after the other, which is useful for load balancing but means you cannot
>> use $SCRATCH).
>>
>>
>>
>> jadhikari at clarku.edu wrote:
>>> Dear Wien users,
>>>
>>> I have a question concerning k-point distribution.
>>> 24 IBZ points are evenly distributed over 3 nodes and 6 processors, with
>>> each processor getting 4 IBZ points, as shown below (1 node has 2
>>> processors).
>>>
>>> But this is not always the situation. Sometimes one processor gets more
>>> k-points than the others, and the system always crashes when this
>>> happens.
>>>
>>> Is there a way to control this inhomogeneity? All the processors are of
>>> equal speed. The .machines file is shown at the end.
>>>
>>> Thank you.
>>>
>>> Subin
>>>
>>> _________________________________________________________________________
>>> node2(1) 9.176u 0.092s 0:09.28 99.7% 0+0k 0+0io 0pf+0w
>>> node5(1) 9.715u 0.118s 0:10.50 93.5% 0+0k 0+0io 0pf+0w
>>> node9(1) 9.754u 0.130s 0:11.75 84.0% 0+0k 0+0io 0pf+0w
>>> node2(1) 10.918u 0.112s 0:17.80 61.9% 0+0k 0+0io 0pf+0w
>>> node5(1) 9.453u 0.114s 0:11.28 84.7% 0+0k 0+0io 0pf+0w
>>> node9(1) 9.995u 0.117s 0:13.79 73.2% 0+0k 0+0io 0pf+0w
>>> node2(1) 9.286u 0.095s 0:09.40 99.6% 0+0k 0+0io 0pf+0w
>>> node5(1) 11.702u 0.115s 0:12.99 90.9% 0+0k 0+0io 0pf+0w
>>> node9(1) 9.336u 0.110s 0:16.29 57.9% 0+0k 0+0io 0pf+0w
>>> node2(1) 9.403u 0.111s 0:15.62 60.8% 0+0k 0+0io 0pf+0w
>>> node5(1) 11.607u 0.116s 0:15.94 73.4% 0+0k 0+0io 0pf+0w
>>> node9(1) 9.595u 0.119s 0:13.52 71.7% 0+0k 0+0io 0pf+0w
>>> node2(1) 9.207u 0.112s 0:10.64 87.5% 0+0k 0+0io 0pf+0w
>>> node5(1) 11.135u 0.124s 0:14.81 75.9% 0+0k 0+0io 0pf+0w
>>> node9(1) 9.985u 0.114s 0:16.91 59.6% 0+0k 0+0io 0pf+0w
>>> node2(1) 10.602u 0.118s 0:18.33 58.4% 0+0k 0+0io 0pf+0w
>>> node5(1) 11.476u 0.106s 0:16.98 68.1% 0+0k 0+0io 0pf+0w
>>> node9(1) 9.325u 0.100s 0:13.75 68.5% 0+0k 0+0io 0pf+0w
>>> node2(1) 9.447u 0.109s 0:10.03 95.1% 0+0k 0+0io 0pf+0w
>>> node5(1) 9.997u 0.115s 0:11.08 91.1% 0+0k 0+0io 0pf+0w
>>> node9(1) 10.821u 0.119s 0:19.06 57.3% 0+0k 0+0io 0pf+0w
>>> node2(1) 9.400u 0.097s 0:13.84 68.5% 0+0k 0+0io 0pf+0w
>>> node5(1) 11.749u 0.130s 0:17.38 68.2% 0+0k 0+0io 0pf+0w
>>> node9(1) 9.436u 0.112s 0:12.45 76.6% 0+0k 0+0io 0pf+0w
>>> Summary of lapw1para:
>>> node2 k=8 user=77.439 wallclock=104.94
>>> node5 k=8 user=86.834 wallclock=110.96
>>> node9 k=8 user=78.247 wallclock=117.52
>>> node2 k=8 user=77.439 wallclock=104.94
>>> node5 k=8 user=86.834 wallclock=110.96
>>> node9 k=8 user=78.247 wallclock=117.52
>>> _________________________________________________________
>>> .machines file
>>>
>>> 1:node2
>>> 1:node5
>>> 1:node9
>>> 1:node2
>>> 1:node5
>>> 1:node9
>>> granularity:4
>>> extrafine:1
>>>
>>> _______________________________________________
>>> Wien mailing list
>>> Wien at zeus.theochem.tuwien.ac.at
>>> http://zeus.theochem.tuwien.ac.at/mailman/listinfo/wien
--
Stefaan Cottenier
Instituut voor Kern- en Stralingsfysica
K.U.Leuven
Celestijnenlaan 200 D
B-3001 Leuven (Belgium)
tel: + 32 16 32 71 45
fax: + 32 16 32 79 85
e-mail: stefaan.cottenier at fys.kuleuven.be
Disclaimer: http://www.kuleuven.be/cwis/email_disclaimer.htm