[Wien] Need help ; Crash during parallel execution.

Youngboem Cho jj2928 at naver.com
Sat Aug 29 13:58:24 CEST 2015


Hi. i have used wien2k for two months. (version of 14.2)
my machine has 8-core CPU composed of 18 threads with 64Gb memory.
so i had my machine execute lapw on parallel.

i set .machines as below.
-------------------
1:localhost
1:localhost
1:localhost
1:localhost
1:localhost
1:localhost
1:localhost
1:localhost
granularity:1
extrafine:1
--------------------

GotoBLAS2 is installed on my computer and i use gfortran + gcc.
(MPI & MKL are not installed.)
whenever i execute testpara, testpara1, testpara2, it shows me that 
there is no problem. even the command "top" on terminal shows me that 
all k-points are distributed well.
but the result is as below.


i wonder why this happened. is there anything wrong?

-sincerely Youngboem.

----------------------------------------------------------------------------

Calculating Ge2x2x2 in /media/neol/DATA/Ge2x2x2
on neol-felix with PID 23675
using WIEN2k_14.2 (Release 15/10/2014) in /home/neol/WIEN2k


     start     (2015. 08. 26. (수) 16:42:18 KST) with lapw0 (40/99 to go)

     cycle 1     (2015. 08. 26. (수) 16:42:18 KST)     (40/99 to go)

 >   lapw0 -p    (16:42:18) starting parallel lapw0 at 2015. 08. 26. 
(수) 16:42:18 KST
-------- .machine0 : processors
running lapw0 in single mode
249.6u 0.4s 4:10.45 99.8% 0+0k 0+0io 0pf+0w
 >   lapw1  -p        (16:46:29) starting parallel lapw1 at 2015. 08. 
26. (수) 16:46:29 KST
->  starting parallel LAPW1 jobs at 2015. 08. 26. (수) 16:46:29 KST
running LAPW1 in parallel mode (using .machines)
8 number_of_parallel_jobs
      localhost(4) 2818.3u 2.8s 47:29.28 99.0% 0+0k 0+0io 0pf+0w
      localhost(4) 2723.5u 2.5s 45:41.39 99.4% 0+0k 0+0io 0pf+0w
      localhost(4) 2817.4u 2.9s 47:03.20 99.8% 0+0k 0+0io 0pf+0w
      localhost(4) 2784.4u 2.1s 46:34.70 99.7% 0+0k 0+0io 0pf+0w
      localhost(4) 2806.5u 2.7s 47:15.89 99.0% 0+0k 0+0io 0pf+0w
      localhost(4) 2803.6u 2.4s 46:58.69 99.5% 0+0k 0+0io 0pf+0w
      localhost(4) 2760.0u 2.1s 46:06.23 99.8% 0+0k 0+0io 0pf+0w
      localhost(4) 2722.8u 2.6s 45:27.66 99.9% 0+0k 0+0io 0pf+0w
    Summary of lapw1para:
    localhost     k=32     user=22236.5     wallclock=22357
22241.8u 24.7s 47:30.45 781.1% 0+0k 0+600io 0pf+0w
 >   lapw2 -p         (17:33:59) running LAPW2 in parallel mode
       localhost 216.4u 1.8s 3:39.00 99.6% 0+0k 0+0io 0pf+0w
       localhost 286.8u 2.1s 4:49.76 99.7% 0+0k 0+0io 0pf+0w
       localhost 219.7u 1.7s 3:42.08 99.7% 0+0k 0+0io 0pf+0w
       localhost 251.3u 2.0s 4:13.99 99.7% 0+0k 0+0io 0pf+0w
       localhost 301.7u 2.0s 5:04.28 99.8% 0+0k 0+0io 0pf+0w
       localhost 274.9u 1.8s 4:37.40 99.7% 0+0k 0+0io 0pf+0w
       localhost 262.1u 2.1s 4:24.80 99.7% 0+0k 0+0io 0pf+0w
       localhost 228.3u 1.8s 3:50.62 99.7% 0+0k 0+0io 0pf+0w
    Summary of lapw2para:
    localhost     user=2041.2     wallclock=2061.93
2049.0u 16.1s 5:14.26 657.1% 0+0k 0+944io 0pf+0w
 >   lcore    (17:39:13) 0.7u 0.0s 0:00.79 96.2% 0+0k 0+0io 0pf+0w
 >   mixer     (17:39:15) 3.4u 0.1s 0:04.03 88.5% 0+0k 0+0io 0pf+0w
:ENERGY convergence:  0 0.0001 0
:CHARGE convergence:  0 0.0000 0
ec cc and fc_conv 0 1 1

     cycle 2     (2015. 08. 26. (수) 17:39:19 KST)     (39/98 to go)

 >   lapw0 -p    (17:39:19) starting parallel lapw0 at 2015. 08. 26. 
(수) 17:39:19 KST
-------- .machine0 : processors
running lapw0 in single mode
184.0u 0.1s 3:04.46 99.8% 0+0k 0+0io 0pf+0w
 >   lapw1  -p        (17:42:23) starting parallel lapw1 at 2015. 08. 
26. (수) 17:42:23 KST
->  starting parallel LAPW1 jobs at 2015. 08. 26. (수) 17:42:23 KST
running LAPW1 in parallel mode (using .machines)
8 number_of_parallel_jobs
      localhost(4) 2805.6u 2.6s 47:15.54 99.0% 0+0k 0+0io 0pf+0w
      localhost(4) 2651.7u 2.7s 44:23.47 99.6% 0+0k 0+0io 0pf+0w
      localhost(4) 2808.6u 2.4s 46:53.31 99.9% 0+0k 0+0io 0pf+0w
      localhost(4) 2772.6u 2.3s 46:46.95 98.8% 0+0k 0+0io 0pf+0w
      localhost(4) 2787.8u 2.5s 46:59.42 98.9% 0+0k 0+0io 0pf+0w
      localhost(4) 2766.4u 2.3s 46:39.33 98.9% 0+0k 0+0io 0pf+0w
      localhost(4) 2842.8u 2.7s 47:48.49 99.2% 0+0k 0+0io 0pf+0w
      localhost(4) 2780.3u 2.5s 47:06.06 98.4% 0+0k 0+0io 0pf+0w
    Summary of lapw1para:
    localhost     k=32     user=22215.8     wallclock=22432.6
22221.0u 24.2s 47:49.72 775.1% 0+0k 0+600io 0pf+0w
 >   lapw2 -p         (18:30:13) running LAPW2 in parallel mode
       localhost 298.9u 2.1s 5:02.66 99.4% 0+0k 0+0io 0pf+0w
       localhost 263.5u 2.1s 4:26.69 99.6% 0+0k 0+0io 0pf+0w
       localhost 231.8u 2.0s 3:54.89 99.5% 0+0k 0+0io 0pf+0w
       localhost 278.6u 2.0s 4:41.45 99.7% 0+0k 0+0io 0pf+0w
       localhost 275.1u 1.8s 4:38.13 99.5% 0+0k 0+0io 0pf+0w
       localhost 263.0u 2.1s 4:25.76 99.7% 0+0k 0+0io 0pf+0w
       localhost 236.7u 1.9s 4:00.33 99.3% 0+0k 0+0io 0pf+0w
       localhost 262.9u 1.9s 4:25.59 99.7% 0+0k 0+0io 0pf+0w
    Summary of lapw2para:
    localhost     user=2110.5     wallclock=2135.5
2119.3u 16.7s 5:13.33 681.7% 0+0k 0+944io 0pf+0w
 >   lcore    (18:35:27) 0.7u 0.0s 0:00.78 96.1% 0+0k 0+0io 0pf+0w
 >   mixer     (18:35:28) 3.6u 0.1s 0:04.22 89.3% 0+0k 0+0io 0pf+0w
:ENERGY convergence:  0 0.0001 0
:CHARGE convergence:  0 0.0000 0
ec cc and fc_conv 0 1 1

     cycle 3     (2015. 08. 26. (수) 18:35:32 KST)     (38/97 to go)

 >   lapw0 -p    (18:35:32) starting parallel lapw0 at 2015. 08. 26. 
(수) 18:35:32 KST
-------- .machine0 : processors
running lapw0 in single mode
181.0u 0.1s 3:01.59 99.7% 0+0k 0+0io 0pf+0w
 >   lapw1  -p        (18:38:34) starting parallel lapw1 at 2015. 08. 
26. (수) 18:38:34 KST
->  starting parallel LAPW1 jobs at 2015. 08. 26. (수) 18:38:34 KST
running LAPW1 in parallel mode (using .machines)
8 number_of_parallel_jobs
      localhost(4) 2861.5u 3.2s 48:16.32 98.9% 0+0k 0+8io 0pf+0w
      localhost(4) 2816.5u 2.4s 47:22.02 99.1% 0+0k 0+8io 0pf+0w
      localhost(4) 2719.0u 2.3s 45:49.59 98.9% 0+0k 0+0io 0pf+0w
      localhost(4) 2811.2u 2.5s 47:35.30 98.5% 0+0k 0+0io 0pf+0w
      localhost(4) 2761.7u 2.5s 46:21.44 99.3% 0+0k 0+0io 0pf+0w
      localhost(4) 2694.3u 2.6s 45:15.32 99.3% 0+0k 0+0io 0pf+0w
      localhost(4) 2806.1u 2.3s 47:27.10 98.6% 0+0k 0+0io 0pf+0w
      localhost(4) 2846.7u 3.5s 48:24.18 98.1% 0+0k 0+32io 0pf+0w
    Summary of lapw1para:
    localhost     k=32     user=22317     wallclock=22591.3
22323.4u 26.7s 48:25.46 769.2% 0+0k 0+648io 0pf+0w
 >   lapw2 -p         (19:26:59) running LAPW2 in parallel mode
       localhost 272.9u 1.6s 4:35.34 99.7% 0+0k 0+0io 0pf+0w
       localhost 250.0u 1.6s 4:12.25 99.7% 0+0k 0+0io 0pf+0w
       localhost 244.6u 1.7s 4:06.88 99.7% 0+0k 0+0io 0pf+0w
       localhost 224.6u 1.6s 3:46.89 99.7% 0+0k 0+0io 0pf+0w
       localhost 235.3u 1.7s 3:57.66 99.7% 0+0k 0+0io 0pf+0w
       localhost 215.4u 1.7s 3:37.68 99.7% 0+0k 0+0io 0pf+0w
       localhost 281.2u 1.7s 4:43.59 99.8% 0+0k 0+0io 0pf+0w
       localhost 241.3u 1.6s 4:03.48 99.8% 0+0k 0+0io 0pf+0w
    Summary of lapw2para:
    localhost     user=1965.3     wallclock=1983.77
1974.4u 14.2s 4:55.34 673.3% 0+0k 0+944io 0pf+0w
 >   lcore    (19:31:55) 0.7u 0.0s 0:00.83 93.9% 0+0k 0+0io 0pf+0w
 >   mixer     (19:31:56) 3.9u 0.1s 0:04.68 87.8% 0+0k 0+0io 0pf+0w
:ENERGY convergence:  0 0.0001 39.7416265550000000
:CHARGE convergence:  0 0.0000 7.8621609
ec cc and fc_conv 0 1 1

     cycle 4     (2015. 08. 26. (수) 19:32:01 KST)     (37/96 to go)

 >   lapw0 -p    (19:32:01) starting parallel lapw0 at 2015. 08. 26. 
(수) 19:32:01 KST
-------- .machine0 : processors
running lapw0 in single mode
181.7u 0.1s 3:02.15 99.8% 0+0k 0+0io 0pf+0w
 >   lapw1  -p        (19:35:03) starting parallel lapw1 at 2015. 08. 
26. (수) 19:35:03 KST
->  starting parallel LAPW1 jobs at 2015. 08. 26. (수) 19:35:03 KST
running LAPW1 in parallel mode (using .machines)
8 number_of_parallel_jobs
      localhost(4) 2586.8u 2.6s 43:23.09 99.4% 0+0k 0+0io 0pf+0w
      localhost(4) 2582.1u 2.2s 43:27.74 99.1% 0+0k 0+0io 0pf+0w
      localhost(4) 2585.8u 2.3s 43:11.17 99.8% 0+0k 0+0io 0pf+0w
      localhost(4) 2511.2u 2.3s 42:27.51 98.6% 0+0k 0+0io 0pf+0w
      localhost(4) 2530.6u 2.2s 42:49.29 98.5% 0+0k 0+0io 0pf+0w
      localhost(4) 2304.3u 3.1s 38:41.87 99.3% 0+0k 0+0io 0pf+0w
      localhost(4) 2471.1u 2.2s 41:30.60 99.3% 0+0k 0+0io 0pf+0w
      localhost(4) 2540.1u 2.7s 42:55.98 98.7% 0+0k 0+0io 0pf+0w
    Summary of lapw1para:
    localhost     k=32     user=20112     wallclock=20307.2
20117.0u 23.9s 43:28.94 771.9% 0+0k 0+600io 0pf+0w
 >   lapw2 -p         (20:18:32) running LAPW2 in parallel mode
       localhost 223.5u 1.4s 3:45.54 99.7% 0+0k 0+0io 0pf+0w
       localhost 210.1u 1.4s 3:33.38 99.1% 0+0k 0+0io 0pf+0w
       localhost 217.7u 1.4s 3:39.64 99.7% 0+0k 0+0io 0pf+0w
       localhost 241.3u 1.4s 4:04.55 99.2% 0+0k 0+0io 0pf+0w
       localhost 198.7u 1.3s 3:20.53 99.7% 0+0k 0+0io 0pf+0w
       localhost 225.8u 1.6s 3:48.14 99.7% 0+0k 0+0io 0pf+0w
       localhost 231.3u 1.4s 3:53.16 99.8% 0+0k 0+0io 0pf+0w
       localhost 221.8u 1.4s 3:45.30 99.0% 0+0k 0+0io 0pf+0w
    Summary of lapw2para:
    localhost     user=1770.2     wallclock=1790.24
1778.9u 12.1s 4:15.69 700.4% 0+0k 0+944io 0pf+0w
 >   lcore    (20:22:48) 0.7u 0.0s 0:00.79 96.2% 0+0k 0+0io 0pf+0w
 >   mixer     (20:22:49) 3.3u 0.1s 0:03.74 92.2% 0+0k 0+0io 0pf+0w
:ENERGY convergence:  0 0.0001 114.2739229950000000
:CHARGE convergence:  0 0.0000 11.7723529
ec cc and fc_conv 0 1 1

     cycle 5     (2015. 08. 26. (수) 20:22:53 KST)     (36/95 to go)

 >   lapw0 -p    (20:22:53) starting parallel lapw0 at 2015. 08. 26. 
(수) 20:22:53 KST
-------- .machine0 : processors
running lapw0 in single mode
164.1u 0.1s 2:44.56 99.8% 0+0k 0+0io 0pf+0w
 >   lapw1  -p        (20:25:37) starting parallel lapw1 at 2015. 08. 
26. (수) 20:25:37 KST
->  starting parallel LAPW1 jobs at 2015. 08. 26. (수) 20:25:37 KST
running LAPW1 in parallel mode (using .machines)
8 number_of_parallel_jobs
      localhost(4) 2525.1u 2.4s 42:18.11 99.5% 0+0k 0+0io 0pf+0w
      localhost(4) 2553.9u 2.1s 43:10.98 98.6% 0+0k 0+0io 0pf+0w
      localhost(4) 2479.3u 2.1s 41:41.79 99.1% 0+0k 0+0io 0pf+0w
      localhost(4) 2550.5u 2.0s 43:29.70 97.8% 0+0k 0+0io 0pf+0w
      localhost(4) 2469.4u 2.1s 41:27.45 99.3% 0+0k 0+0io 0pf+0w
      localhost(4) 2555.4u 2.1s 43:00.56 99.1% 0+0k 0+0io 0pf+0w
      localhost(4) 2536.0u 1.9s 43:03.91 98.2% 0+0k 0+0io 0pf+0w
      localhost(4) 2555.8u 2.2s 43:05.78 98.9% 0+0k 0+0io 0pf+0w
    Summary of lapw1para:
    localhost     k=32     user=20225.4     wallclock=20478.3
20231.0u 21.5s 43:30.90 775.6% 0+0k 0+600io 0pf+0w
 >   lapw2 -p         (21:09:08) running LAPW2 in parallel mode
       localhost 264.9u 1.4s 4:31.22 98.2% 0+0k 0+0io 0pf+0w
       localhost 255.9u 1.5s 4:18.03 99.8% 0+0k 0+0io 0pf+0w
       localhost 222.1u 1.3s 3:44.02 99.7% 0+0k 0+0io 0pf+0w
       localhost 269.1u 1.6s 4:34.77 98.5% 0+0k 0+0io 0pf+0w
       localhost 244.6u 1.5s 4:06.55 99.8% 0+0k 0+0io 0pf+0w
       localhost 286.2u 1.4s 4:55.16 97.4% 0+0k 0+0io 0pf+0w
       localhost 249.6u 1.5s 4:15.18 98.4% 0+0k 0+0io 0pf+0w
       localhost 242.7u 1.5s 4:07.49 98.7% 0+0k 0+0io 0pf+0w
    Summary of lapw2para:
    localhost     user=2035.1     wallclock=2072.42
2043.6u 12.6s 5:05.71 672.6% 0+0k 0+944io 0pf+0w
 >   lcore    (21:14:14) 0.6u 0.0s 0:00.67 94.0% 0+0k 0+0io 0pf+0w
 >   mixer     (21:14:15) 3.3u 0.1s 0:03.76 91.7% 0+0k 0+0io 0pf+0w
:ENERGY convergence:  0 0.0001 60.1611503500000000
:CHARGE convergence:  0 0.0000 8.5805961
ec cc and fc_conv 0 1 1

     cycle 6     (2015. 08. 26. (수) 21:14:19 KST)     (35/94 to go)

 >   lapw0 -p    (21:14:19) starting parallel lapw0 at 2015. 08. 26. 
(수) 21:14:19 KST
-------- .machine0 : processors
running lapw0 in single mode
164.5u 0.1s 2:44.97 99.7% 0+0k 0+0io 0pf+0w
 >   lapw1  -p        (21:17:04) starting parallel lapw1 at 2015. 08. 
26. (수) 21:17:04 KST
->  starting parallel LAPW1 jobs at 2015. 08. 26. (수) 21:17:04 KST
running LAPW1 in parallel mode (using .machines)
8 number_of_parallel_jobs
      localhost(4) 2685.6u 2.6s 44:56.74 99.6% 0+0k 0+96io 0pf+0w
      localhost(4) 2642.1u 2.2s 44:32.18 98.9% 0+0k 0+0io 0pf+0w
      localhost(4) 2537.8u 2.2s 42:28.62 99.6% 0+0k 0+0io 0pf+0w
      localhost(4) 2710.0u 2.4s 45:24.53 99.5% 0+0k 0+0io 0pf+0w
      localhost(4) 2499.7u 2.3s 42:07.24 99.0% 0+0k 0+0io 0pf+0w
      localhost(4) 2669.3u 2.3s 44:40.24 99.6% 0+0k 0+0io 0pf+0w
      localhost(4) 2729.5u 2.4s 45:55.05 99.1% 0+0k 0+0io 0pf+0w
      localhost(4) 2678.2u 2.1s 45:10.19 98.9% 0+0k 0+0io 0pf+0w
    Summary of lapw1para:
    localhost     k=32     user=21152.2     wallclock=21314.8
21157.8u 23.1s 45:56.28 768.4% 0+0k 0+696io 0pf+0w
 >   lapw2 -p         (22:03:00) running LAPW2 in parallel mode
       localhost 249.3u 1.8s 4:12.84 99.3% 0+0k 0+0io 0pf+0w
       localhost 271.0u 1.9s 4:33.84 99.6% 0+0k 0+0io 0pf+0w
       localhost 283.9u 2.1s 4:47.16 99.6% 0+0k 0+0io 0pf+0w
       localhost 264.8u 1.7s 4:27.55 99.6% 0+0k 0+0io 0pf+0w
       localhost 290.4u 1.9s 4:53.34 99.6% 0+0k 0+0io 0pf+0w
       localhost 294.9u 2.3s 4:59.15 99.3% 0+0k 0+0io 0pf+0w
       localhost 255.7u 1.8s 4:18.22 99.7% 0+0k 0+0io 0pf+0w
       localhost 254.9u 1.7s 4:17.99 99.4% 0+0k 0+0io 0pf+0w
    Summary of lapw2para:
    localhost     user=2164.9     wallclock=2190.09
2173.8u 16.1s 5:10.62 705.0% 0+0k 0+944io 0pf+0w
 >   lcore    (22:08:11) 0.6u 0.0s 0:00.69 97.1% 0+0k 0+0io 0pf+0w
 >   mixer     (22:08:12) 4.3u 0.2s 0:05.13 90.0% 0+0k 0+0io 0pf+0w
:ENERGY convergence:  0 0.0001 3.9070818400000000
:CHARGE convergence:  0 0.0000 10.3153690
ec cc and fc_conv 0 1 1

     cycle 7     (2015. 08. 26. (수) 22:08:17 KST)     (34/93 to go)

 >   lapw0 -p    (22:08:17) starting parallel lapw0 at 2015. 08. 26. 
(수) 22:08:17 KST
-------- .machine0 : processors
running lapw0 in single mode
186.7u 0.1s 3:07.23 99.8% 0+0k 0+0io 0pf+0w
 >   lapw1  -p        (22:11:24) starting parallel lapw1 at 2015. 08. 
26. (수) 22:11:24 KST
->  starting parallel LAPW1 jobs at 2015. 08. 26. (수) 22:11:24 KST
running LAPW1 in parallel mode (using .machines)
8 number_of_parallel_jobs
      localhost(4) 2864.9u 2.7s 48:40.63 98.1% 0+0k 0+0io 0pf+0w
      localhost(4) 2829.5u 2.2s 47:55.75 98.4% 0+0k 0+0io 0pf+0w
      localhost(4) 2772.3u 2.5s 47:05.47 98.2% 0+0k 0+0io 0pf+0w
      localhost(4) 2838.9u 2.2s 48:30.83 97.6% 0+0k 0+0io 0pf+0w
      localhost(4) 2771.7u 2.2s 47:01.68 98.3% 0+0k 0+0io 0pf+0w
      localhost(4) 2867.7u 2.5s 48:27.28 98.7% 0+0k 0+0io 0pf+0w
      localhost(4) 2824.7u 2.7s 47:57.10 98.2% 0+0k 0+0io 0pf+0w
      localhost(4) 2843.7u 2.7s 48:16.74 98.2% 0+0k 0+8io 0pf+0w
    Summary of lapw1para:
    localhost     k=32     user=22613.4     wallclock=23035.5
22619.4u 25.0s 48:41.85 775.0% 0+0k 0+608io 0pf+0w
 >   lapw2 -p         (23:00:06) running LAPW2 in parallel mode
       localhost 284.4u 2.0s 4:51.43 98.3% 0+0k 0+0io 0pf+0w
       localhost 291.5u 1.9s 4:57.96 98.4% 0+0k 0+0io 0pf+0w
       localhost 260.9u 2.0s 4:23.56 99.7% 0+0k 0+0io 0pf+0w
       localhost 293.7u 2.0s 4:57.04 99.5% 0+0k 0+0io 0pf+0w
       localhost 292.2u 1.9s 4:55.12 99.7% 0+0k 0+0io 0pf+0w
       localhost 262.6u 1.9s 4:25.88 99.5% 0+0k 0+0io 0pf+0w
       localhost 226.5u 1.8s 3:49.11 99.6% 0+0k 0+0io 0pf+0w
       localhost 296.6u 1.9s 5:03.27 98.4% 0+0k 0+0io 0pf+0w
    Summary of lapw2para:
    localhost     user=2208.4     wallclock=2243.37
2217.4u 16.2s 5:15.11 708.8% 0+0k 0+944io 0pf+0w
 >   lcore    (23:05:21) 0.7u 0.0s 0:00.79 97.4% 0+0k 0+0io 0pf+0w
 >   mixer     (23:05:23) 4.3u 0.1s 0:04.96 91.5% 0+0k 0+0io 0pf+0w
:ENERGY convergence:  0 0.0001 13.5467074350000000
:CHARGE convergence:  0 0.0000 7.4937242
ec cc and fc_conv 0 1 1

     cycle 8     (2015. 08. 26. (수) 23:05:28 KST)     (33/92 to go)

 >   lapw0 -p    (23:05:28) starting parallel lapw0 at 2015. 08. 26. 
(수) 23:05:28 KST
-------- .machine0 : processors
running lapw0 in single mode
186.2u 0.1s 3:06.70 99.8% 0+0k 0+0io 0pf+0w
 >   lapw1  -p        (23:08:35) starting parallel lapw1 at 2015. 08. 
26. (수) 23:08:35 KST
->  starting parallel LAPW1 jobs at 2015. 08. 26. (수) 23:08:35 KST
running LAPW1 in parallel mode (using .machines)
8 number_of_parallel_jobs
      localhost(4) 2893.4u 3.2s 48:35.86 99.3% 0+0k 0+0io 0pf+0w
      localhost(4) 2804.4u 2.4s 47:00.77 99.5% 0+0k 0+0io 0pf+0w
      localhost(4) 2810.1u 2.4s 47:02.46 99.6% 0+0k 0+0io 0pf+0w
      localhost(4) 2836.5u 2.3s 47:41.67 99.2% 0+0k 0+0io 0pf+0w
      localhost(4) 2827.5u 2.5s 47:42.39 98.8% 0+0k 0+0io 0pf+0w
      localhost(4) 2847.3u 2.3s 48:02.07 98.8% 0+0k 0+0io 0pf+0w
      localhost(4) 2846.3u 2.5s 47:50.00 99.2% 0+0k 0+0io 0pf+0w
      localhost(4) 2745.7u 2.7s 46:09.69 99.2% 0+0k 0+0io 0pf+0w
    Summary of lapw1para:
    localhost     k=32     user=22611.2     wallclock=22804.9
22616.9u 25.3s 48:37.01 776.2% 0+0k 0+600io 0pf+0w
 >   lapw2 -p         (23:57:12) running LAPW2 in parallel mode
       localhost 273.3u 1.9s 4:36.03 99.7% 0+0k 0+0io 0pf+0w
       localhost 293.1u 2.0s 4:55.70 99.8% 0+0k 0+0io 0pf+0w
       localhost 294.3u 2.0s 4:57.19 99.7% 0+0k 0+0io 0pf+0w
       localhost 258.3u 2.0s 4:21.62 99.5% 0+0k 0+0io 0pf+0w
       localhost 298.5u 2.1s 5:01.30 99.7% 0+0k 0+0io 0pf+0w
       localhost 269.4u 1.8s 4:31.91 99.7% 0+0k 0+0io 0pf+0w
       localhost 291.0u 2.0s 4:53.62 99.8% 0+0k 0+0io 0pf+0w
       localhost 239.5u 1.9s 4:02.73 99.4% 0+0k 0+0io 0pf+0w
    Summary of lapw2para:
    localhost     user=2217.4     wallclock=2240.1
2225.2u 16.6s 5:11.42 719.8% 0+0k 0+944io 0pf+0w
 >   lcore    (00:02:23) 0.7u 0.0s 0:00.82 96.3% 0+0k 0+0io 0pf+0w
 >   mixer     (00:02:25) 4.6u 0.2s 0:05.27 91.4% 0+0k 0+0io 0pf+0w
:ENERGY convergence:  0 0.0001 8.1820948200000000
:CHARGE convergence:  0 0.0000 8.5495585
ec cc and fc_conv 0 1 1

     cycle 9     (2015. 08. 27. (목) 00:02:30 KST)     (32/91 to go)

 >   lapw0 -p    (00:02:30) starting parallel lapw0 at 2015. 08. 27. 
(목) 00:02:30 KST
-------- .machine0 : processors
running lapw0 in single mode
188.7u 0.1s 3:09.13 99.8% 0+0k 0+0io 0pf+0w
 >   lapw1  -p        (00:05:39) starting parallel lapw1 at 2015. 08. 
27. (목) 00:05:39 KST
->  starting parallel LAPW1 jobs at 2015. 08. 27. (목) 00:05:39 KST
running LAPW1 in parallel mode (using .machines)
8 number_of_parallel_jobs
      localhost(4) 2927.2u 3.7s 49:10.73 99.3% 0+0k 0+0io 0pf+0w
      localhost(4) 2842.0u 2.3s 47:49.72 99.1% 0+0k 0+0io 0pf+0w
      localhost(4) 2825.1u 2.8s 47:20.44 99.5% 0+0k 0+0io 0pf+0w
      localhost(4) 2904.4u 2.5s 48:32.38 99.8% 0+0k 0+0io 0pf+0w
      localhost(4) 2895.6u 2.3s 48:46.56 99.0% 0+0k 0+0io 0pf+0w
      localhost(4) 2829.9u 2.6s 47:42.58 98.9% 0+0k 0+0io 0pf+0w
      localhost(4) 2818.2u 2.4s 47:25.09 99.1% 0+0k 0+0io 0pf+0w
      localhost(4) 2890.3u 2.9s 48:52.92 98.6% 0+0k 0+8io 0pf+0w
    Summary of lapw1para:
    localhost     k=32     user=22932.7     wallclock=23140.4
22938.7u 26.6s 49:11.84 778.0% 0+0k 0+608io 0pf+0w
 >   lapw2 -p         (00:54:51) running LAPW2 in parallel mode
       localhost 257.4u 1.8s 4:19.97 99.7% 0+0k 0+0io 0pf+0w
       localhost 225.7u 1.7s 3:48.00 99.7% 0+0k 0+0io 0pf+0w
       localhost 249.5u 1.8s 4:11.87 99.7% 0+0k 0+0io 0pf+0w
       localhost 255.4u 1.9s 4:18.02 99.7% 0+0k 0+0io 0pf+0w
       localhost 260.8u 2.1s 4:23.56 99.7% 0+0k 0+0io 0pf+0w
       localhost 248.5u 1.8s 4:10.80 99.8% 0+0k 0+0io 0pf+0w
       localhost 276.6u 1.6s 4:38.78 99.8% 0+0k 0+0io 0pf+0w
       localhost 254.5u 1.8s 4:16.82 99.8% 0+0k 0+0io 0pf+0w
    Summary of lapw2para:
    localhost     user=2028.4     wallclock=2047.82
**  LAPW2 crashed!
2029.2u 15.0s 4:41.11 727.2% 0+0k 0+928io 0pf+0w
error: command   /home/neol/WIEN2k/lapw2para lapw2.def   failed

 >   stop error




More information about the Wien mailing list