[Wien] Lapw1_mpi
Florent Boucher
Florent.Boucher at cnrs-imn.fr
Mon Dec 15 09:48:27 CET 2003
Dear Peter,
I ran the test with the "old" lapw1_mpi and it works.
Here are the performance figures. I chose a matrix that is not too big, just
for testing.
If you want a test on a larger system, just let me know.
Regards
Florent
MATRIX SIZE 5908
2 CPU :
Working with 1 x 2 Process Grid
My ID = 0
DESCPANEL, DESCHS, DESCZ (2): 0 1 1
Time for al,bl: 0.207031250000000
nlo 644
Time for los: 5.25390625000000
Time for LOOP 260: 71.0585937500000
HNS: Loop 140: 1.1 Loop 90: 16.5 matrix update: 163.3 loop 30: 15.4 (wall time)
Seclr4(Cholesky complete (CPU)) : 20.988 3275.091 Mflops
Seclr4(Cholesky complete (WALL)) : 20.990 3274.894 Mflops
Seclr4(Transform to eig.problem (CPU)) : 63.516 3246.690 Mflops
Seclr4(Transform to eig.problem (WALL)) : 63.521 3246.440 Mflops
Seclr4(Compute eigenvalues (CPU)) : 269.301 1020.993 Mflops
Seclr4(Compute eigenvalues (WALL)) : 269.299 1020.998 Mflops
Seclr4(Backtransform (CPU)) : 5.832 888.766 Mflops
Seclr4(Backtransform (WALL)) : 5.832 888.793 Mflops
TIME HAMILT (CPU) = 76.7, HNS = 181.0, DIAG = 359.6
TIME HAMILT (WALL) = 76.7, HNS = 181.0, DIAG = 359.6
4 CPU :
Elapsed time : 355 s
Memory : 4 x 398 MB = 1592 MB
Working with 2 x 2 Process Grid
My ID = 0
DESCPANEL, DESCHS, DESCZ (2): 0 1 1
Time for al,bl: 0.208984375000000
nlo 644
Time for los: 6.06445312500000
Time for LOOP 260: 45.3515625000000
HNS: Loop 140: 1.1 Loop 90: 16.6 matrix update: 86.6 loop 30: 15.5 (wall time)
Seclr4(Cholesky complete (CPU)) : 10.967 6267.876 Mflops
Seclr4(Cholesky complete (WALL)) : 10.967 6267.665 Mflops
Seclr4(Transform to eig.problem (CPU)) : 33.383 6177.298 Mflops
Seclr4(Transform to eig.problem (WALL)) : 33.387 6176.576 Mflops
Seclr4(Compute eigenvalues (CPU)) : 130.611 2105.132 Mflops
Seclr4(Compute eigenvalues (WALL)) : 130.611 2105.138 Mflops
Seclr4(Backtransform (CPU)) : 3.006 1724.403 Mflops
Seclr4(Backtransform (WALL)) : 3.007 1723.913 Mflops
TIME HAMILT (CPU) = 51.9, HNS = 104.3, DIAG = 178.0
TIME HAMILT (WALL) = 51.9, HNS = 104.3, DIAG = 178.0
K= 0.50000 0.50000 0.50000 1
MATRIX SIZE 5908 WEIGHT= 1.00 PGR:
8 CPU :
Elapsed time : 242 s
Memory : 8 x 294 MB = 2352 MB
DESCPANEL, DESCHS, DESCZ (2): 0 1 1
Time for al,bl: 0.208984375000000
nlo 644
Time for los: 5.95117187500000
Time for LOOP 260: 34.2382812500000
HNS: Loop 140: 1.1 Loop 90: 16.6 matrix update: 53.0 loop 30: 15.5 (wall time)
Seclr4(Cholesky complete (CPU)) : 6.273 10957.075 Mflops
Seclr4(Cholesky complete (WALL)) : 6.275 10954.326 Mflops
Seclr4(Transform to eig.problem (CPU)) : 19.354 10655.200 Mflops
Seclr4(Transform to eig.problem (WALL)) : 19.355 10654.167 Mflops
Seclr4(Compute eigenvalues (CPU)) : 83.008 3312.388 Mflops
Seclr4(Compute eigenvalues (WALL)) : 83.006 3312.453 Mflops
Seclr4(Backtransform (CPU)) : 2.012 2576.559 Mflops
Seclr4(Backtransform (WALL)) : 2.012 2576.640 Mflops
TIME HAMILT (CPU) = 40.6, HNS = 70.7, DIAG = 110.6
TIME HAMILT (WALL) = 40.6, HNS = 70.7, DIAG = 110.6
K= 0.50000 0.50000 0.50000 1
16 CPU :
Elapsed time : 146 s
Memory : 16 x 240 MB = 3840 MB
Working with 4 x 4 Process Grid
My ID = 0
DESCPANEL, DESCHS, DESCZ (2): 0 1 1
Time for al,bl: 0.212890625000000
nlo 644
Time for los: 4.41992187500000
Time for LOOP 260: 14.3320312500000
HNS: Loop 140: 1.1 Loop 90: 17.3 matrix update: 30.3 loop 30: 16.2 (wall time)
Seclr4(Cholesky complete (CPU)) : 3.400 20214.891 Mflops
Seclr4(Cholesky complete (WALL)) : 3.401 20213.456 Mflops
Seclr4(Transform to eig.problem (CPU)) : 11.016 18720.279 Mflops
Seclr4(Transform to eig.problem (WALL)) : 11.017 18717.209 Mflops
Seclr4(Compute eigenvalues (CPU)) : 46.498 5913.240 Mflops
Seclr4(Compute eigenvalues (WALL)) : 46.498 5913.262 Mflops
Seclr4(Backtransform (CPU)) : 0.953 5438.230 Mflops
Seclr4(Backtransform (WALL)) : 0.953 5438.961 Mflops
TIME HAMILT (CPU) = 19.2, HNS = 48.0, DIAG = 61.9
TIME HAMILT (WALL) = 19.2, HNS = 48.0, DIAG = 61.9
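
To compare the four runs at a glance, the wall times from the "TIME HAMILT (WALL)" lines can be turned into speedup and parallel efficiency. The following is only a small sketch: the names TIMINGS and scaling are made up for this illustration and are not part of WIEN2k, and the 2 CPU run is taken as the baseline because no single-CPU run is reported here.

```python
# Wall-clock times in seconds, copied from the "TIME HAMILT (WALL)" lines
# of the 2, 4, 8 and 16 CPU runs (matrix size 5908).
TIMINGS = {
    2: {"HAMILT": 76.7, "HNS": 181.0, "DIAG": 359.6},
    4: {"HAMILT": 51.9, "HNS": 104.3, "DIAG": 178.0},
    8: {"HAMILT": 40.6, "HNS": 70.7, "DIAG": 110.6},
    16: {"HAMILT": 19.2, "HNS": 48.0, "DIAG": 61.9},
}


def scaling(timings, base=2):
    """Return {ncpu: (total_seconds, speedup, efficiency)} relative to `base` CPUs.

    Assumption: the total is simply HAMILT + HNS + DIAG; anything the job
    spends outside these three steps is not counted.
    """
    base_total = sum(timings[base].values())
    rows = {}
    for ncpu in sorted(timings):
        total = sum(timings[ncpu].values())
        speedup = base_total / total
        # Ideal speedup relative to the baseline run is ncpu / base.
        efficiency = speedup / (ncpu / base)
        rows[ncpu] = (total, speedup, efficiency)
    return rows


if __name__ == "__main__":
    for ncpu, (total, speedup, eff) in scaling(TIMINGS).items():
        print(f"{ncpu:2d} CPU: total {total:6.1f} s, "
              f"speedup {speedup:4.2f}, efficiency {eff:4.2f}")
```

The same function can be applied to a single step (for example only the DIAG entries) to see which part of lapw1_mpi scales best.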
--
--------------------------------------------------------------------------
| Florent BOUCHER | |
| Institut des Matériaux Jean Rouxel | Mailto:Florent.Boucher at cnrs-imn.fr |
| 2, rue de la Houssinière | Phone: (33) 2 40 37 39 24 |
| BP 32229 | Fax: (33) 2 40 37 39 95 |
| 44322 NANTES CEDEX 3 (FRANCE) | http://www.cnrs-imn.fr |
--------------------------------------------------------------------------
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Lapw1_mpi_2.pdf
Type: application/pdf
Size: 18491 bytes
Desc: not available
Url : http://zeus.theochem.tuwien.ac.at/pipermail/wien/attachments/20031215/5c4c0cb1/Lapw1_mpi_2.pdf
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Lapw1_mpi_1.pdf
Type: application/pdf
Size: 19167 bytes
Desc: not available
Url : http://zeus.theochem.tuwien.ac.at/pipermail/wien/attachments/20031215/5c4c0cb1/Lapw1_mpi_1.pdf