**parallel questions** |
- Date: 2006/05/01 06:02
- Name:
**Jeason**
- Dear Prof. Ozaki,
I think I have succesfully compiled the code in parallel. The enviroument is: icc9.0, acml3.1, lam7.0.6, Opeteron2.0G
But when doing test, I find two problems:
1. When it parallel running, I require the same or more memory /per node than series run.
2. When I running the auto-test run, I get result:
1)running in single cpu, series run: 1 input_example/Benzene.dat Elapsed time(s)= 29.09 diff Utot= 0.000000000000 diff Force= 0.000000000001 2 input_example/C60.dat Elapsed time(s)= 364.59 diff Utot= 0.000000000001 diff Force= 0.000000000001 3 input_example/Cdia.dat Elapsed time(s)= 24.06 diff Utot= 0.000000000001 diff Force= 0.000000002656 4 input_example/CO.dat Elapsed time(s)= 130.29 diff Utot= 0.000000000000 diff Force= 0.000000000033 5 input_example/Cr2.dat Elapsed time(s)= 51.81 diff Utot= 0.000000000000 diff Force= 0.000000000000 6 input_example/Crys-MnO.dat Elapsed time(s)= 523.67 diff Utot= 0.000000000032 diff Force= 0.000000000060 7 input_example/GaAs.dat Elapsed time(s)= 337.79 diff Utot= 0.000000000074 diff Force= 0.000000001227 8 input_example/Glycine.dat Elapsed time(s)= 50.35 diff Utot= 0.000000000001 diff Force= 0.000000000001 9 input_example/Graphite4.dat Elapsed time(s)= 22.74 diff Utot= 0.000000000064 diff Force= 0.000000000213 10 input_example/H2O.dat Elapsed time(s)= 24.65 diff Utot= 0.000000000000 diff Force= 0.000000000001 11 input_example/H2O-EF.dat Elapsed time(s)= 37.07 diff Utot= 0.000000000000 diff Force= 0.000000000197 12 input_example/HYb.dat Elapsed time(s)= 100.95 diff Utot= 0.000000000002 diff Force= 0.000000000000 13 input_example/Methane.dat Elapsed time(s)= 18.26 diff Utot= 0.000000000045 diff Force= 0.000000004540 1 input_example/Benzene.dat Elapsed time(s)= 68.58 diff Utot= 0.000000000000 diff Force= 0.000000000001 2 input_example/C60.dat Elapsed time(s)= 381.21 diff Utot= 0.000000000010 diff Force= 0.000000000005 3 input_example/Cdia.dat Elapsed time(s)= 19.10 diff Utot= 0.000000000000 diff Force= 0.000000002025 4 input_example/CO.dat Elapsed time(s)= 113.78 diff Utot= 0.000000000000 diff Force= 0.000000000035 5 input_example/Cr2.dat Elapsed time(s)= 41.63 diff Utot= 0.000000000001 diff Force= 0.000000000000 6 input_example/Crys-MnO.dat Elapsed time(s)= 268.52 diff Utot= 0.000000000007 diff Force= 0.000000000219 7 input_example/GaAs.dat Elapsed time(s)= 404.82 diff Utot= 0.000000000123 diff Force= 0.000000000756 8 input_example/Glycine.dat Elapsed time(s)= 81.26 diff Utot= 0.000000000001 diff Force= 0.000000000001 9 input_example/Graphite4.dat Elapsed time(s)= 18.24 diff Utot= 0.000000000012 diff Force= 0.000000001777 10 input_example/H2O.dat Elapsed time(s)= 19.13 diff Utot= 0.000000000000 diff Force= 0.000000000001 11 input_example/H2O-EF.dat Elapsed time(s)= 26.73 diff Utot= 0.000000000002 diff Force= 0.000000000140 12 input_example/HYb.dat Elapsed time(s)= 121.02 diff Utot= 0.000000000001 diff Force= 0.000000000001 13 input_example/Methane.dat Elapsed time(s)= 23.37 diff Utot= 0.000000000001 diff Force= 0.000000000095 14 input_example/Mol_MnO.dat Elapsed time(s)= 169.46 diff Utot= 0.000000000013 diff Force= 0.000000000009
Which almost the same effiency. I want to know why
Could you help me?
Thanks 14 input_example/Mol_MnO.dat Elapsed time(s)= 161.71 diff Utot= 0.000000000026 diff Force= 0.000000000046
2) in 6 cpus, parallel run:
| |