[CPMD-list] essl errors with lapack and essl on IBM_JS20.
Alessandro Curioni
cur at zurich.ibm.com
Fri Dec 8 17:17:44 CET 2006
Yes -
it looks like you are "merging" too much -
I suggest to compile lapack on you machine and then link with -lapack
-lessl ( as inthe IBM-JS20-ESSL-MPI conf)
if you want to use atlas from Marenostrum - please try to build
appropriate lapack merging the atlas lapack subset and then use/modify
the IBM-JS20-ATLAS-MPI conf.
Best performance is obtained using goto ( if you have it) + std lapack +
ESSL as in IBM-JS20-GOTO-MPI .
Best Regards,
Alessandro
Alessandro CURIONI, PhD
Research Staff Member
Computational Biochemistry and Material Science group
IBM Research Division - Zurich Research Laboratory
Saumerstrasse 4
8003 Rueschlikon - Switzerland
e-mail: cur at zurich.ibm.com
www: www.zurich.ibm.com
Tel: +41-1-7248633
Fax: +41-1-7248958
Joseph Hargitai <joseph.hargitai at nyu.edu>
Sent by: cpmd-list-bounces at cpmd.org
12/08/2006 05:02 PM
To
cpmd-list at cpmd.org
cc
Subject
[CPMD-list] essl errors with lapack and essl on IBM_JS20.
Dear All:
We are testing some cpmd runs on our IBM JS20 cluster. (PPC970, Redhat
4, Myrinet) The compile is
with lapack essl and mpi. Some runs work okay - others have this error,
which, unless we are mistaken can be traced back to essl
errors/misinterpretations.
We compiled with the following flags (had to include the qextname -
otherwise we had issues compiling. Maybe there is a better way)
We also tried, based on an old posting - llapack lessl llapack - link
ordering - same errors and both -DFFT_ESSL and -DFFT_DEFAULT
The lapack is standard, we also used other compiles with Atlas specially
compiled for JS20s - from Mare Nostrum. We merged the lapack libs with
the standard /usr/lib64 (maybe this is the issue - we would need a JS20
compiled Lapack)
(We are okay compiling on Sunfire 8 dual core cpus with 48GB memory -
using Pathscale and ACML.)
Compile script followed by the runtime error:
#INFO#
#INFO# IBM-JS20-MPI-64bit-essl
#INFO# you need lapack essl and goto 64 bit libraries
#INFO#
#INFO# A. Curioni
#INFO#
IRAT=2
FFLAGS=' -q64 -qmaxmem=32768 -qtune=ppc970 -qarch=ppc970
-qextname=dgelss:dsbevx:dstev:dsyev:dsyevx:dsysv:dtrtrs:printmemsize:zhbevx:zheev:zheevx:zhegv
'
FFLAGS_GROMOS=' -qarch=ppc970 -qdpc'
LFLAGS='-q64 -L/usr/local/lib -llapack -lessl \
-qarch=ppc970 '
CFLAGS=' -q64 -qarch=ppc970 '
CPP=' /opt/ibmcmp/xlf/9.1/exe/cpp -P'
CPPFLAGS=' -D__IBM -DLAPACK -DFFT_ESSL -D__PWRLinux -DMALLOC8
-DPOINTER8 -DPARALLEL=PARALLEL -DMP_LIBRARY=__MPI '
NOOPT_FLAG=' -O0 '
NOOPT_OBJS=' control.o memory.o freem.o'
AR='/usr/bin/ar ruv'
RANLIB='/usr/bin/ranlib'
if [ $debug ]; then
CC='xlc_r -c -g -C -qflttrap'
FC='/usr/local/mpich/1.2.6..0.94/mx/ppc64/smp/ibmcmp64/ssh/bin/mpif77 -c
-g -C -qflttrap'
LD='/usr/local/mpich/1.2.6..0.94/mx/ppc64/smp/ibmcmp64/ssh/bin/mpif77 -g
-C -qflttrap'
else
CC='xlc_r -c -O3 -qstrict'
FC='/usr/local/mpich/1.2.6..0.94/mx/ppc64/smp/ibmcmp64/ssh/bin/mpif77 -c
-O3 -qstrict'
LD='/usr/local/mpich/1.2.6..0.94/mx/ppc64/smp/ibmcmp64/ssh/bin/mpif77
-O3 -qstrict'
fi
-----------
Error running the code:
****************************************************************
* ATOM MASS RAGGIO NLCC PSEUDOPOTENTIAL *
* C 12.0112 1.2000 NO GOEDECKER S NONLOCAL *
* P NONLOCAL *
* D LOCAL *
* H 1.0080 1.2000 NO S LOCAL *
* Si 28.0860 1.2000 NO GOEDECKER S NONLOCAL *
* P NONLOCAL *
* D LOCAL *
****************************************************************
PARAPARAPARAPARAPARAPARAPARAPARAPARAPARAPARAPARAPARAPARAPARAPARA
NCPU NGW NHG PLANES GXRAYS HXRAYS ORBITALS Z-PLANES
0 4570 36483 5 233 929 8 1
1 4569 36527 5 234 930 8 1
2 4567 36529 5 234 930 8 1
3 4567 36531 5 234 930 8 1
4 4567 36539 5 234 930 8 1
5 4569 36547 5 234 930 8 1
6 4569 36533 5 234 930 9 1
7 4569 36533 5 234 930 8 1
8 4569 36527 5 234 930 8 1
9 4569 36533 5 234 930 8 1
10 4567 36537 5 234 930 8 1
11 4567 36535 5 234 930 8 1
12 4565 36543 5 234 930 8 1
13 4565 36541 5 234 930 8 1
14 4565 36537 5 234 930 8 1
15 4569 36531 5 234 930 8 1
16 4570 36537 5 232 930 8 1
17 4570 36558 5 232 932 8 1
18 4572 36550 5 232 932 9 1
19 4572 36526 5 232 932 8 1
20 4574 36538 5 232 932 8 1
21 4572 36538 5 232 932 8 1
22 4568 36544 5 232 932 8 1
23 4566 36538 5 232 932 8 1
G=0 COMPONENT ON PROCESSOR : 0
PARAPARAPARAPARAPARAPARAPARAPARAPARAPARAPARAPARAPARAPARAPARAPARA
*** LOADPA| SIZE OF THE PROGRAM IS 11164/ 95108 kBYTES ***
OPENMPOPENMPOPENMPOPENMPOPENMPOPENMPOPENMPOPENMPOPENMPOPENMPOPEN
NUMBER OF CPUS PER TASK 1
OPENMPOPENMPOPENMPOPENMPOPENMPOPENMPOPENMPOPENMPOPENMPOPENMPOPEN
*** RGGEN| SIZE OF THE PROGRAM IS 12296/ 96240 kBYTES ***
************************** SUPERCELL ***************************
SYMMETRY: ORTHORHOMBIC
LATTICE CONSTANT(a.u.): 29.23690
CELL DIMENSION: 29.2369 1.0000 2.0533 .0000 .0000 .0000
VOLUME(OMEGA IN BOHR^3): 51315.70009
LATTICE VECTOR A1(BOHR): 29.2369 .0000 .0000
LATTICE VECTOR A2(BOHR): .0000 29.2369 .0000
LATTICE VECTOR A3(BOHR): .0000 .0000 60.0327
RECIP. LAT. VEC. B1(2Pi/BOHR): .0342 .0000 .0000
RECIP. LAT. VEC. B2(2Pi/BOHR): .0000 .0342 .0000
RECIP. LAT. VEC. B3(2Pi/BOHR): .0000 .0000 .0167
REAL SPACE MESH: 120 120 252
WAVEFUNCTION CUTOFF(RYDBERG): 40.00000
DENSITY CUTOFF(RYDBERG): (DUAL= 4.00) 160.00000
NUMBER OF PLANE WAVES FOR WAVEFUNCTION CUTOFF: 109647
NUMBER OF PLANE WAVES FOR DENSITY CUTOFF: 876835
****************************************************************
*** RINFORCE| SIZE OF THE PROGRAM IS 16856/ 101200 kBYTES ***
*** FFTPRP| SIZE OF THE PROGRAM IS 20940/ 104392 kBYTES ***
GENERATE ATOMIC BASIS SET
C SLATER ORBITALS
2S ALPHA= 1.6083 OCCUPATION= 2.00
2P ALPHA= 1.5679 OCCUPATION= 2.00
H SLATER ORBITALS
1S ALPHA= 1.0000 OCCUPATION= 1.00
Si SLATER ORBITALS
3S ALPHA= 1.6344 OCCUPATION= 2.00
3P ALPHA= 1.4284 OCCUPATION= 2.00
INITIALIZATION TIME: 2.22 SECONDS
*** WFOPTS| SIZE OF THE PROGRAM IS 26228/ 168252 kBYTES ***
*** PHFAC| SIZE OF THE PROGRAM IS 26492/ 197268 kBYTES ***
*** ATOMWF| SIZE OF THE PROGRAM IS 37528/ 227624 kBYTES ***
ATRHO| CHARGE(R-SPACE): 388.000000 (G-SPACE): 388.000000
DSYGV : 2538-2015
The number of elements (ARG NO. 11) in a work array must be greater than
or equal to (2124173152).
DSYGV : 2538-2005
The size of the leading dimension (ARG NO. 5) of an array must be greater
than zero.
DSYGV : 2538-2006
The number of rows (ARG NO. 9) of a matrix must be less than or equal
to the size of the leading dimension (ARG NO. 5) of its array.
DSYGV : 2538-2006
The number of rows (ARG NO. 9) of a matrix must be less than or equal
to the size of the leading dimension (ARG NO. 8) of its array.
DSYGV : 2538-2099
End of input argument error reporting. For more information, refer to
Engineering and Scientific Subroutine Library Guide and Reference
(SA22-7904).
DSYGV : 2538-2604
Execution terminating due to error count for error number 2099.
DSYGV : 2538-2605
Message summary: 2005 - 1
DSYGV : 2538-2605
Message summary: 2006 - 2
DSYGV : 2538-2605
Message summary: 2015 - 1
DSYGV : 2538-2605
Message summary: 2099 - 1
best,
joseph hargitai
j
_______________________________________________
CPMD-list mailing list
CPMD-list at cpmd.org
http://cpmd.org/mailman/listinfo/cpmd-list
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://cpmd.org/pipermail/cpmd-list/attachments/20061208/d362378c/attachment.html
More information about the CPMD-list
mailing list