[CPMD-list] essl errors with lapack and essl on IBM_JS20.

Alessandro Curioni cur at zurich.ibm.com
Fri Dec 8 21:43:17 CET 2006


Joseph

 the problem is only how and with which compiler your "std" lapack were 
compiled .....
you can use the goto using JS20 compiled lapack - but then you have to had 
-qextname for the  blas in the goto ( as is clear from the conf file) 
because   the goto where not compiled with xlf ....


ciao

Alessandro CURIONI, PhD
Research Staff Member
Computational Biochemistry and Material Science group
IBM Research Division - Zurich Research Laboratory
Saumerstrasse 4
8003 Rueschlikon - Switzerland
e-mail: cur at zurich.ibm.com
www:    www.zurich.ibm.com
Tel: +41-1-7248633
Fax: +41-1-7248958




Joseph Hargitai <joseph.hargitai at nyu.edu> 
Sent by: cpmd-list-bounces at cpmd.org
12/08/2006 05:45 PM

To
Alessandro Curioni <cur at zurich.ibm.com>, cpmd-list at cpmd.org
cc

Subject
Re: [CPMD-list] essl errors with lapack and essl on IBM_JS20.






Yesterday with the help of Axel we made the lapack essl version work by
adding dsygv to the -qextname list.  However, as Axel pointed out - the
-qextname list should only really have -printmemsize - the rest should
be picked up without an issue. (which is not our experience)

For my understanding: 

JS20 compiled Lapack with essl should compile without a long -qextname
list vs std Lapack - which we are using in this case?

As far as Goto - we have it - just to make sure - use standard Lapack
with Goto, not JS20? 

j

Yes - 
it looks like you are  "merging"  too much - 

I suggest to compile lapack on you machine and  then link with -lapack 
-lessl   ( as inthe IBM-JS20-ESSL-MPI conf) 

if you want to use atlas from Marenostrum -  please try to build 
appropriate lapack  merging the atlas lapack subset and then use/modify 
the IBM-JS20-ATLAS-MPI conf. 

Best performance is obtained using goto ( if you have it) + std lapack + 
ESSL as in IBM-JS20-GOTO-MPI . 

Best Regards, 

Alessandro 


Alessandro CURIONI, PhD
Research Staff Member
Computational Biochemistry and Material Science group
IBM Research Division - Zurich Research Laboratory
Saumerstrasse 4
8003 Rueschlikon - Switzerland
e-mail: cur at zurich.ibm.com
www:    www.zurich.ibm.com
Tel: +41-1-7248633
Fax: +41-1-7248958



Joseph Hargitai <joseph.hargitai at nyu.edu> 
Sent by: cpmd-list-bounces at cpmd.org 
12/08/2006 05:02 PM 


To
cpmd-list at cpmd.org 
cc

Subject
[CPMD-list] essl errors with lapack and essl on IBM_JS20.










Dear All:

We are testing some cpmd runs on our IBM JS20 cluster. (PPC970, Redhat
4, Myrinet) The compile is
with lapack essl and mpi. Some runs work okay - others have this error,
which, unless we are mistaken can be traced back to essl
errors/misinterpretations.

We compiled with the following flags (had to include the qextname -
otherwise we had issues compiling. Maybe there is a better way) 

We also tried, based on an old posting - llapack lessl llapack  - link
ordering - same errors and both -DFFT_ESSL and -DFFT_DEFAULT

The lapack is standard, we also used other compiles with Atlas specially
compiled for JS20s - from Mare Nostrum. We merged the lapack libs with
the standard /usr/lib64 (maybe this is the issue - we would need a JS20
compiled Lapack)

(We are okay compiling on Sunfire 8 dual core cpus with 48GB memory -
using Pathscale and ACML.)



Compile script followed by the runtime error:

#INFO# 
#INFO#  IBM-JS20-MPI-64bit-essl
#INFO#  you need lapack essl and goto 64 bit libraries
#INFO# 
#INFO#  A. Curioni
#INFO# 
    IRAT=2
    FFLAGS=' -q64 -qmaxmem=32768 -qtune=ppc970 -qarch=ppc970
-qextname=dgelss:dsbevx:dstev:dsyev:dsyevx:dsysv:dtrtrs:printmemsize:zhbevx:zheev:zheevx:zhegv
'
    FFLAGS_GROMOS=' -qarch=ppc970 -qdpc'
    LFLAGS='-q64 -L/usr/local/lib  -llapack -lessl \
           -qarch=ppc970 '
    CFLAGS=' -q64  -qarch=ppc970 '
    CPP=' /opt/ibmcmp/xlf/9.1/exe/cpp -P'
    CPPFLAGS=' -D__IBM -DLAPACK -DFFT_ESSL -D__PWRLinux -DMALLOC8
-DPOINTER8 -DPARALLEL=PARALLEL -DMP_LIBRARY=__MPI  '
    NOOPT_FLAG=' -O0 '
    NOOPT_OBJS=' control.o memory.o freem.o'
    AR='/usr/bin/ar ruv'
    RANLIB='/usr/bin/ranlib'
    if [ $debug ]; then
      CC='xlc_r -c -g -C -qflttrap'
 
FC='/usr/local/mpich/1.2.6..0.94/mx/ppc64/smp/ibmcmp64/ssh/bin/mpif77 -c
-g -C -qflttrap'
 
LD='/usr/local/mpich/1.2.6..0.94/mx/ppc64/smp/ibmcmp64/ssh/bin/mpif77 -g
-C -qflttrap'
    else
      CC='xlc_r -c -O3 -qstrict'
 
FC='/usr/local/mpich/1.2.6..0.94/mx/ppc64/smp/ibmcmp64/ssh/bin/mpif77 -c
-O3 -qstrict'
 
LD='/usr/local/mpich/1.2.6..0.94/mx/ppc64/smp/ibmcmp64/ssh/bin/mpif77
-O3 -qstrict'
    fi
-----------

Error running the code:


****************************************************************
*   ATOM       MASS   RAGGIO NLCC              PSEUDOPOTENTIAL *
*      C    12.0112   1.2000   NO   GOEDECKER      S  NONLOCAL *
*                                                  P  NONLOCAL *
*                                                  D     LOCAL *
*      H     1.0080   1.2000   NO                  S     LOCAL *
*     Si    28.0860   1.2000   NO   GOEDECKER      S  NONLOCAL *
*                                                  P  NONLOCAL *
*                                                  D     LOCAL *
****************************************************************


PARAPARAPARAPARAPARAPARAPARAPARAPARAPARAPARAPARAPARAPARAPARAPARA
 NCPU     NGW     NHG  PLANES  GXRAYS  HXRAYS ORBITALS Z-PLANES
    0    4570   36483       5     233     929       8       1
    1    4569   36527       5     234     930       8       1
    2    4567   36529       5     234     930       8       1
    3    4567   36531       5     234     930       8       1
    4    4567   36539       5     234     930       8       1
    5    4569   36547       5     234     930       8       1
    6    4569   36533       5     234     930       9       1
    7    4569   36533       5     234     930       8       1
    8    4569   36527       5     234     930       8       1
    9    4569   36533       5     234     930       8       1
   10    4567   36537       5     234     930       8       1
   11    4567   36535       5     234     930       8       1
   12    4565   36543       5     234     930       8       1
   13    4565   36541       5     234     930       8       1
   14    4565   36537       5     234     930       8       1
   15    4569   36531       5     234     930       8       1
   16    4570   36537       5     232     930       8       1
   17    4570   36558       5     232     932       8       1
   18    4572   36550       5     232     932       9       1
   19    4572   36526       5     232     932       8       1
   20    4574   36538       5     232     932       8       1
   21    4572   36538       5     232     932       8       1
   22    4568   36544       5     232     932       8       1
   23    4566   36538       5     232     932       8       1
               G=0 COMPONENT ON PROCESSOR :     0
PARAPARAPARAPARAPARAPARAPARAPARAPARAPARAPARAPARAPARAPARAPARAPARA

***    LOADPA| SIZE OF THE PROGRAM IS   11164/  95108 kBYTES ***

OPENMPOPENMPOPENMPOPENMPOPENMPOPENMPOPENMPOPENMPOPENMPOPENMPOPEN
NUMBER OF CPUS PER TASK                                        1
OPENMPOPENMPOPENMPOPENMPOPENMPOPENMPOPENMPOPENMPOPENMPOPENMPOPEN

***     RGGEN| SIZE OF THE PROGRAM IS   12296/  96240 kBYTES ***

************************** SUPERCELL ***************************
SYMMETRY:                                           ORTHORHOMBIC
LATTICE CONSTANT(a.u.):                                 29.23690
CELL DIMENSION:  29.2369  1.0000  2.0533   .0000   .0000   .0000
VOLUME(OMEGA IN BOHR^3):                             51315.70009
LATTICE VECTOR A1(BOHR):           29.2369      .0000      .0000
LATTICE VECTOR A2(BOHR):             .0000    29.2369      .0000
LATTICE VECTOR A3(BOHR):             .0000      .0000    60.0327
RECIP. LAT. VEC. B1(2Pi/BOHR):       .0342      .0000      .0000
RECIP. LAT. VEC. B2(2Pi/BOHR):       .0000      .0342      .0000
RECIP. LAT. VEC. B3(2Pi/BOHR):       .0000      .0000      .0167
REAL SPACE MESH:                   120          120          252
WAVEFUNCTION CUTOFF(RYDBERG):                           40.00000
DENSITY CUTOFF(RYDBERG):          (DUAL= 4.00)         160.00000
NUMBER OF PLANE WAVES FOR WAVEFUNCTION CUTOFF:            109647
NUMBER OF PLANE WAVES FOR DENSITY CUTOFF:                 876835
****************************************************************

***  RINFORCE| SIZE OF THE PROGRAM IS   16856/ 101200 kBYTES ***
***    FFTPRP| SIZE OF THE PROGRAM IS   20940/ 104392 kBYTES ***

GENERATE ATOMIC BASIS SET
     C        SLATER ORBITALS
       2S        ALPHA=   1.6083      OCCUPATION= 2.00
       2P        ALPHA=   1.5679      OCCUPATION= 2.00
     H        SLATER ORBITALS
       1S        ALPHA=   1.0000      OCCUPATION= 1.00
    Si        SLATER ORBITALS
       3S        ALPHA=   1.6344      OCCUPATION= 2.00
       3P        ALPHA=   1.4284      OCCUPATION= 2.00


INITIALIZATION TIME:                                2.22 SECONDS

***    WFOPTS| SIZE OF THE PROGRAM IS   26228/ 168252 kBYTES ***
***     PHFAC| SIZE OF THE PROGRAM IS   26492/ 197268 kBYTES ***
***    ATOMWF| SIZE OF THE PROGRAM IS   37528/ 227624 kBYTES ***
ATRHO| CHARGE(R-SPACE):  388.000000      (G-SPACE):  388.000000

DSYGV : 2538-2015
The number of elements (ARG NO. 11) in a work array must be greater than
or equal to (2124173152).

DSYGV : 2538-2005
The size of the leading dimension (ARG NO. 5) of an array must be greater
than zero.

DSYGV : 2538-2006
The number of rows (ARG NO. 9) of a matrix must be less than or equal
to the size of the leading dimension (ARG NO. 5) of its array.

DSYGV : 2538-2006
The number of rows (ARG NO. 9) of a matrix must be less than or equal
to the size of the leading dimension (ARG NO. 8) of its array.

DSYGV : 2538-2099
End of input argument error reporting. For more information, refer to
Engineering and Scientific Subroutine Library Guide and Reference
(SA22-7904).

DSYGV : 2538-2604
Execution terminating due to error count for error number 2099.

DSYGV : 2538-2605
Message summary:      2005     - 1

DSYGV : 2538-2605
Message summary:      2006     - 2

DSYGV : 2538-2605
Message summary:      2015     - 1

DSYGV : 2538-2605
Message summary:      2099     - 1


best,
joseph hargitai


j
_______________________________________________
CPMD-list mailing list
CPMD-list at cpmd.org
http://cpmd.org/mailman/listinfo/cpmd-list
_______________________________________________
CPMD-list mailing list
CPMD-list at cpmd.org
http://cpmd.org/mailman/listinfo/cpmd-list
_______________________________________________
CPMD-list mailing list
CPMD-list at cpmd.org
http://cpmd.org/mailman/listinfo/cpmd-list

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://cpmd.org/pipermail/cpmd-list/attachments/20061208/44734a8d/attachment.html 


More information about the CPMD-list mailing list