[CPMD-list] ovlap2_c
Juerg Hutter
hutter at pci.unizh.ch
Fri May 30 10:50:45 CEST 2003
Hi Thierry
usually xGEMM routines perform at the highest possible
rate on all computers. However, there might be the following
problems.
A bad implementation of ZGEMM on this computer. You could test
this by comparing to the DGEMM performance.
The other possibility is that in the FEMD code you call
ZGEMM very often with small dimensions. The VPP 5000 is
a vector machine that might not achieve peak performance
in this case.
A possible solution in this case would be to add special
code for N=1,2,... . There is already a special
code for the case of Matrix*Vector multiplication instead
of Matrix*Matrix (N=1).
greetings
Juerg
----------------------------------------------------------
Juerg Hutter Phone : ++41 1 635 4491
Physical Chemistry Institute FAX : ++41 1 635 6838
University of Zurich E-mail: hutter at pci.unizh.ch
Winterthurerstrasse 190
CH-8057 Zurich, Switzerland
----------------------------------------------------------
On Tue, 27 May 2003, Thierry Deutsch wrote:
> Hello,
>
> For an aluminum interface (174 atoms, cutoff of 10Ry 310 states, free energy
> functional), I have the following behavior on a Fujitsu VPP 5000:
>
> SUBROUTINE CALLS CPU TIME ELAPSED TIME
> OVLAP2_C 9060 30066.36 30067.66
> EHPSI_C 8457 9696.90 9740.52
> FFT-G/S 3038922 4757.54 4842.34
> FRIESNER_C 240 4010.71 4021.63
> S_FWFFT 725161 3545.42 3609.48
> S_INVFFT 794300 3022.41 3091.97
> OVLAP_H 881 2686.38 2686.41
> CALC_BILN 8457 2379.40 2379.68
> EVPSI 515741 1011.01 1033.20
> RGS_C 639 638.15 638.18
> VBETA 240 526.16 538.59
>
> You can see that ovalp2_C is used a lot.
> In this routine ZGEMM is used. I think it is not possible to optimize this
> routine.
> May I wrong?
>
> Au revoir,
>
> Thierry
> _______________________________________________
> CPMD-list mailing list
> CPMD-list at cpmd.org
> http://www.cpmd.org/mailman/listinfo/cpmd-list
>
More information about the CPMD-list
mailing list