Due to the high efficiency of the MPI parallelization in CPMD for a small to medium number of nodes and the fact all modern MPI libraries are able to take advantage from shared memory communication, using the plain MPI parallelization is usually the fastest option. The mixed shared/distributed memory parallelization is of most use, if you run a job on a large number of SMP nodes, when the distributed memory parallelization has reached its scalability limit or the network is overloaded (see above). To learn more about the mixed parallelization scheme of CPMD consult reference [6].