Reducing execution times of Quantum ESPRESSO PWscf calculations by routing matrix multiplication calls to GPUs
We report significant decreases in execution times of plane-wave self consistent field (PWscf) calculations on the software suite Quantum ESPRESSO (QE) by routing general matrix multiplication calls to graphical processing units. The approach described here does not require source code modifications nor recompilation. Running the standard benchmark test AUSURF112 on a single 2.0 Ghz core with two threads showed that using two commodity GPU cards can halve PWscf execution times.
By submitting their manuscript to the Samahang Pisika ng Pilipinas (SPP) for consideration, the Authors warrant that their work is original, does not infringe on existing copyrights, and is not under active consideration for publication elsewhere.
Upon acceptance of their manuscript, the Authors further agree to grant SPP the non-exclusive, worldwide, and royalty-free rights to record, edit, copy, reproduce, publish, distribute, and use all or part of the manuscript for any purpose, in any media now existing or developed in the future, either individually or as part of a collection.
All other associated economic and moral rights as granted by the Intellectual Property Code of the Philippines are maintained by the Authors.