Probing optimisation in physics-informed neural networks
Nayara Fonseca, Veronica Guidetti, et al.
ICLR 2023
In this article, we present the algorithmic adaptation and code re-engineering required for porting highly successful and popular planewave codes to next-generation heterogeneous OpenPOWER architectures that foster acceleration and high bandwidth links to GPUs. Here we focus on CPMD as the most representative software for ab initio molecular dynamics simulations. We have ported the construction of the electronic density, the application of the potential to the wavefunctions and the orthogonalization procedure to the GPU. The different GPU kernels consist mainly of fast Fourier transforms (FFT) and basic linear algebra operations (BLAS). The performance of the new implementation obtained on Firestone (POWER8/Tesla) is discussed. We show that the communication between the host and the GPU contributes a large fraction of the total run time. We expect a strong attenuation of the communication bottleneck when the NVLink high-speed interconnect will be available.
Nayara Fonseca, Veronica Guidetti, et al.
ICLR 2023
Shantanu Mishra, Manuel Vilas-Varela, et al.
ACS Nano
Leo Gross, Fabian Paschke, et al.
DPG Spring Meeting 2025
Alexandre Choquette, Agustin Di Paolo, et al.
APS March Meeting 2020