
Evaluating asynchronous Schwarz solvers on GPUs

Nayak, Pratik; Cojean, Terry; Anzt, Hartwig


With the start of the exascale computing era, most leadership supercomputers are heterogeneous and massively parallel even within a single node, combining multiple CPU cores with multiple co-processors such as GPUs. For example, each node of ORNL's Summit has six NVIDIA Tesla V100 GPUs and 42 IBM Power9 cores. Synchronizing across all these compute resources within a single node, or across multiple nodes, is prohibitively expensive. Hence it is necessary to develop and study asynchronous algorithms that avoid the bulk-synchronous bottleneck in massively parallel computing. In this study, we examine an asynchronous version of the abstract Restricted Additive Schwarz method as a solver: we do not synchronize explicitly, but allow the communication of data between sub-domains to be completely asynchronous, thereby removing the bulk-synchronous nature of the algorithm.
We accomplish this by using the one-sided RMA functions of the MPI standard. We study the benefits of using such an asynchronous solver over its synchronous counterpart on both multi-core architectures and multiple GPUs. ...
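The difference between the synchronous and asynchronous variants described in the abstract can be illustrated with a small model problem. The sketch below is not the authors' MPI/GPU implementation: it is a pure-Python emulation, assuming a 1D Poisson problem (-u'' = 1 with zero boundary values) split into two overlapping sub-domains. A seeded random update order stands in for one-sided communication, in the sense that each sub-domain reads whatever halo value is currently stored, fresh or stale. All function names (`solve_tridiag`, `local_solve`, `sync_ras`, `async_ras`) and the sub-domain layout are hypothetical.

```python
import random


def solve_tridiag(rhs):
    """Thomas algorithm for the system tridiag(-1, 2, -1) u = rhs."""
    n = len(rhs)
    c = [0.0] * n  # modified upper diagonal
    d = [0.0] * n  # modified right-hand side
    c[0] = -0.5
    d[0] = rhs[0] / 2.0
    for i in range(1, n):
        denom = 2.0 + c[i - 1]
        c[i] = -1.0 / denom
        d[i] = (rhs[i] + d[i - 1]) / denom
    u = [0.0] * n
    u[-1] = d[-1]
    for i in range(n - 2, -1, -1):
        u[i] = d[i] - c[i] * u[i + 1]
    return u


def local_solve(x, lo, hi, h2f):
    """Dirichlet solve on indices [lo, hi), reading halo values from x."""
    rhs = [h2f] * (hi - lo)
    if lo > 0:
        rhs[0] += x[lo - 1]   # left halo value (owned by the neighbour)
    if hi < len(x):
        rhs[-1] += x[hi]      # right halo value (owned by the neighbour)
    return solve_tridiag(rhs)


def sync_ras(n, subdomains, iters):
    """Synchronous RAS: every sub-domain reads the same previous iterate."""
    h2f = (1.0 / (n + 1)) ** 2  # h^2 * f with f = 1
    x = [0.0] * n
    for _ in range(iters):
        snapshot = list(x)  # plays the role of the global synchronization
        for lo, hi, own_lo, own_hi in subdomains:
            u = local_solve(snapshot, lo, hi, h2f)
            # Restricted update: write back only the owned (non-overlapping) part.
            x[own_lo:own_hi] = u[own_lo - lo:own_hi - lo]
    return x


def async_ras(n, subdomains, updates, seed=0):
    """Emulated asynchronous RAS: sub-domains update in random order and
    read whatever halo data is currently stored, with no barrier."""
    rng = random.Random(seed)
    h2f = (1.0 / (n + 1)) ** 2
    x = [0.0] * n
    for _ in range(updates):
        lo, hi, own_lo, own_hi = rng.choice(subdomains)
        u = local_solve(x, lo, hi, h2f)
        x[own_lo:own_hi] = u[own_lo - lo:own_hi - lo]
    return x


if __name__ == "__main__":
    n = 31
    # (lo, hi, own_lo, own_hi): overlapping range [lo, hi), owned range [own_lo, own_hi)
    subs = [(0, 20, 0, 16), (12, 31, 16, 31)]
    xs = sync_ras(n, subs, 80)
    xa = async_ras(n, subs, 200)
    print(max(abs(a - b) for a, b in zip(xs, xa)))
```

In this toy setting both variants approach the same discrete solution. In a distributed implementation, the shared list `x` would be replaced by halo buffers exposed through MPI windows and updated with one-sided operations, and the order of updates would be set by actual hardware timing instead of a random generator.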

Affiliated institution(s) at KIT: Scientific Computing Center (SCC)
Universität Karlsruhe (TH) – Central Facilities (Zentrale Einrichtungen)
Publication type: Research report/Preprint
Publication year: 2020
Language: English
Identifier: ISSN 1094-3420, 1741-2846
KITopen-ID: 1000124571
HGF program: 46.11.01 (POF III, LK 01) Computational Science and Mathematical Methods
Published online in advance: 10 August 2020
Indexed in: arXiv