KIT | KIT-Bibliothek | Impressum | Datenschutz

Optimization of single node load balancing for lattice Boltzmann method on heterogeneous high performance computers

Kummerländer, Adrian ORCID iD icon 1; Bukreev, Fedor ORCID iD icon 2; Teutscher, Dennis 2; Dorn, Marcio; Krause, Mathias J. 1,2
1 Institut für Angewandte und Numerische Mathematik (IANM), Karlsruher Institut für Technologie (KIT)
2 Institut für Mechanische Verfahrenstechnik und Mechanik (MVM), Karlsruher Institut für Technologie (KIT)

Abstract:

Lattice Boltzmann Methods (LBM) are particularly suited for highly parallel computational fluid dynamics simulations on heterogeneous HPC systems combining CPUs and GPUs. However, the computationally dominant collide-and-stream loops commonly utilize only GPUs, leaving CPU resources underutilized. To overcome this limitation, this article proposes a novel load balancing strategy based on a genetic algorithm for bottom-up, cost-aware optimization of spatial domain decompositions. This approach generates subdomains and rank assignments inherently suited for cooperative execution on both CPUs and GPUs. Implemented in the open source framework OpenLB, the strategy is applied to turbulent flow reference cases, including a multi-physics reactive mixer. A detailed evaluation on heterogeneous HPC nodes demonstrates significant performance gains, achieving speedups of up to 87% compared to traditional GPU-only execution. This work therefore establishes cost-aware, bottom-up decomposition as a suitable strategy for exploiting the native heterogeneity of modern compute nodes.


Verlagsausgabe §
DOI: 10.5445/IR/1000193856
Veröffentlicht am 03.06.2026
Originalveröffentlichung
DOI: 10.1016/j.jpdc.2025.105169
Scopus
Zitationen: 2
Cover der Publikation
Zugehörige Institution(en) am KIT Institut für Angewandte und Numerische Mathematik (IANM)
Institut für Mechanische Verfahrenstechnik und Mechanik (MVM)
Publikationstyp Zeitschriftenaufsatz
Publikationsmonat/-jahr 12.2025
Sprache Englisch
Identifikator ISSN: 0743-7315
KITopen-ID: 1000193856
Erschienen in Journal of Parallel and Distributed Computing
Verlag Elsevier
Band 206
Seiten Art.-Nr. 105169
Nachgewiesen in Scopus
OpenAlex
KIT – Die Universität in der Helmholtz-Gemeinschaft
KITopen Landing Page