KIT | KIT-Bibliothek | Impressum | Datenschutz

Scalable Distributed String Sorting

Kurpicz, Florian 1; Mehnert, Pascal ; Sanders, Peter ORCID iD icon 1; Schimek, Matthias ORCID iD icon 1; Chan, Timothy [Hrsg.]; Fischer, Johannes [Hrsg.]; Iacono, John [Hrsg.]; Herman, Grzegorz [Hrsg.]
1 Institut für Theoretische Informatik (ITI), Karlsruher Institut für Technologie (KIT)

Abstract:

String sorting is an important part of tasks such as building index data structures. Unfortunately, current string sorting algorithms do not scale to massively parallel distributed-memory machines since they either have latency (at least) proportional to the number of processors p or communicate the data a large number of times (at least logarithmic). We present practical and efficient algorithms for distributed-memory string sorting that scale to large p. Similar to state-of-the-art sorters for atomic objects, the algorithms have latency of about p^{1/k} when allowing the data to be communicated k times. Experiments indicate good scaling behavior on a wide range of inputs on up to 49152 cores. Overall, we achieve speedups of up to 4.9 over the current state-of-the-art distributed string sorting algorithms.


Verlagsausgabe §
DOI: 10.5445/IR/1000174674
Veröffentlicht am 04.10.2024
Originalveröffentlichung
DOI: 10.4230/LIPIcs.ESA.2024.83
Cover der Publikation
Zugehörige Institution(en) am KIT Institut für Theoretische Informatik (ITI)
Publikationstyp Proceedingsbeitrag
Publikationsdatum 23.09.2024
Sprache Englisch
Identifikator KITopen-ID: 1000174674
Erschienen in 32nd Annual European Symposium on Algorithms (ESA 2024), London, 2nd - 4th September 2024, Ed.: T. Chan, J. Fischer, I. Fischer, J. Grzegorz, H. Grzegorz
Veranstaltung 32nd Annual European Symposium on Algorithms (ESA 2024), London, Vereinigtes Königreich, 02.09.2024 – 04.09.2024
Verlag Schloss Dagstuhl - Leibniz-Zentrum für Informatik (LZI)
Seiten Art.-Nr.: 83
Serie Leibniz International Proceedings in Informatics (LIPIcs) ; 308
Schlagwörter sorting, strings, distributed-memory computing, distributed membership filters, scalability, Theory of computation → Sorting and searching, Theory of computation → Massively parallel algorithms, Computing methodologies → Distributed algorithms, Theory of computation → Bloom filters and hashing
KIT – Die Forschungsuniversität in der Helmholtz-Gemeinschaft
KITopen Landing Page