KIT | KIT-Bibliothek | Impressum | Datenschutz

Fast and Space-Efficient Perfect Hashing

Lehmann, Hans-Peter ORCID iD icon 1
1 Institut für Theoretische Informatik (ITI), Karlsruher Institut für Technologie (KIT)

Abstract (englisch):

From data analytics to machine learning, large amounts of input data become more and more important. The volume of collected data grows continuously. Compact data structures enable efficient access to this data and help with processing. Perfect hashing is a fundamental basis for many compact data structures and is used for example in databases, hash tables, retrieval data structures, and approximate membership data structures. A perfect hash function (PHF) maps a set $S$ of $n$ keys to the first $m$ integers without collisions, and is called minimal perfect (MPHF) if $m=n$. It may map keys that are not in $S$ to an arbitrary value. This makes it possible to store the function without representing the set $S$ itself.

In this dissertation, we present three MPHF construction algorithms. With SIMDRecSplit, we focus on space efficiency, scratching the space lower bound of representing MPHFs. We enhance the existing construction RecSplit through parallelism on many levels - bits, vectors, multicores, and the GPU. As a base case, RecSplit uses brute-force search - here we achieve impressive speedups by replacing several retries with simple bit operations or even table lookups. ... mehr


Volltext §
DOI: 10.5445/IR/1000176432
Veröffentlicht am 20.11.2024
Cover der Publikation
Zugehörige Institution(en) am KIT Institut für Theoretische Informatik (ITI)
Publikationstyp Hochschulschrift
Publikationsdatum 20.11.2024
Sprache Englisch
Identifikator KITopen-ID: 1000176432
Verlag Karlsruher Institut für Technologie (KIT)
Umfang vii, 167 S.
Art der Arbeit Dissertation
Fakultät Fakultät für Informatik (INFORMATIK)
Institut Institut für Theoretische Informatik (ITI)
Prüfungsdatum 24.10.2024
Projektinformation ScAlBox (EU, H2020, 882500)
Referent/Betreuer Sanders, Peter
Pagh, Rasmus
KIT – Die Forschungsuniversität in der Helmholtz-Gemeinschaft
KITopen Landing Page