KIT | KIT-Bibliothek | Impressum | Datenschutz

Engineering Minimal k-Perfect Hash Functions

Hermann, Stefan ORCID iD icon 1; Kirmayer, Sebastian 2; Lehmann, Hans-Peter ORCID iD icon 1; Sanders, Peter ORCID iD icon 1; Walzer, Stefan ORCID iD icon 1; Benoit, Anne [Hrsg.]; Kaplan, Haim [Hrsg.]; Wild, Sebastian [Hrsg.]; Herman, Grzegorz [Hrsg.]
1 Institut für Theoretische Informatik (ITI), Karlsruher Institut für Technologie (KIT)
2 Karlsruher Institut für Technologie (KIT)

Abstract:

Given a set $S$ of $n$ keys, a $k$-perfect hash function (kPHF) is a data structure that maps the keys to the first m integers, where each output integer can be hit by at most k input keys. When $m = ⌈n/k⌉$, the resulting function is called a minimal k-perfect hash function (MkPHF). Applications of kPHFs can be found in external memory data structures or to create efficient 1-perfect hash functions, which in turn have a wide range of applications from databases to bioinformatics.
Several papers from the 1980s look at external memory data structures with small internal memory indexes. However, actual $k$-perfect hash functions are surprisingly rare, and the area has not seen a lot of research recently. At the same time, recent research in 1-perfect hashing shows that there is a lack of efficient kPHFs. In this paper, we revive the area of $k$-perfect hashing, presenting four new constructions. Our implementations simultaneously dominate older approaches in space consumption, construction time, and query time. We see this paper as a possible starting point of an active line of research, similar to the area of 1-perfect hashing.


Verlagsausgabe §
DOI: 10.5445/IR/1000185751
Veröffentlicht am 15.10.2025
Originalveröffentlichung
DOI: 10.4230/LIPIcs.ESA.2025.99
Scopus
Zitationen: 1
Cover der Publikation
Zugehörige Institution(en) am KIT Institut für Theoretische Informatik (ITI)
Publikationstyp Proceedingsbeitrag
Publikationsdatum 01.10.2025
Sprache Englisch
Identifikator ISSN: 1868-8969
KITopen-ID: 1000185751
HGF-Programm 46.21.02 (POF IV, LK 01) Cross-Domain ATMLs and Research Groups
Erschienen in 33rd Annual European Symposium on Algorithms (ESA 2025), Warschau, 15th-17th September 2025
Veranstaltung 33rd Annual European Symposium on Algorithms (ESA 2025), Warschau, Polen, 15.09.2025 – 17.09.2025
Verlag Schloss Dagstuhl - Leibniz-Zentrum für Informatik (LZI)
Serie Leibniz International Proceedings in Informatics (LIPIcs) ; 356
Schlagwörter Compressed Data Structures, Perfect Hashing, Theory of computation → Data compression, Theory of computation → Bloom filters and hashing, Information systems → Point lookups
Nachgewiesen in Scopus
OpenAlex
Globale Ziele für nachhaltige Entwicklung Ziel 7 – Bezahlbare und saubere EnergieZiel 9 – Industrie, Innovation und InfrastrukturZiel 17 – Partnerschaften zur Erreichung der Ziele
KIT – Die Universität in der Helmholtz-Gemeinschaft
KITopen Landing Page